Principal Site Reliability Engineer
Own Reliability at Scale
Lead design, implementation, and evolution of reliability, availability, and resiliency strategies for large‑scale distributed systems written primarily in Java
Apply deep experience operating complex, distributed systems to guide architectural decisions, reliability strategies, and long‑term system evolution
Identify systemic risks in application architecture, data flows, and infrastructure, and drive architectural improvements that measurably improve availability, performance, and scalability
Set and evolve reliability standards, best practices, and operational principles across R&D
Apply advanced software engineering practices to eliminate manual work, reduce operational load, and improve system observability
Design and build internal platforms, automation, and tooling that support Java‑based services and their operational needs
Contribute to longer‑term reliability and infrastructure strategy aligned with business growth

This role is open only to US citizens and Green Card holders due to ITAR requirements.
Ability to commute to the Seaport Boston office 2-3 days a week.
7+ years of experience in software engineering, site reliability engineering, or systems engineering roles
Extremely strong proficiency with the Java programming language and its ecosystem, including building, debugging, and operating production Java services
Deep experience operating complex, distributed systems in production environments
Strong software engineering background, with a track record of delivering high‑quality, maintainable code
Ability to reason about failure modes across application, data, and infrastructure layers
Demonstrated ability to lead complex initiatives that span teams and organizational boundaries
Comfortable making high‑impact technical decisions in ambiguous environments
Strong communicator who can influence design and operational decisions across a wide range of stakeholders
Experience operating or supporting systems using technologies such as MongoDB, ZooKeeper, and RabbitMQ
Background in performance tuning and scalability optimization of Java services
Experience setting or influencing engineering standards at the organization level
Prior involvement in evolving SRE or platform practices in a growing engineering organization
Experience designing, operating, or scaling systems in cloud environments such as AWS (preferred), including familiarity with core services, networking models, and reliability features