Site Reliability Engineer II
Our Opportunity:
Site Reliability Engineers – Observability Team are a cross-functional group of systems and software engineers responsible for the operational aspects of Chewy’s e-commerce platform. The team designs, builds, and maintains Chewy’s observability platform—covering metrics, logging, and tracing—while also supporting the infrastructure behind both internet-facing and internal services.
We’re looking for engineers who want to contribute to developing infrastructure software, maintaining it, and scaling Chewy’s technology stack. Come help us build a bigger and better Chewy as a Site Reliability Engineer! You will be part of a small team with a huge impact on our incredible growth. Ideal candidates can clearly communicate complex technical concepts with diverse audiences across the organization. They remain calm under pressure and bring structure to high-pressure, fast-paced tasks and projects.
What You’ll Do:
- Experience coding in one or more programming languages (e.g., Java, Node.js, or Python) with a solid foundation in software design
- Hands-on experience with OpenTelemetry collector, Datadog, and Dynatrace integrations, with familiarity in creating metrics using StatsD and Prometheus
- Experience and familiarity with FluentBit/Fluentd log pipelines, ensuring scalable and reliable log processing
- Worked on Jenkins release processes and Kubernetes running the apps under the guidance of senior engineers
- Strong understanding of monitoring, logging, and tracing data to improve engineering teams’ ability to optimize customer-facing services
- Identify requirements for other operational teams (release engineering, automation, etc.) during the application development phase
- Act as a technology and DevOps engineer to improve the automation areas on the observability platforms
- Participate in the on-call rotation for Level 1-2 support critical issues
What You’ll Need:
- Bachelor’s degree + 4 years of experience, or master's degree + 2 years of experience
- Hands-on experience developing coding skills in Java, Node.js, and infrastructure scripting (e.g., Terraform) for automation and observability enhancements
- Minimum 2+ years of experience building and managing applications in public cloud platforms such as AWS (preferred) or GCP
- Experience working with the open-source community (e.g., troubleshooting, patch submission)
- Strong ability to organize, troubleshoot, and continuously learn
Bonus (if applicable):
- Deep expertise in Datadog / Dynatrace / Splunk or any open source eco system environments
Recommended Jobs
Medical Assistant
Our busy Dermatology practice is seeking an experienced Medical Assistant to join our team! Job Description Utilize Electronic Medical Record system Assists physicians by performing a…
Engineer: Manufacturing
Job Description Job Description Injection Molding Engineer Nanobiosym is on the cusp of rewriting the rules of personalized medicine, novel technologies, and healthcare delivery. Founded by a …
FOOD UNIT LEAD (FULL TIME)
We are hiring immediately for a full time FOOD UNIT LEAD position. Location: St. Vincent Hospital - 123 Summer Street, Worcester, MA 01608. Note: online applications accepted only. Schedule…
Finance and Executive Assistant
Electro Switch headquarters, located in Weymouth MA, is seeking an experienced Finance and Executive Assistant to support the executive team at our main office. This position will have routine financ…
Member of Technical Staff - Machine Learning Engineer, Inference (Pytorch)
Liquid AI, an MIT spin-off, is a foundation model company headquartered in Boston, Massachusetts. Our mission is to build capable and efficient general-purpose AI systems at every scale. Our goal …
Non-CDL DRIVERS (Class E)
**ONSITE JOB OFFERS!!!** Hiring Drivers We are the largest independently owned local residential moving and storage company. We pride ourselves on taking care of our customers and …
Automotive Body Shop Technician
Automotive Body Shop Technician/Paint Techs Mirak Automotive Group has been family owned and operated in Arlington MA since 1936. Our quickly growing collision center is looking to add more automot…
Welder/Mechanic
The Massachusetts Municipal Wholesale Electric Company (MMWEC) brings a competitive edge to Massachusetts municipal utilities dedicated to providing their customers with low-cost and reliable electri…
Senior software development engineer project lead
At Sonos we want to create the ultimate listening experience for our customers and know that it starts by listening to each other. As part of the Sonos team, you’ll collaborate with people of all sty…
Child Watch Staff - Evenings and Weekends
Job Description Job Description Description This position is primarily responsible for representing the YMCA personally, professionally and in a manner in accordance with the mission and goals o…