Data center facility operations engineer
Summary:
Meta was built to help people connect and share, and over the last decade, our tools have played a critical part in changing how people around the world communicate with one another. With over two billion people using the service and hundreds of offices around the globe, a career at Meta offers countless ways to make an impact in a fast-growing organization. Our Data Centers are the foundation upon which our rapidly scaling infrastructure efficiently operates to deliver our advanced services.Meta is seeking an experienced and self-driven Reliability Lead to join our Asset Management & Reliability team within Facility Operations. This person will work at the leading edge of Facility Operations to identify and manage asset reliability risks and various stages of end-to-end asset lifecycle for the Data Center Operations. Managing stakeholders spread across time zones is a significant challenge and key to the success of our individual projects and overall asset management, quality and reliability program.
Required Skills:
Data Center Facility Operations Reliability Engineer Responsibilities:
Prevent operational gaps in reliability engineering expertise across all asset management activities
Proactively review, identify, and mitigate risks of equipment failures, unscheduled downtime, and reactive maintenance
Ensure all new assets are methodically and consistently onboarded into Meta's asset management ecosystem.Maintain rigorous asset onboarding processes to enable accurate tracking and seamless integration into maintenance programs
Establish and maintain a robust asset criticality framework to prioritize resources and mitigate risk
Lead Failure Mode and Effects Analysis (FMEA) to predict failure modes, prioritize risks, and develop preventive actions. Develop and execute Reliability Centered Maintenance (RCM) programs to balance cost, risk, and performance
Assess operational risks associated with asset failures, maintenance strategies, and process deviations
Develop, maintain, and update the Global Maintenance Library of plans, procedures, and best practices
Govern the review and implementation of changes to maintenance strategies and procedures
Ensure all maintenance changes are data-driven, risk assessed, and systematically implemented
Support accurate accounting of asset depreciation and amortization through timely asset tracking
Serve as a subject matter expert and technical lead for Enterprise Asset Management (EAM) implementation and optimization
Create and maintain asset useful life models to forecast replacement needs and optimize total cost of ownership
Provide technical leadership for condition-based, time-based, and specialized reliability maintenance initiatives
Analyze asset health metrics and KPIs to identify risks, predict failures, and measure reliability improvements
Collaborate with Operations and Maintenance to optimize scheduling and execution of maintenance activities
Mentor staff in reliability methodologies and foster a environment of proactive asset management
Sustain continuous improvement of asset management workstreams and processes
25% to 50% travel domestically and internationally
Minimum Qualifications:
Minimum Qualifications:
Bachelor's degree in Mechanical, Electrical Reliability Engineering or similar technical discipline
10+ years of experience in reliability engineering (related to electrical or mechanical cooling equipment)
Experienced in Reliability Centered Maintenance (RCM) and Failure Maintenance Effect Analysis (FMEA) activities for maintenance /process/equipment design optimization to meet reliability requirements
Proficient in usage of EAM solutions to extract data and develop meaningful insights
Certifications in Maintenance & Reliability such as CMRP, CRL, CRE
Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)
Experience with Program/Project management and cross-functional team management
Preferred Qualifications:
Preferred Qualifications:
Experience with data center equipment such as critical cooling systems, generators, main switchboards, network gear
Proficient in data analysis techniques that can include Process Control, Reliability modeling and prediction, Fault Tree Analysis, Weibull Tree Analysis, Six Sigma (6s) Methodology
Proficient in developing and executing test plans for assets
Certifications in Maintenance & Reliability such as CMRP, CRL, CRE
Knowledgeable of relevant ISO standards (ISO 14224, ISO 17359, ISO 55000)
Public Compensation:
$133,000/year to $190,000/year + bonus + equity + benefits
Industry: Internet
Equal Opportunity:
Meta is proud to be an Equal Employment Opportunity and Affirmative Action employer. We do not discriminate based upon race, religion, color, national origin, sex (including pregnancy, childbirth, or related medical conditions), sexual orientation, gender, gender identity, gender expression, transgender status, sexual stereotypes, age, status as a protected veteran, status as an individual with a disability, or other applicable legally protected characteristics. We also consider qualified applicants with criminal histories, consistent with applicable federal, state and local law. Meta participates in the E-Verify program in certain locations, as required by law. Please note that Meta may leverage artificial intelligence and machine learning technologies in connection with applications for employment.
Meta is committed to providing reasonable accommodations for candidates with disabilities in our recruiting process. If you need any assistance or accommodations due to a disability, please let us know at [email protected].
Recommended Jobs
Clinician
Job Description Job Description Join Our Team as a Clinician – Make a Lasting Impact on Families in Need! Are you a passionate, licensed clinician ready to bring your skills to a rewarding outr…
Administrative lead
The Harvard-Radcliffe Collegium Musicum Foundation (HRCMF), the alumni association of Harvard’s nationally acclaimed mixed-voice choir, is seeking an Administrative Lead to join us as an independ…
AI Engineer Intern (Summer 2026)
At Klaviyo, we value the unique backgrounds, experiences and perspectives each Klaviyo (we call ourselves Klaviyos) brings to our workplace each and every day. We believe everyone deserves a fair sho…
Busser
For this position, pay will be variable by location - plus tips. Our Winning Family Starts With You! Check out these great benefits! ~ Flexible schedules to help you balance other life commitme…
Resident Care Director
Do you want to be a part of an organization that is committed to delivering best-in-class results? At EPOCH Senior Living we strive to provide exceptional senior care, while creating an environment …
Non CDL Bus Operator
Franklin Transit Management (FTM) is currently seeking part-time and full-time Demand Response bus drivers. The schedules include working on weekends and times of shifts may vary. All employees are …
Care Coordinator
Are you passionate about making a difference in people's lives through healthcare? Join New England Wellness Solutions in Hanover, MA, as a Care Coordinator. We will train you from A to Z to succeed w…
Manager, Ontology and Data Modeling
Overview Manager, Ontology and Data Modeling The role of the Manager of Ontology and Data Modeling is to develop, implement, and maintain enterprise ontologies in support of Capital One's Data…
Front Office Administrative Assistant
Front Office Administrative Assistant The Front Office Administrative Assistant is the first point of contact for the school and plays a key role in creating a welcoming environment for visitors, …
Software development engineer
Lensa is a career site that helps job seekers find great jobs in the US. We are not a staffing firm or agency. Lensa does not hire directly for these jobs, but promotes jobs on LinkedIn on behalf of …