Data Center Production Operations Engineer-January 2024
Clonee
Jan 22, 2025
ABOUT META
We’re building a team as diverse as the communities and billions of people we serve every day. Our teammates don’t need to conform here. Lived experiences are an asset, and we value your unique perspe
10,000+ employees
Social Media, Technology
About Data Center Production Operations Engineer

  Summary:

  Meta is seeking a forward-thinking, experienced engineer to join the Production Operations Engineering team within Infra Data Centers. Our data centers, and the hundreds of thousands of servers installed in them, are the foundation upon which our rapidly scaling infrastructure efficiently operates and upon which our innovative services are delivered. Meta is at the leading edge of the global data center industry in both how data centers are designed and how they are operated. This person should enjoy working in a fast-paced, technical environment where adaptability and flexibility will be key to their success. We seek an IT professional with advanced, hands-on technical skills in server hardware and Linux (ideally in a data center environment). Extensive knowledge of server administration and experience delivering complex projects in a large-scale distributed data center environment are core competencies for this role. The candidate should also have deep knowledge and experience in at least one of the following core areas: Hardware, OS Repair, Tooling and Automation, and Project Management.

  Data Center Production Operations Engineer Responsibilities:

  Perform deep dives and analyze complex technical issues within the data center, ranging from automated tooling to hardware failures, Linux OS, and network issues.

  Work as a subject matter expert with cross-functional teams on large-scale data center projects and initiatives.

  Provide support across data centers, identify potentially larger issues, and communicate effectively when one is identified.

  Work with internal hardware teams and vendors to help drive complex technical issues to resolution, take ownership of ensuring high levels of hardware quality, and influence future design to ensure ease of serviceability.

  Solve issues at scale using scripting, automation, and tooling.

  Use data to drive maximum server fleet uptime and utilization rates by understanding hardware failure rates and customer SLAs. Identify trends and systemic issues in the fleet and drive resolution.

  Coach and mentor team members to evaluate and identify better ways to resolve issues and define updates to tools and processes.

  Provide mentorship and be the go-to technical resource for management.

  Build cross-functional relationships and influence policies and procedures to improve global data center operations.

  Participate in an on-call rotation.

  Use our ticketing system daily to support servers that are unavailable and need to be returned to capacity.

  Minimum Qualifications:

  BS, BA, or BEng in a technical field or commensurate experience.

  5+ years of infrastructure or related experience.

  Knowledge of Linux and hardware systems support in an Internet operations environment.

  Experience managing multiple technical issues concurrently.

  Knowledge of the interdependencies of data center functions and technologies including electrical, cooling, structured cabling, security, network and server systems.

  Knowledge of out-of-band/lights-out server communication methods, such as IPMI and serial console.

  Time and project management experience.

  Experience modifying and developing code in commonly used scripting or programming languages.

  Solid oral and written communication skills. English as a working language.

  Preferred Qualifications:

  Experience in providing technical guidance to external vendors.

  Experience in a large-scale data center environment.

  Experience with large-scale GPU-based systems.

  Experience debugging, modifying, and developing code in commonly used scripting or programming languages, including Bash, PHP, Python, SQL, or Perl.

  Industry: Internet
