Description
The Infrastructure Operations (Data Center) Team is the backbone of AWS, supporting the rapidly growing AWS business and customers 24/7. We are committed to maintain the physical infrastructure of AWS, ensuring the standards for operational performance in the areas of safety, security, availability, productivity, capacity, efficiency, and cost.
We are looking for a Data Center Engineering Operations (DCEO) Engineer with experience in critical facilities management, and a result-driven individual with strong technical understanding and the drive and vision to take our data center engineering operations to the next level. The role will be report to the DCEO Manager and responsible of sustaining availability, cost management, risk assessment and mitigation, review corrective and preventative maintenance of critical infrastructure and metric reporting.
Key job responsibilities
The Data Center Engineering Operation Engineer is responsible for ensuring that all electrical, mechanical, and fire/life safety equipment within the data center is operating within contract parameters within facilities. Often this will be including risk management and mitigation, corrective and preventative maintenance of critical infrastructure, vendor management and metric reporting.
• Responsible for the on-site management of contractors, sub-contractors and vendors, ensuring that all work performed is in accordance with established practices, procedures & local legislation.
• Establish performance benchmarks, conduct analyses, and prepare reports on all aspects of the data center facility infrastructure operations and maintenance.
• Generate change management requests & incident management tickets for DCEO activities.
• Work with DCO managers (IT) and other business leaders and operating partners to coordinate projects, manage capacity, and optimize plant safety, performance, reliability, sustainability and efficiency.
• Establish documentation relevant to technical support of business & facility operations.
• Responsible for supporting the installation of the racks and the provision of power/cooling.
• Support the COLO management of both routine maintenance and emergency services on a variety of essential systems such as: switchgear, generators, UPS systems, power distribution equipment, chillers, cooling towers, computer room air handlers, building monitoring systems, etc.
• Assist in the design, implementation, commissioning and build out of new facilities.
• Drive & implement projects to increase current facility capacity, efficiency, sustainability & reliability.
• Assist in recruiting efforts.
• Support operating partners in the resolution of any infrastructure engineering issues
A day in the life
In day today scale, you will be involved in:
• Assist in troubleshooting of facility and rack-level events within internal Service Level Agreements (SLA).
• Perform rack installs, rack decommissioning, and facility management.
• Provide operational readings and key performance indicators to make sure uptime is maintained
• Responsible for the on-site management of contractors, sub-contractors and vendors, ensuring that all work performed is in accordance with established practices, procedures & local legislation.
• Performance and oversight of maintenance and operations on all electrical, mechanical, and fire/life safety equipment within the data center.
• Work schedule changes depending on specific site needs. Shifts can be up to 12-hours and may rotate on a predefined schedule. Some locations have on-call rotations
We are open to hiring candidates to work out of one of the following locations:
Langfang, CHN
Basic Qualifications
• 4+ years of relevant work experience in maintaining a DC or Critical space facility.
• Strong verbal and written communication skills.
• Strong Facilities Management skills.
• Ability to prioritize in complex, fast-paced environment.
Preferred Qualifications
• An excellent understanding of the electrical and mechanical systems used in a data center environment, including but not limited to DRUPS, Transformers, Generators, Switchgear, UPS systems, ATS/STS units, PDUs, Chillers, AHUs and CRAC units.
• Mission Critical facility management experience for a large enterprise or large Colocation provider.
• Experience in management of vendors/contractors performing construction, maintenance and upgrading works in large-scale critical environment.
• Data center Project Management/ capacity planning and budgetary experience.
• Knowledge in IT services such as servers, network, cabling and platform .
• Appropriate Security and Safety awareness.