Home
/
Comprehensive
/
Technical Program Manager, SRE - Remote
Technical Program Manager, SRE - Remote-November 2024
Remote
Nov 23, 2024
ABOUT EPAM SYSTEMS
EPAM is a leading global provider of product development and software engineering solutions.
10,000+ employees
Consulting, Technology
VIEW COMPANY PROFILE >>
About Technical Program Manager, SRE - Remote

  If you are looking for a high-impact Site Reliability role with a global leader in digital transformation, EPAM is the perfect next step in your career! As an EPAMer, you’ll have the opportunity to work with a supportive team, on a variety of interesting projects for some of the biggest brands in the world. Are you ready for the next step in your career journey? Apply now!

  Req.#578148855

  RESPONSIBILITIES

  Function as program management point person for L2 support, ensuring that the efforts of the team are smooth and working as expected; create a program to drive value through SRE activities that reduce the need for L2 support over time

  Lead development teams through architectural reviews and recommendations

  Define what it means for a service to be available and develop, monitor, and alert on SLIs/SLOs

  Define, track, and enforce error budgets

  Review code instrumentation with development teams and ensure necessary dashboards are created to monitor SLI/SLO/SLAs

  Establish, test, and tune alerting for varying tiers of applications

  Document and maintain runbooks and procedures, automate as much as possible

  Plan and execute periodic Disaster Recovery exercises including both tabletop and simulated failures (fault injection)

  Perform periodic load and scalability testing to establish baselines, drift, and capacity planning

  Design and implement peak readiness reviews for anticipated high-volume times

  Lead weekly operational state reviews covering performance trends, anomalies, errors, and other availability events with SREs, product owners, and development teams

  Participate in quarterly business and operational reviews aligning on roadmaps, development velocity, efficiency, growth trends, etc

  Socialize SRE culture across teams within the organization to publicize the value of SRE, mentor and train other engineers around proactive reliability decision-making and planning

  REQUIREMENTS

  5+ years of SRE Engineering experience

  3+ years as team lead or SRE champion

  Experience functioning as program management point person for L2 support

  Experience creating a program that drives value through SRE activities to reduce the need for L2 support over time

  Bachelor's degree in Computer Science, similar technical field of study, or equivalent practical experience

  Proven experience in troubleshooting, mitigating, and resolving issues in a distributed system

  Strong communication and collaboration skills for varying groups of stakeholders

  Be self-motivated and can prioritize effectively between competing priorities

  Experience with implementing SRE practices for services and applications deployed in production in the cloud

  Must understand most SRE concepts, including SLI/SLO/SLA, Error Budget, MTTD/MTTR/MTBF, Toil, Capacity Planning, Observability, Monitoring/Alerting, Release Engineering, and Incident Management/Blameless Post-Mortems

  BENEFITS

  Extended Healthcare with Prescription Drugs, Dental and Vision Insurance, and Healthcare Spending Account (Company Paid)

  Maternity/Parental/Adoption Leave Top-up

  Life and AD&D Insurance (Company Paid)

  Employee Assistance Program (Company Paid)

  Long-Term Disability

  Registered Retirement Savings Plan (RRSP) with company match

  Paid Time Off

  Critical Illness Insurance

  Employee Stock Purchase Program

  Employee Discounts

  Unlimited access to LinkedIn learning solutions

  ABOUT EPAM

  EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potentialEPAM Systems, Inc. is an equal opportunity employer. We recognize the value of diversity and inclusion in creating success for our customers, business partners, shareholders, employees and communities. We are committed to recruiting, hiring, developing and promoting employees without discrimination. As a global employer, this commitment includes complying with all laws in the countries in which we operate. Nevertheless, we believe equal employment practices should not be limited to what the law requires. Equal opportunity and inclusion are essential to motivate, empower and recognize the best in everyone.

  At EPAM, employment actions are based on individual qualifications, without regard to race, color, religion, creed, gender, pregnancy status, sexual orientation, gender identity, gender expression, marital or familial status, national origin, ancestry, genetics, age, disability status, veteran status, citizenship status when otherwise legally able to work, or any other characteristic protected by law.

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Civil Engineer - Transportation/Traffic
Overview Kimley-Horn's Charlottesville, Va (Virginia) office is seeking a Civil Engineer with 6+ years of experience to join their Transportation Planning and Traffic team! This is not a remote posit
Night Manager - Hyatt Regency Qindgao
Description: You will be responsible to assist with the efficient running of the department in line with Hyatt International's Corporate Strategies and brand standards, whilst meeting employee, guest
Network Developer 3
Job Description Supports the design, deployment, and operations of a large-scale global Oracle cloud computing environment (Oracle Cloud Infrastructure - OCI). Primarily focused on development and su
Global Banking and Markets - Operations - Equities Franchise
Title- Equities Franchise - AnalystLocation- Salt Lake City, UTDuration- 6 MonthsPay Rate- $ 20/hr JOB SUMMARY AND RESPONSIBILITIES* Draft OTC derivative trade confirmations (in equity derivatives.)*
ANIMAL TRANSPORT DRIVER
This position will be responsible to drive safely to scheduled sow farms for loading weaned and / or culled pigs; coordination of safe loading and accurate counts; delivery to local grower farms; saf
Manager, Partnership Management, FLEX Partnerships
Manager, Partnership Management, FLEX Partnerships - 2306157607W Description Johnson & Johnson Innovative Medicine., a division of Johnson & Johnson's Family of Companies, is recruiting for a
2nd Shift Medium Voltage Assembler
Powell People Solving Tough Problems As a top player in Switchgear and Electrical Enclosure (PCR) Manufacturing and field servicing, a career with us will challenge you to solve some very tough probl
PUBLIC HEALTH NURSING MANAGER
PUBLIC HEALTH NURSING MANAGER Print (https://www.governmentjobs.com/careers/edcgov/jobs/newprint/4355655) Apply  PUBLIC HEALTH NURSING MANAGER Salary $117,457.60 - $142,771.20 Annually Location Plac
Cloud Account Executive - Oracle Government Defense and Intelligence (GDI)
Job Description Oracle GDI - Government Defense & Intelligence is looking for a Cloud Account Executive to be responsible for Government accounts with a focus on the National Reconnaissance Offic
Senior Technical Consultant, Appdev
As a Consultant in our Industry Solutions Delivery team, you will regularly interact with our customers as you consult and guide their teams through largescale and complex migrations to our cloud pla
Copyright 2023-2024 - www.zdrecruit.com All Rights Reserved