Senior Director, Reliability Tools and Practices - November 2024
Chevy Chase
Nov 23, 2024
ABOUT GEICO
With a range of policy options, GEICO provides affordable insurance for millions of customers across the United States.
10,000+ employees
Insurance, Client Services
About Senior Director, Reliability Tools and Practices

  GEICO Introduction:

  GEICO (Government Employees Insurance Company) was founded in 1936 and insures more than 27 million vehicles in all 50 states and the District of Columbia. A member of the Berkshire Hathaway family of companies, GEICO constantly strives to make lives better by protecting people against unexpected events while saving them money. GEICO is one of the nation's largest and fastest-growing auto insurers, known for our low rates, outstanding service, and clever marketing, but we are so much more. GEICO Tech is constantly looking for new ways to anticipate our customers' needs by asking big questions and challenging ourselves to think outside the box. Join our team as we continue to build and innovate!

  Position Overview:

  GEICO is seeking an experienced and visionary technical Director / Senior Director of Reliability Tools and Practices Engineering within the Site Reliability Engineering (SRE) organization. You will play a critical role in ensuring the reliability, availability, and performance of the company's systems and services by defining, developing, and delivering Reliability Tooling and Platforms to the GEICO engineering organization. You will lead a team responsible for developing, implementing, and maintaining tools, practices, and processes that enable the organization to achieve world-class reliability and operational excellence. This role combines technical expertise, leadership, and strategic thinking to drive continuous improvement in reliability and scalability.

  Key Responsibilities:

  Tooling Strategy: Develop and execute a comprehensive tooling strategy to enhance the reliability and scalability of our systems. Identify, evaluate, and implement cutting-edge tools and technologies that align with industry best practices.
  Tooling Development: Lead the development and maintenance of custom tools and automation solutions tailored to the specific needs of our SRE teams. Collaborate with cross-functional teams to ensure tooling meets business requirements.
  Monitoring and Alerting: Define and implement robust monitoring and alerting practices, ensuring that SRE teams have timely and actionable insights into system performance and issues.
  Incident Response: Establish and maintain incident response processes, including incident escalation procedures, post-incident reviews, and incident management tooling.
  Capacity Planning: Work closely with Capacity Planning teams to ensure adequate resources are provisioned to meet growing demands, and develop tools and practices for proactive capacity management.
  Reliability Best Practices: Define and promote reliability best practices across the organization. Lead efforts to improve service-level objectives (SLOs) and error budgeting.
  Documentation: Ensure comprehensive documentation of reliability tools and practices, making them accessible to SRE teams and promoting knowledge sharing.
  Team Leadership: Build and lead a high-performing team of reliability engineers and tooling specialists. Provide mentorship, guidance, and professional development opportunities.
  Vendor Relations: Manage relationships with third-party tooling vendors, negotiate contracts, and stay informed about emerging trends and innovations in the field.
  Compliance and Security: Ensure that all reliability tools and practices adhere to security and compliance standards, and drive efforts to continuously enhance security.

  Qualifications:

  Bachelor's or Master's degree in Computer Science, Information Technology, or a related field, or equivalent practical experience. Advanced degree preferred.
  Proven experience leading full-stack software development teams, preferably within the Site Reliability Engineering, Engineering Productivity, Observability, or Workflow Automation domains, using agile software development methodologies and DevOps practices.
  Strong expertise in defining, developing, and managing reliability tools and automation solutions as a product owner.
  Knowledge of chaos engineering principles and tools.
  Experience with continuous integration and continuous delivery (CI/CD) pipelines.
  Proficiency in monitoring, alerting, and incident response tools such as Prometheus, Grafana, ELK, PagerDuty, etc.
  Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
  Familiarity with industry best practices in reliability engineering, including SLOs, error budgets, and incident management.
  Expertise in incident management processes, including creating incident response playbooks, incident triaging strategies, and post-incident analysis to drive continuous improvement in system reliability and availability.
  Experience with infrastructure automation, tooling, and configuration management frameworks (e.g., Kubernetes, Ansible, Terraform, etc.).
  Excellent leadership and team management skills with a passion for mentoring and fostering professional growth.
  Strong problem-solving and analytical abilities, with a keen eye for detail and a passion for driving operational efficiency.
  Experience in budget management, resource allocation, and vendor collaboration.

  Annual Salary

  $195,000.00 - $315,000.00

  The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate/annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role, the selected candidate's work experience, education and training, and the work location, as well as market and business considerations.

  At this time, GEICO will not sponsor a new applicant for employment authorization for this position.

  Benefits:

  As an Associate, you'll enjoy our Total Rewards Program* to help secure your financial future and preserve your health and well-being, including:

  Premier Medical, Dental and Vision Insurance with no waiting period
  Paid Vacation, Sick and Parental Leave
  401(k) Plan
  Tuition Reimbursement
  Paid Training and Licensures

  *Benefits may be different by location. Benefit eligibility requirements vary and may include length of service.

  Coverage begins on the date of hire. Must enroll in New Hire Benefits within 30 days of the date of hire for coverage to take effect.

  The equal employment opportunity policy of the GEICO Companies provides for a fair and equal employment opportunity for all associates and job applicants regardless of race, color, religious creed, national origin, ancestry, age, gender, pregnancy, sexual orientation, gender identity, marital status, familial status, disability or genetic information, in compliance with applicable federal, state and local law. GEICO hires and promotes individuals solely on the basis of their qualifications for the job to be filled.

  GEICO reasonably accommodates qualified individuals with disabilities to enable them to receive equal employment opportunity and/or perform the essential functions of the job, unless the accommodation would impose an undue hardship to the Company. This applies to all applicants and associates. GEICO also provides a work environment in which each associate is able to be productive and work to the best of their ability. We do not condone or tolerate an atmosphere of intimidation or harassment. We expect and require the cooperation of all associates in maintaining an atmosphere free from discrimination and harassment with mutual respect by and for all associates and applicants.
