GEICO Introduction:
GEICO (Government Employees Insurance Company) was founded in 1936 and insures more than 27 million vehicles in all 50 states and the District of Columbia. A member of the Berkshire Hathaway family of companies, GEICO constantly strives to make lives better by protecting people against unexpected events while saving them money. GEICO is one of the nation's largest and fastest-growing auto insurers, known for our low rates, outstanding service, and clever marketing, but we are so much more. GEICO Tech is constantly looking for new ways to anticipate our customers' needs by asking big questions and challenging ourselves to think outside the box. Join our team as we continue to build and innovate!
Position Overview:
GEICO is seeking an experienced and visionary technical Director / Senior Director of Reliability Tools and Practices Engineering within the Site Reliability Engineering (SRE) organization. You will play a critical role in ensuring the reliability, availability, and performance of the company's systems and services by defining, developing, and delivering Reliability Tooling and Platforms to the GEICO engineering organization. You will lead a team responsible for developing, implementing, and maintaining tools, practices, and processes that enable the organization to achieve world-class reliability and operational excellence. This role combines technical expertise, leadership, and strategic thinking to drive continuous improvement in reliability and scalability.
Key Responsibilities:
Tooling Strategy: Develop and execute a comprehensive tooling strategy to enhance the reliability and scalability of our systems. Identify, evaluate, and implement cutting-edge tools and technologies that align with industry best practices.
Tooling Development: Lead the development and maintenance of custom tools and automation solutions tailored to the specific needs of our SRE teams. Collaborate with cross-functional teams to ensure tooling meets business requirements.
Monitoring and Alerting: Define and implement robust monitoring and alerting practices, ensuring that SRE teams have timely and actionable insights into system performance and issues.
Incident Response: Establish and maintain incident response processes, including incident escalation procedures, post-incident reviews, and incident management tooling.
Capacity Planning: Work closely with Capacity Planning teams to ensure adequate resources are provisioned to meet growing demands, and develop tools and practices for proactive capacity management.
Reliability Best Practices: Define and promote reliability best practices across the organization. Lead efforts to improve service-level objectives (SLOs) and error budgeting.
Documentation: Ensure comprehensive documentation of reliability tools and practices, making them accessible to SRE teams and promoting knowledge sharing.
Team Leadership: Build and lead a high-performing team of reliability engineers and tooling specialists. Provide mentorship, guidance, and professional development opportunities.
Vendor Relations: Manage relationships with third-party tooling vendors, negotiate contracts, and stay informed about emerging trends and innovations in the field.
Compliance and Security: Ensure that all reliability tools and practices adhere to security and compliance standards, and drive efforts to continuously enhance security.
Qualifications:
Bachelor's or Master's degree in Computer Science, Information Technology, or related field, or equivalent practical experience. Advanced degree preferred.
Proven experience leading full-stack software development teams, preferably within the Site Reliability Engineering, Engineering Productivity, Observability, or Workflow Automation domains, using agile software development methodologies and DevOps practices.
Strong expertise in defining, developing, and managing reliability tools and automation solutions as a product owner.
Knowledge of chaos engineering principles and tools.
Experience with continuous integration and continuous delivery (CI/CD) pipelines.
Proficiency in monitoring, alerting, and incident response tools such as Prometheus, Grafana, ELK, and PagerDuty.
Experience with cloud platforms (e.g., AWS, Azure, GCP) and container orchestration (e.g., Kubernetes).
Familiarity with industry best practices in reliability engineering, including SLOs, error budgets, and incident management.
Expertise in incident management processes, including creating incident response playbooks, incident triaging strategies, and post-incident analysis to drive continuous improvement in system reliability and availability.
Experience with infrastructure automation, tooling, and configuration management frameworks (e.g., Kubernetes, Ansible, Terraform).
Excellent leadership and team management skills with a passion for mentoring and fostering professional growth.
Strong problem-solving and analytical abilities, with a keen eye for detail and a passion for driving operational efficiency.
Experience in budget management, resource allocation, and vendor collaboration.
Annual Salary
$195,000.00 - $315,000.00
The above annual salary range is a general guideline. Multiple factors are taken into consideration to arrive at the final hourly rate or annual salary to be offered to the selected candidate. Factors include, but are not limited to, the scope and responsibilities of the role; the selected candidate's work experience, education, and training; the work location; and market and business considerations.
At this time, GEICO will not sponsor a new applicant for employment authorization for this position.
Benefits:
As an Associate, you'll enjoy our Total Rewards Program* to help secure your financial future and preserve your health and well-being, including:
Premier Medical, Dental and Vision Insurance with no waiting period
Paid Vacation, Sick and Parental Leave
401(k) Plan
Tuition Reimbursement
Paid Training and Licensures
*Benefits may be different by location. Benefit eligibility requirements vary and may include length of service.
Coverage begins on the date of hire. Must enroll in New Hire Benefits within 30 days of the date of hire for coverage to take effect.
The equal employment opportunity policy of the GEICO Companies provides for a fair and equal employment opportunity for all associates and job applicants regardless of race, color, religious creed, national origin, ancestry, age, gender, pregnancy, sexual orientation, gender identity, marital status, familial status, disability or genetic information, in compliance with applicable federal, state and local law. GEICO hires and promotes individuals solely on the basis of their qualifications for the job to be filled.
GEICO reasonably accommodates qualified individuals with disabilities to enable them to receive equal employment opportunity and/or perform the essential functions of the job, unless the accommodation would impose an undue hardship to the Company. This applies to all applicants and associates. GEICO also provides a work environment in which each associate is able to be productive and work to the best of their ability. We do not condone or tolerate an atmosphere of intimidation or harassment. We expect and require the cooperation of all associates in maintaining an atmosphere free from discrimination and harassment with mutual respect by and for all associates and applicants.