Sr Software Engineer: Fleet Reliability & Performance
Århus
Sep 21, 2024
ABOUT UBER
At Uber, we reimagine the way the world moves for the better.
10,000+ employees
Technology
About Sr Software Engineer: Fleet Reliability & Performance

  About the Role

  We build the foundation for Uber's fleet of hundreds of thousands of hosts and VMs by ensuring they run reliably and are configured optimally for the container platforms that use them. We monitor for and detect a broad range of reliability and quality problems through codified processes, and we automatically drive remediation.

  We operate generically across bare-metal hosts and VMs, in our own on-prem data centers and with multiple cloud vendors, and we collaborate closely to develop integrations that ensure effective, automated management of these hosts and VMs.

  Internally, we integrate with Uber's stateful and stateless container scheduling platforms to orchestrate host operations safely and efficiently. We use this to remediate bad hosts and to apply fleet-wide upgrades, such as rolling out a new kernel.

  We own the base OS image and the kernel deployed on the fleet, and we handle fleet-wide kernel upgrades and configuration. We provide high-fidelity host and container metrics to ensure secure and optimal performance for the workloads on the hosts.

  Our team consists of a healthy combination of both junior and senior engineers with a broad range of experiences across the industry. We value ideas over hierarchy, always improving, getting things done through code and having a measurable impact on the business.

  What you will do

  You will apply your software engineering, systems engineering, hardware, Linux OS and kernel, cloud, and infrastructure systems experience to investigate and solve ambiguous problems in our production fleet. You will also contribute to planning, to the design of new systems, and to the improvement of existing systems to enable even greater efficiency and insight.

  - Contribute to the planning, design, architecture, and building of systems, tooling, and observability in support of production server fleet reliability and cloud expansion efforts.
  - Actively drive collaboration across multiple teams to create alignment and progress.
  - Implement solutions in Go with a strong focus on clean, readable code with unit and integration test coverage.
  - Perform low-level debugging of host-level issues and generalize their detection.
  - Take an active part in peer reviews of code changes to ensure quality and knowledge sharing across the team.
  - Contribute to engineering culture in terms of quality, monitoring, and on-call practices.
  - Own part of the team's charter and, through that, help set the longer-term direction for the team.

  Basic Qualifications

  - 5+ years of experience
  - BS, MS, or PhD degree in computer science or a similar technical field of study, or equivalent practical experience
  - Background in multiple programming languages, e.g., C/C++, Go
  - Strong hands-on experience with Linux, investigating and debugging performance problems
  - An inherent aim to collaborate, both within the team and across orgs
  - Excellent written and verbal communication skills, and the ability to write detailed design documents and post-mortems
  - A belief that your team can accomplish more together than as separate individuals
  - Attention to detail, particularly around software engineering fundamentals, testing methodologies, and quality

  Preferred Qualifications

  - Strong understanding of Linux kernel internals, e.g., ability to read and understand kernel code.
  - Experience in kernel and hardware performance evaluation, tuning, and debugging.
  - An understanding of server hardware at scale: data center network fundamentals, OS imaging, provisioning, distribution, and configuration deployment at a large scale.
  - Experience with cloud and migration to cloud is a plus.
  - Experience with large distributed systems.
  - Experience with containerization software such as Kubernetes, Docker, Mesos.
  - Comfortable working with on-prem and cloud-based infrastructure (AWS, GCP).

  We welcome people from all backgrounds who seek the opportunity to help build a future where everyone and everything can move independently. If you have the curiosity, passion, and collaborative spirit, work with us, and let's move the world forward, together.

  Offices continue to be central to collaboration and Uber's cultural identity. Unless formally approved to work fully remotely, Uber expects employees to spend at least half of their work time in their assigned office. For certain roles, such as those based at green-light hubs, employees are expected to be in-office for 100% of their time. Please speak with your recruiter to better understand in-office expectations for this role.

  *Accommodations may be available based on religious and/or medical conditions, or as required by applicable law. To request an accommodation, please reach out to [email protected].
