Home
/
Comprehensive
/
Principal Application Engineer (SRE)
Principal Application Engineer (SRE)-April 2024
Riverwoods
Apr 17, 2025
ABOUT DISCOVER
Make a difference working in a culture that thrives on customer-focused innovation.
10,000+ employees
Financial Services
VIEW COMPANY PROFILE >>
About Principal Application Engineer (SRE)

  Discover. A brighter future.

  With us, you’ll do meaningful work from Day 1. Our collaborative culture is built on three core behaviors: We Play to Win, We Get Better Every Day & We Succeed Together. And we mean it — we want you to grow and make a difference at one of the world's leading digital banking and payments companies. We value what makes you unique so that you have an opportunity to shine.

  Come build your future, while being the reason millions of people find a brighter financial future with Discover.

  Job Description:

  At Discover, be part of a culture where diversity, teamwork and collaboration reign. Join a company that is just as employee-focused as it is on its customers and is consistently awarded for both. We’re all about people, and our employees are why Discover is a great place to work. Be the reason we help millions of consumers build a brighter financial future and achieve yours along the way with a rewarding career.

  Have you ever wondered what’s behind Discover Card’s award-winning customer experience? Our Card Portfolio group owns dozens of cardmember experiences, from setting up your account on Discover.com for the first time to adding your Discover Card to your phone (and much more). This group is committed to building new and more efficient ways for our Cardmembers to use our products.

  This is where you come in. We need a Principal Application Reliability Engineer who’s seeking an opportunity to make a positive impact. You will partner with teams to identify and fix inefficiencies to solve system reliability and performance opportunities. Some examples include reviewing availability expectations, addressing performance issues, uncovering observability gaps, leading problem management, and driving capacity planning. You will actively manage risk and customer-impacting issues within the day-to-day role and ensure product leaders are aware.

  Responsibilities

  Consult teams and provide hands-on training to teams in observability, incident management and reliability best practices.

  Includes defining SLOs\SLAs\SLIs, on-call support behaviors, troubleshooting, building support playbooks, implementing monitoring and alerting, logging standards, conducting fragility & performance testing, etc.

  Review product journeys and reliability practices on regular interval to enforce best practices.

  Periodically pair/mob program with the teams to help build reliability thinking.

  Lead failure point discussions, chaos testing and family level capacity management.

  Responsible for family level application reliability and resiliency

  Leverage metrics and scorecards to better drive site reliability adoption in the product areas

  Ensure delivery teams in the product family track and meet annual operational goals (MTTR reduction, incident reduction, platform availability, SLO\SLA targets)

  Ensure automated delivery for all family level products.

  Ensure proper level of documentation exists.

  Drive SRE community discussions, share wins and failures with Discover SRE community of practice.

  Minimum Qualifications

  At a minimum, here’s what we need from you:

  Bachelors – Computer Science or related

  6+ Years -- Information Technology, (Software) Engineering, or related

  Internal applicants only: technical proficiency rating of proficient on the Dreyfus engineering scale

  Preferred Qualifications

  3+ years in a SRE or DevOps role

  Experience with DevOps tools, processes, and culture

  Extensive experience leading customer facing systems in a mission critical environment

  Advanced experience with programming and/or scripting languages (Python, Java, bash)

  In depth knowledge on application development landscape - Java, Rest API, design patterns and CI/CD.

  Extensive experience with monitoring and observability tools/technologies (i.e., Grafana, Kibana, Datadog, AppDynamics)

  Creation of standardized monitoring dashboards in cloud platforms for proactive monitoring of application and infrastructure health

  In-depth knowledge of Non-functional requirements (NFR’s) including pressure/chaos testing, performance, and penetration testing

  Reliability best practices in the cloud native environment

  Operational Readiness strategies and best practices

  #LI-DD1

  Application Deadline:

  The application window for this position is anticipated to close on Dec-05-2023. We encourage you to apply as soon as possible. The posting may be available past this date, but it is not guaranteed.

  Compensation:

  The base pay for this position generally ranges between $104,000.00 to $175,600.00. Additional incentives may be provided as part of a market competitive total compensation package. Factors, such as but not limited to, geographical location, relevant experience, education, and skill level may impact the pay for this position.

  Benefits:

  We also offer a range of benefits and programs based on eligibility. These benefits include:

  Paid Parental Leave

  Paid Time Off

  401(k) Plan

  Medical, Dental, Vision, & Health Savings Account

  STD, Life, LTD and AD&D

  Recognition Program

  Education Assistance

  Commuter Benefits

  Family Support Programs

  Employee Stock Purchase Plan

  Learn more at MyDiscoverBenefits.com .

  What are you waiting for? Apply today!

  All Discover employees place our customers at the very center of our work. To deliver on our promises to our customers, each of us contribute every day to a culture that values compliance and risk management.

  Discover is committed to a diverse and inclusive workplace. Discover is an equal opportunity employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender identity, national origin, age, disability, protected veteran status, or other legally protected status. (Know Your Rights) (https://urldefense.com/v3/__https:/www.eeoc.gov/poster__;!!MjXRb4uW6x5k!ABIVgRw0WsyX2wfQC-pKxK3V9X4h1NBUGgjO7EM8PTvp5MNRgpEuVC_jVk0fcn_ISAZjmwkbLuUIrj8mFedCBkyz$)

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Assistant Store Manager - JR026202
*Assistant Store Manager (Operations)*Contribute to our mission to improve Health and Wellness in your community.Become a Rite Aid Assistant Store Manager over Operations, today!As an Assistant Store
Software Engineer, Product
Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed
Water/Industrial Engineer
Parametrix is an Environmental, Engineering, Planning and Construction Management consulting firm. We are 100% employee-owned and have been in business over 50 years with a strong Western US practice
Behavioral Health Clinician- Inpatient Adult
Behavioral Health Clinician- Inpatient Adult Job Ref 2400024 Category Behavioral Health Job Family Behavioral Health Clinician Department Adult Overflow Unit Schedule Part-time Facility Behavioral He
Lead Cook, Long Term Care - Full Time
Create Your Career With Us! Join our not-for-profit organization that has provided over 100 years of housing and services to seniors with a commitment to quality care and service in a Christian envir
Principal Software Engineer
Job Description The Oracle Cloud Infrastructure (OCI) team can provide you the opportunity to build and operate a suite of massive scale, integrated cloud services in a broadly distributed, multi-ten
Warehouse Associate
Summary Job title: Warehouse Associate Job ID: 202454050001 Department: Omaha - 3E Location: NE-Omaha Description Summary: Looking for an associate to work in a fast-paced warehouse environment where
20176 - Service Desk Representative I
20176 -  Service Desk Rep. I West Point, GA   PURPOSE: Provide initial employee support for technical inquiries received via phone, email, and messaging applications. Assess the nature of problems an
Controller - Hybrid
Why Compucom? (Overview) Compucom Systems, Inc. provides end-to-end managed services to enable the digital workplace for enterprise, midsize and small businesses. To enable our clients to focus on wh
SOCIAL SERVICES ASSISTANT - (TEMPORARY)
Job Description About UsWhen you work at Chugach Government Solutions (CGS), you join a proud legacy of supporting missions while sustaining culture.The federal division of Chugach Alaska Corporation
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved