Site Reliability Engineer - Hybrid Cloud Container Platform-April 2024
Charlotte
Apr 19, 2025
ABOUT BANK OF AMERICA
Bank of America is a leading financial institution, serving consumers, small businesses, and large corporations with a full range of banking, investing, and other financial products and services.
10,000+ employees
Financial Services
About Site Reliability Engineer - Hybrid Cloud Container Platform

  Site Reliability Engineer - Hybrid Cloud Container Platform

  Chandler, Arizona; Atlanta, Georgia; Richmond, Virginia; Charlotte, North Carolina; Plano, Texas; Jersey City, New Jersey

  Job Description:

  At Bank of America, we are guided by a common purpose to help make financial lives better through the power of every connection. Responsible Growth is how we run our company and how we deliver for our clients, teammates, communities and shareholders every day.

  One of the keys to driving Responsible Growth is being a great place to work for our teammates around the world. We’re devoted to being a diverse and inclusive workplace for everyone. We hire individuals with a broad range of backgrounds and experiences and invest heavily in our teammates and their families by offering competitive benefits to support their physical, emotional, and financial well-being.

  Bank of America believes both in the importance of working together and offering flexibility to our employees. We use a multi-faceted approach for flexibility, depending on the various roles in our organization.

  Working at Bank of America will give you a great career with opportunities to learn, grow and make an impact, along with the power to make a difference. Join us!

  About Bank of America – Global Technology:

  Global Technology delivers technology services globally across the bank’s eight lines of business that serve individuals, companies, and institutions. The team also focuses on digital banking, payments, infrastructure, data management and technology that enhances cyber security, and risk and capital management. Innovation is at the heart of all Global Technology does.

  Enterprise Cloud Platforms Team:

  The Enterprise Cloud Platforms team in the CTO organization offers Private and Public Cloud platforms that help Bank of America’s developers drive faster time-to-market, innovate with private and public cloud capabilities, and reduce complexity with built-in integrations. We believe in a high-quality engineering culture: we engineer our platforms with a customer and platform mindset, design for large enterprise scale and resilience, and accelerate market innovation into the technical platforms we deliver.

  As part of this team, you will have a large impact on the evolution of next generation Cloud services for Bank of America and explore an extensive list of new technologies that will drive innovation across our company.

  We are seeking an experienced Site Reliability Engineer (SRE) to support and administer our Hybrid Cloud Container (OpenShift/AKS) platform.

  Our HCCP Site Reliability Engineers (SREs) ensure that our platform meets the reliability and uptime requirements of our demanding enterprise customers. This is achieved through the best engineering practices, resilient design, and a well-defined and effective global on-call rotation that runs 24x7.

  The role provides an opportunity to work with a wide range of technologies and a unique perspective on how various services (on-prem/off-prem) interact with each other. You will work with colleagues who are as smart, hardworking, and driven as you. You will work in a team that keeps growing and innovating, and that gives you room to be proactive and creative.

  Are you ready for the next step in your career? Then we’d love to hear from you!

  Position Summary

  Responsible for the reliability and support of the Container Platform on-prem and in external clouds (Azure/AWS/Google)

  Monitor and troubleshoot performance, connectivity, security, and other issues in the container platform (OpenShift) and Azure (AKS) environments.

  Perform deep dives into systemic and latent reliability issues, incident management, and problem management

  Identify, analyze, and resolve infrastructure vulnerabilities and application deployment issues.

  Perform blameless RCA and partner with engineering and operations teams across the organization to roll out fixes.

  Own application onboarding and provide troubleshooting support throughout the lifecycle of applications on the container platform.

  Identify and drive opportunities to improve automation, reduce toil, and improve operational excellence.

  Partner with risk and compliance teams to bring visibility, implement the right controls, and remediate vulnerabilities.

  Ensure resiliency during implementation and identify/fix resiliency problems by collaborating with engineering teams.

  Be a key stakeholder in the design of cloud services and work with architecture, engineering, and product teams

  Participate in 24x7 on-call coverage under a follow-the-sun model

  Required Skills

  BS/MS degree in Computer Science or related technical field involving systems or equivalent practical experience.

  5+ years of hands-on experience supporting Kubernetes/OpenShift/AKS/EKS container platforms.

  Experience with Python, Ansible, Golang, and shell scripting

  Strong experience in major services related to Compute, Storage, Network and Security

  Experience with monitoring tools like Prometheus and Dynatrace, as well as cloud native tools like Azure Monitor and Log Analytics

  Strong understanding and background of working with a complex IAM infrastructure, including Active Directory, Azure AD Connect, Azure AD, and Ping Identity or other SSO solutions.

  Advanced knowledge of Linux OS, DNS, DHCP, Kerberos and Windows Authentication

  Experience with CI/CD tools (Git/Jenkins) and the GitOps model

  Excellent understanding of Linux/Windows operating systems administration

  Experience in Container security and vulnerability remediation.

  Systematic problem-solving approach, sense of ownership and drive

  Ability to juggle competing priorities and adapt to changes in project scope.

  Excellent interpersonal, organizational and communication (written, verbal, and presentation) skills are a must.

  Proven ability to work independently with minimal supervision and as part of a team with direct responsibilities.

  Desired Skills

  Experience with OpenShift and CSP Kubernetes services such as AKS and EKS

  Kubernetes/OpenShift/Terraform certifications are a plus

  Experience with Terraform, Argo CD, Tekton, and Knative technologies.

  Experience in agile deployment methodologies (GitOps)

  Knowledge of various container runtimes

  Familiarity with the operator deployment pattern.

  Experience working in a highly available multi-datacenter environment

  Experience working with monitoring tools such as Prometheus, Splunk, Dynatrace, Sysdig, or similar tools.

  Understanding of cost management, inventory management, and the FinOps model

  Shift:

  1st shift (United States of America)

  Hours Per Week:

  40

  Bank of America and its affiliates consider for employment and hire qualified candidates without regard to race, religious creed, religion, color, sex, sexual orientation, genetic information, gender, gender identity, gender expression, age, national origin, ancestry, citizenship, protected veteran or disability status or any factor prohibited by law, and as such affirms in policy and practice to support and promote the concept of equal employment opportunity and affirmative action, in accordance with all applicable federal, state, provincial and municipal laws. The company also prohibits discrimination on other bases such as medical condition, marital status or any other factor that is irrelevant to the performance of our teammates.

  To view the "EEO is the Law" poster, CLICK HERE (https://www.dol.gov/ofccp/regs/compliance/posters/pdf/eeopost.pdf).

  To view the "EEO is the Law" Supplement, CLICK HERE (https://www.dol.gov/ofccp/regs/compliance/posters/pdf/OFCCP_EEO_Supplement_Final_JRF_QA_508c.pdf).

  Bank of America aims to create a workplace free from the dangers and resulting consequences of illegal and illicit drug use and alcohol abuse. Our Drug-Free Workplace and Alcohol Policy (“Policy”) establishes requirements to prevent the presence or use of illegal or illicit drugs or unauthorized alcohol on Bank of America premises and to provide a safe work environment.

  To view Bank of America’s Drug-Free Workplace and Alcohol Policy, CLICK HERE.
