For our Client, we are looking for a Site Reliability Engineering Lead with Java or Python skills to manage the technical infrastructure team that builds the architecture keeping everything the Client's users see online running smoothly.
The position is hybrid, with 2-3 days a week in the office (Kraków, Wrocław or Gdańsk).
SEE YOURSELF IN THIS ROLE
From designing and maintaining our data centers to building the next iterations of our platforms, we are the driving force behind the product portfolio. We also keep our networks operational, giving our users the fastest and most reliable experience possible.
What You'll Do
Develop and sustain tools and infrastructure to enhance the management of data throughout its lifecycle and access control
Participate in and improve the lifecycle of services from inception and design, through deployment, operation, and refinement
Maintain services by measuring and monitoring availability, latency, and overall system health
Construct and expand systems sustainably using approaches such as automation, and refine systems by initiating changes that boost reliability and speed
Implement sustainable incident management and conduct thorough after-action reviews

Responsibilities as a Lead:
Serving as the primary contact for the client
Daily interactions with the client's team based on the West Coast of the USA
Engaging in daily Scrum routines
Prioritizing the backlog and assigning tasks

What You Have
Team Lead experience and excellent communication skills
Hands-on experience with any public cloud (GCP is nice to have)
Proficiency in Linux administration (RH and Debian are preferred; other variants accepted)
Solid grasp of automation tools like Terraform/Ansible
Exposure to scripting languages, notably Python or Go
Availability to work evenings for coordination with onshore client employees in West Coast and Central time zones
For candidates with Java familiarity: understanding of its syntax, OOP principles, and basic code writing, testing, and debugging
Knowledge of Java elements like variables, arrays, strings, math functions, and control flow statements
Understanding of Java classes, object creation, exception handling, debugging, and common Java APIs and libraries
For candidates with Python exposure: experience in writing, testing, and debugging Python code
Basic Python proficiency including handling variables, loops, conditionals, and managing Python packages

Nice to have
Expertise in automating build processes and deployments
Experience in debugging systems

We Offer

We gather like-minded people:
Friendly team and enjoyable working environment
Engineering community of industry professionals
Flexible schedule and opportunity to work remotely within Poland
Chance to work abroad for up to 60 days annually
Relocation within our 50+ offices

We provide growth opportunities:
Outstanding career roadmap
Leadership development, career advising, soft skills and well-being programs
Certification (GCP, Azure, AWS)
Unlimited access to LinkedIn Learning, Get Abstract, O'Reilly, Cloud Guru
Language classes in English and Polish for foreigners

We cover it all:
Stable income (Employment Contract or B2B)
Participation in the Employee Stock Purchase Plan
Referral bonuses
Benefits package (health insurance, multisport, shopping vouchers)
Corporate and social events

Only selected candidates may be contacted.

About EPAM
EPAM is a leading global provider of digital platform engineering and development services. We are committed to having a positive impact on our customers, our employees, and our communities. We embrace a dynamic and inclusive culture. Here you will collaborate with multi-national teams, contribute to a myriad of innovative projects that deliver the most creative and cutting-edge solutions, and have an opportunity to continuously learn and grow. No matter where you are located, you will join a dedicated, creative, and diverse community that will help you discover your fullest potential.

Why EPAM