Home
/
Comprehensive
/
Senior DevOps Engineer - MLOps Platform
Senior DevOps Engineer - MLOps Platform-March 2024
Shanghai
Mar 21, 2025
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Senior DevOps Engineer - MLOps Platform

  We are seeking passionate and hardworking individuals to help us scale our AI and deep learning platforms. As a DevOps and Release Engineer, you will play a critical role in ensuring the smooth and efficient release of our software applications, working closely with our development, operations, and quality assurance teams. You will be responsible for implementing and maintaining the DevOps practices, tools, and infrastructure that enable our teams to deliver high-quality software reliably and efficiently, while ensuring smooth release management and deployment processes.

  What you will be doing:

  Develop, maintain, and improve CI/CD tools for on-prem and cloud deployment of our software, enable sophisticated cross-platform build systems, and bring world-class release engineering to NVIDIA's platform and cloud deployment process.

  Collaborate with development, operations, and quality assurance teams to establish and maintain efficient and reliable DevOps practices, tools, and infrastructure that enable continuous integration, continuous delivery (CI/CD), and efficient software release management.

  Automate and streamline build, deployment, and release processes, including configuration management, environment provisioning, and application deployment, using infrastructure as code (IaC) tools such as Terraform.

  Manage and coordinate software releases, including versioning, branching, merging, and tagging, and ensure proper release management practices are followed.

  Monitor and fix the software development and deployment pipelines, identifying and resolving issues related to build failures, test failures, code quality, and performance, in collaboration with development, operations, and quality assurance teams.

  Collaborate with operations and security teams to ensure proper configuration and management of infrastructure resources, including containers, databases, and networking, following standard processes for security, scalability, and cost optimization.

  Stay up-to-date with the latest advancements in DevOps tools, technologies, and standard methodologies, and provide recommendations for continuous improvement of our software development and deployment processes.

  What we need to see:

  Bachelor's or Master's degree in Computer Science, Software Engineering, or a related field (or equivalent experience).

  2+ years of experience with large and complex software development environments. Experience with large code bases (1M+ LoC) is a plus

  Proven experience as a DevOps and Release Engineer, with a focus on implementing and maintaining DevOps practices, tools, and infrastructure.

  Strong programming and scripting skills in languages such as Python, Java, Shell, or PowerShell, and proficiency in version control systems such as Git or Subversion.

  Proficiency with popular CI/CD tools (e.g., Jenkins, GitLab CI, Travis CI, CircleCI), build systems (e.g., CMake, Bazel, Gradle), and version control systems (e.g., Git, Perforce).

  Knowledge of infrastructure as code (IaC) tools and concepts, including Terraform, and experience with cloud computing platforms.

  Familiarity with containerization technologies such as Docker and container orchestration platforms such as Kubernetes.

  Strong understanding of software testing principles, including unit testing, integration testing, and end-to-end testing, and experience with automated testing frameworks and tools.

  Knowledge of release management practices, including versioning, branching, merging, and tagging, and experience with release management tools and processes.

  Knowledge of networking, virtualization, and operating system concepts, and experience with managing virtual machines, containers, databases, and networking in cloud and on-premises environments.

  Ways To Stand Out From The Crowd

  Experience with GPU-accelerated applications or technologies.

  Thrives in a multi-tasking environment with constantly evolving priorities.

  Strong background with Jenkins on k8s at scale.

  Prior experience with a large scale operations team.

  Outstanding interpersonal skills and communication with all levels of management.

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA .

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Food Service Manager 9058
JOB REQUIREMENTS: Hourly Assistant Manager (45-Hour Work Week) Whyshould you join the DReaM Team? To be part of a family focused culturethat allows you flexibility in your schedule to achieve your fa
Industrial Hygienist/Environmental Technician - 10300001555
Industrial Hygienist/Environmental Technician - 10300001555 DESCRIPTION/RESPONSIBILITIES:Your Role:Tetra Tech invites you to consider a rewarding opportunity within our Philadelphia, Pennsylvania fie
Accounts Payable Processing Specialist
Department/Unit: Accounts Payable Work Shift: Day (United States of America) Salary Range: $45,000 - $60,000 Under the direction of the Accounts Payable Manager, the Accounts Payable Processing Speci
Roadway Engineer
Overview Kimley-Horn's Miami, Florida (FL) office is seeking a Civil Engineer with 4+ years of experience to join their Roadway team! This is not a remote position. Responsibilities Collaborate with
General Service Technician
Overview Start Your New Career Today! GREAT working hours – Monday thru Friday 8:00 a.m. to 6:00 p.m., Saturday 8:00 a.m. to 5:00 p.m., and CLOSED on Sunday. We offer a flexible full- or part-time sc
Gridline Operator
JOB REQUIREMENTS: Job Description: Pentair has a job opportunity foryou! Join us as aGridline Operatorin our Manitowoc Ice facility. Youwill operate the grid oven, washer, conveyor conveying equipmen
Surveillance Operator-Hochatown
Full-time All shifts available. $19.30 Hourly Weekly Earned Wage Access is an option for this position. Job Summary: You will observe all areas Casino operations using video and audio technologies, p
Software Quality Engineer - Greene or Syracuse, NY - East Syracuse, NY
Job Title Software Quality Engineer - Greene or Syracuse, NY About our company: The Raymond Corporation is a division of the Toyota Industries Corporation. We empower you to do great work in a compan
Lead Electrical Engineer - Solar Utility Design (US Hybrid)
Lead Electrical Engineer - Solar Utility Design (US Hybrid) Date: Feb 24, 2024 Location: Overland Park, KS, US Ann Arbor, MI, US US Cary, NC, US Company: Black & Veatch Family of Companies Togeth
Executive, Sales Development
About the Role The Sales Development Executive will function as the Production Printing Sales Specialist responsible for ensuring achievement of the Region Plan for the Monochrome and Color Cut Sheet
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved