NVIDIA is searching for a Deep Learning Algorithms and System Software Engineer to develop artificial intelligence (AI) and computer vision algorithms and applications for our Metropolis for Factories and Manufacturing platforms. Artificial Intelligence is transforming how we collect, inspect, and analyze different kinds of sensor data, impacting everything from manufacturing automation and warehouse management to product inspections and safety workflows. NVIDIA Metropolis is leading this AI revolution, providing the tools, technologies, and expertise to meet every challenge with smarter, faster applications.
This challenging role requires someone who deeply understands Large Language and Multi-modal (LLM/LMM) Foundation models to advance the application of artificial intelligence and machine learning in the Manufacturing AI market. The role involves optimizing inference performance, crafting efficient inference pipelines, and building scalable AI systems using NVIDIA technologies like TensorRT, Triton, and CUDA.
What You’ll Be Doing:
Collaborate with diverse software, research, and hardware teams across geographies to analyze the interplay of hardware and software architectures, solve critical problems, and shape future applications.
Support engagements with customers and their third-party software providers. Collaborate with Product Management, Marketing, and Developer Technology teams.
Develop algorithms and pipelines for multi-modal large models (text, image, video, audio, etc.), optimize and scale AI models for efficient and reliable performance.
Work on microservices architectures and inference APIs for AI model serving, ensuring modularity, scalability, and resilience.
Drive the design and implementation of complex AI projects, providing technical guidance and support, and mentoring junior engineers.
What We Need To See:
3 years or more of working experience and an MS or PhD in Computer Science, Computer Engineering, Electrical Engineering, or a related field with a focus on Deep Learning, Machine Learning, and Computer Vision.
Familiarity with AI model dataset preparation/curation, model training, and inference flow/pipeline.
Proficiency in working with deep learning frameworks such as TensorFlow and PyTorch. Strong programming skills in Python and/or C++, and experience developing integrated AI solutions.
Experience with microservices and inference API architectures for AI model serving.
Knowledge of software development best practices, including version control, code review, and documentation.
Proven ability to lead projects, manage timelines, and deliver results.
Strong communication skills and ability to work in a collaborative environment.
Ways To Stand Out From The Crowd:
Familiarity with cloud-based machine learning systems and CI/CD skills, including Kubernetes, containers, and Helm.
Experience with deploying and managing inference solutions with NVIDIA TensorRT, CUDA, and other acceleration technologies.
Skilled in large-scale data processing and distributed computing systems.
NVIDIA is widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us. Are you a creative and autonomous data scientist who loves challenges? Do you have a genuine passion for advancing the state of AI and machine learning across a variety of industries? If so, we want to hear from you. Come and join our Metropolis team where you'll help build our real-time, cost-effective computing platform driving our success in this exciting and quickly growing field.NVIDIA is committed to fostering a diverse work environment and is proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status, or any other characteristic protected by law.