Senior Deep Learning Scientist, LLM Retrieval Augmented Generation-March 2024
Santa Clara
Mar 28, 2025
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
About Senior Deep Learning Scientist, LLM Retrieval Augmented Generation

  Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, autonomous cars and conversational AI that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company, and build our teams with the hardest working people in the world. Join us at the forefront of technological advancement.

  NVIDIA is looking for a Senior Deep Learning Scientist, LLM Retrieval Augmented Generation, to develop high-impact, high-visibility large language model products and improve the experience of millions of customers leveraging our NeMo LLM MLOps platform. If you're creative and passionate about solving real-world conversational AI problems, come join our Digital Human LLM team. For more details on the NeMo Framework for LLMs, see https://www.nvidia.com/en-us/ai-data-science/generative-ai/nemo-framework/

  What you’ll be doing:

  Develop, train, fine-tune, and deploy multimodal large language models for retrieval augmented generation

  Develop an LLM agent framework for orchestrating large-scale RAG applications

  Apply instruction tuning, reinforcement learning from human feedback (RLHF), and parameter-efficient fine-tuning methods such as p-tuning, adapters, and LoRA to improve LLMs for different RAG use cases

  Measure and benchmark model and application performance

  Analyze model accuracy and bias, and recommend the next course of action and improvements.

  Maintain model evaluation systems

  Drive the gathering, building, and annotation of domain-specific datasets to train LLMs for different tasks and applications.

  Characterize performance and quality metrics across platforms for various AI and system components

  Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.

  Help innovate, identify problems, recommend solutions, and perform triage in a collaborative team environment, and work with various teams on new product features and improvements to existing products.

  What we need to see:

  Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, or Applied Math with 5+ years of experience

  Excellent programming skills in Python with strong fundamentals in programming, optimizations and software design

  Solid understanding of ML/DL techniques, algorithms and tools with exposure to CNN, RNN (LSTM), Transformers (BERT, BART, GPT/T5, Megatron, LLMs)

  Hands-on experience with conversational AI technologies such as natural language understanding, natural language generation, dialog systems (including system integration, state tracking, and action prediction), information retrieval and question answering, and machine translation

  Experience training BERT, GPT, and Megatron models for different NLP and dialog system tasks using the PyTorch deep learning framework, and performing NLP data wrangling and tokenization

  Experience developing large-scale multimodal information retrieval systems leveraging open source frameworks such as LlamaIndex, LangChain, FAISS, and Haystack

  Experience developing production LLM-powered applications and tools with natural language interfaces

  Understanding of the MLOps life cycle and experience with MLOps workflows, traceability, and versioning of datasets, including know-how in database management and queries (SQL, MongoDB, etc.)

  Experience using end-to-end MLOps platforms such as Kubeflow, MLflow, and Airflow

  Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment

  Ways to stand out from the crowd:

  Fluency in a non-English language - Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Korean / Italian / Portuguese

  Familiarity with GPU-based technologies such as CUDA, cuDNN, and TensorRT

  Background with Docker and Kubernetes, and strong C++ programming skills

  Background with deploying machine learning models on data center, cloud, and embedded systems

  Experience developing document extraction for different document types and sources, and indexing at scale

  Experience adapting LLMs to different domains such as automotive, health care, and finance

  With competitive salaries and a generous benefits package (www.nvidiabenefits.com), we are widely considered to be one of the technology world’s most desirable employers. We have some of the most forward-thinking and hardworking people in the world working for us and, due to outstanding growth, our best-in-class engineering teams are rapidly growing. If you're a creative and autonomous engineer with a real passion for technology, we want to hear from you!

  The base salary range is 144,000 USD - 270,250 USD. Your base salary will be determined based on your location, experience, and the pay of employees in similar positions.

  You will also be eligible for equity and benefits (https://www.nvidia.com/en-us/benefits/). NVIDIA accepts applications on an ongoing basis.

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA.
