Home
/
Comprehensive
/
Senior Deep Learning Scientist, Speech Synthesis
Senior Deep Learning Scientist, Speech Synthesis-April 2024
Pune
Apr 16, 2025
ABOUT NVIDIA
NVIDIA is a computing platform company, innovating at the intersection of graphics, HPC, and AI.
10,000+ employees
Technology
VIEW COMPANY PROFILE >>
About Senior Deep Learning Scientist, Speech Synthesis

  Widely considered to be one of the technology world’s most desirable employers, NVIDIA is an industry leader with groundbreaking developments in High-Performance Computing, Artificial Intelligence and Visualization. The GPU, our invention, serves as the visual cortex of modern computers and is at the heart of our products and services. GPU deep learning ignited modern AI — the next era of computing — with the GPU acting as the brain of computers, robots, autonomous cars and conversational AI that can perceive and understand the world. Today, we are increasingly known as “the AI computing company.” We're looking to grow our company, and build our teams with the smartest people in the world. Join us at the forefront of technological advancement.

  NVIDIA is looking for Speech Data Scientists to develop high-impact, high-visibility Speech AI product "Riva" & improve the experience of millions of customers. If you're creative & passionate about solving real world conversational AI problems, come join our Riva Product engineering team. For more details on Riva check https://developer.nvidia.com/riva

  What you’ll be doing:

  Train Speech Synthesis Mel spectrogram and vocoder models.

  Measure and benchmark model performance.

  Maintain TTS model evaluation system

  Analyze model accuracy and bias and recommend the next course of action & Improvements.

  Improve processes for speech data processing, augmentation, filtering & TTS Training sets preparation.

  Gather knowhow on TTS datasets for training & evaluation.

  Characterize performance and quality metrics across platforms for various speech AI components.

  Collaborate with various teams on new product features and improvements of existing products.

  Participate in developing and reviewing code, design documents, use case reviews, and test plan reviews.

  Help innovate, identify problems, recommend solutions and perform triage in a collaborative team environment.

  What we need to see:

  Master’s degree (or equivalent experience) or PhD in Computer Science, Electrical Engineering, Artificial Intelligence, Applied Math, Linguistics or Computational Linguistics

  5+ years of experience

  Excellent programming skills in Python.

  Strong fundamentals in Programming, optimizations and Software design and strong knowledge of ML/DL techniques, algorithms and tools with exposure to CNN, RNN (LSTM), Transformers.

  Knowhow of Deep learning applications to Speech synthesis, LLM, and Speech-to-speech translations.

  Hands-on experience on Speech Technologies like Speech Synthesis, voice cloning, etc.

  Experience with Training of speech models and experience with “PyTorch” Deep Learning Frameworks.

  Exposure to basic speech digital signal processing and feature extraction techniques like FFT, MFCC, Mel Spectrogram, etc.

  General background around version control and code review tools like Git, Gerrit, Gitlab.

  Strong collaborative and interpersonal skills, specifically a proven ability to effectively guide and influence within a dynamic matrix environment.

  Ways to stand out from the crowd:

  Native or near-native fluency in a non-English language - Spanish / Mandarin / German / Japanese / Russian / French / UK English / Arabic / Hindi / Korean / Italian / Portuguese

  Experience developing multilingual code-switched TTS, voice cloning, and cross-lingual voice cloning

  Background in developing WFST and Neural networks-based Text-Normalization and Inverse Text-Normalization

  Experience working with G2P systems for multiple languages

  Feeling comfortable and motivated when working in a fast paced, highly collaborative, dynamic work environment

  NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression , sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.

  NVIDIA is a Learning Machine

  NVIDIA pioneered accelerated computing to tackle challenges no one else can solve. Our work in AI and the metaverse is transforming the world's largest industries and profoundly impacting society.

  Learn more about NVIDIA .

Comments
Welcome to zdrecruit comments! Please keep conversations courteous and on-topic. To fosterproductive and respectful conversations, you may see comments from our Community Managers.
Sign up to post
Sort by
Show More Comments
SIMILAR JOBS
Chief Operations Office | Advisory
Status Category:Full-Time Exempt/Non-Exempt:Exempt Scheduled Hours Per Week:40 Job Code:FS144 With over 120 offices and nearly 7,000 associates throughout the U.S. CBIZ(NYSE: CBZ) delivers top-level
Project Superintendent
Project Superintendent Company Name: Baker Concrete Construction, Inc Location: Miami, FL, US, 33132 Req ID : 4566 Travel: Up to 25% Number of Openings: 1 If you’re driven to accomplish great things,
Purchasing ASSISTANT
Description: Performs a variety of purchasing duties. Maintains records and files pertinent to purchasing information. Purchases routine, non-discretionary materials and supplies. Compiles, records,
Pest Control Service Technician Trainee I
Want to Join the Best in Pests? Go Pro with Orkin. As an Orkin Pro, youll put the pro in protecting what people value most: their home. Youll have more than a jobyoull have a career with growth poten
Warehouse Order Picker/Backup Driver
Description: Warehouse Worker/Backup Driver We are looking for a Warehouse worker that will be picking orders, has strong attention to detail, using paper slips, receiving. Products are general plumb
Research Scientist
Summary: Meta Platforms, Inc. (Meta), formerly known as Facebook Inc., builds technologies that help people connect, find communities, and grow businesses. When Facebook launched in 2004, it changed
Retail Personal Banker I - Southeast Dayton Region
Make banking a Fifth Third better® We connect great people to great opportunities. Are you ready to take the next step? Discover a career in banking at Fifth Third Bank. GENERAL FUNCTION: Selected ca
Package Consultant: SAP FIN CO
Introduction In this role, you'll work in our IBM Client Innovation Center (CIC), where we deliver deep technical and industry expertise to a wide range of public and private sector clients around th
Cosmetologist / Hairstylist
PS SALON & SPAAre you a licensed Cosmetologist looking for a part time position with no nights or weekend schedules? We'd love to discuss this great opportunity with you! Looking for talented, Li
Learning Designer - Digital Adoption Expert
Who Are We? Taking care of our customers, our communities and each other. That’s the Travelers Promise. By honoring this commitment, we have maintained our reputation as one of the best property casu
Copyright 2023-2025 - www.zdrecruit.com All Rights Reserved