Summary:
We are looking for Research Scientist Interns to join the Meta AI Speech team in London. Our team creates spoken language technologies to make it faster and easier for people to build community and connect with others around the world. We conduct product-motivated research in ML/AI, and we design, develop, and deploy state-of-the-art algorithms across Meta. We work on all aspects of AI for speech and audio processing, including speech recognition, speech synthesis, speaker identification, keyword spotting, and acoustic event detection, with an emphasis on multimodal understanding, i.e., augmenting acoustic information with visual cues or cues from other sensors available on AR devices. Our work is largely focused on voice interfaces, including speech technologies for Ray-Ban Meta smart glasses, Quest 3 mixed-reality headsets, Augmented Reality, and the Metaverse, and on understanding video on Facebook and Instagram, including transcription, captioning, and content understanding.
As a Research Scientist Intern, you will help us develop innovative models and algorithms and apply them to large-scale production speech tasks. Our teams at Meta AI offer internships lasting twelve (12) to twenty-four (24) weeks, with various start dates throughout the year. To learn more about our research, visit https://research.facebook.com.
Required Skills:
Research Scientist Intern, Speech & Audio Technologies (PhD) Responsibilities:
Perform research to advance the science and technology of intelligent machines.
Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources.
Contribute research that can be applied to Meta product development.
Analyze and improve efficiency, scalability, and stability of various deployed systems.
Collaborate with team members from prototyping to production.
Minimum Qualifications:
Currently has, or is in the process of obtaining, a PhD degree.
Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.
Experience in C/C++ and Python.
Experience in deep learning frameworks such as PyTorch or TensorFlow.
Research and/or work experience in machine learning, deep learning, and/or speech technology.
Preferred Qualifications:
Experience manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources.
Proven track record of achieving results, as demonstrated by grants, fellowships, or patents, as well as first-authored publications at workshops or conferences such as Interspeech, ICASSP, or similar.
A strong interest in theoretical and empirical research and in answering hard questions with research.
Interpersonal experience: cross-group and cross-culture collaboration.
Ability to stay in touch with the literature of a particular domain and to reproduce results if needed.
Experience training deep neural networks for key speech tasks such as speech recognition, speech synthesis, speech translation, speaker diarization, sentiment analysis, acoustic event recognition, scene understanding, wake word detection, etc.
Experience working with other modalities such as vision and text understanding is a plus.
Intent to return to the degree program after the completion of the internship/co-op.
Industry: Internet