Meta AR/VR Job | Research Scientist Intern, Speech & Audio Technologies (PhD)

Job: Research Scientist Intern, Speech & Audio Technologies (PhD)

Type: Artificial Intelligence | Computer Vision, Machine Learning, Research

City: London, UK

Date: 2023-2-1


We are looking for Research Scientist Interns to join the Meta AI Speech teams in London. These teams at Meta create spoken language technology to make it faster and easier for people to build community and connect with others around the world. We are part of the AI Reality Labs Research organization, whose mission is to conduct product-motivated research in ML/AI and to design, develop, and deploy state-of-the-art algorithms to the rest of Meta. We work on all aspects of AI for speech and audio processing, including speech recognition, speech synthesis, speaker identification, keyword spotting, and acoustic event detection, with an emphasis on multimodal understanding, i.e. augmenting acoustic information with visual cues or cues from other sensors available on AR devices. Our work focuses largely on two areas: voice interfaces, including speech technologies for Ray-Ban Stories, Portal devices, Oculus VR headsets, Augmented Reality, and the Metaverse; and video understanding, including transcription, captioning, and content understanding.

As a Research Scientist Intern, you will help us develop innovative models and algorithms and apply them to large-scale production speech tasks.

This is a 2023 internship opportunity with start dates from May to September. To learn more about our research, visit


Currently has, or is in the process of obtaining, a PhD degree.

Must obtain work authorization in the country of employment at the time of hire, and maintain ongoing work authorization during employment.

Experience in C/C++ and Python.

Experience with deep learning frameworks (PyTorch, TensorFlow, …).

Research and/or work experience in machine learning, deep learning, and/or speech technology.


Perform research to advance the science and technology of intelligent machines.

Develop novel and accurate speech algorithms and systems, leveraging deep learning and machine learning on big data resources.

Contribute research that can be applied to Meta product development.

Analyze and improve efficiency, scalability, and stability of various deployed systems.

Collaborate with team members from prototyping to production.

Additional Requirements

Intent to return to the degree program after the completion of the internship/co-op.

Experience manipulating and analyzing complex, high-volume, high-dimensionality data from varying sources.

Proven track record of achieving results, as demonstrated by grants, fellowships, and patents, as well as first-authored publications at workshops or conferences such as Interspeech, ICASSP, or similar.

A strong interest in theoretical and empirical research and in answering hard questions with research.

Interpersonal experience: cross-group and cross-culture collaboration.

Ability to stay current with the literature of a particular domain and to reproduce results if needed.

Experience training deep neural networks for key speech tasks such as speech recognition, speech synthesis, speech translation, speaker diarization, sentiment analysis, acoustic event recognition, scene understanding, wake word detection, etc.

Experience working with other modalities such as vision and text understanding is a plus.