Meta AR/VR Job | Research Scientist Intern, Audio Machine Learning and Computer Vision (PhD)

Job(岗位): Research Scientist Intern, Audio Machine Learning and Computer Vision (PhD)

Type(岗位类型): Computer Vision | Engineering, Machine Learning, Research

Citys(岗位城市): Redmond, WA

Date(发布日期): 2021-12-15


The Facebook Reality Labs (FRL) Research Team brings together a world-class team of researchers, d()evelopers, and engineers to create the future of virtual and augmented reality, which together will become as universal and essential as smartphones and personal computers are today. And just as personal computers have done over the past 45 years, AR and VR will ultimately change everything about how we work, play, and connect. We are developing all the technologies needed to enable breakthrough AR glasses and VR headsets, including optics and displays, computer vision, audio, graphics, brain-computer interfaces, haptic interaction, eye/hand/face/body tracking, perception science, and true telepresence. Some of those will advance much faster than others, but they all need to happen to enable AR and VR that are so compelling that they become an integral part of our lives.

In particular, the FRL Research audio team is focused on two goals; creating virtual sounds that are perceptually indistinguishable from reality, and redefining human hearing. See more about our work here ( These two initiatives will allow us to connect people by allowing them to feel together despite being physically apart, and allow them to converse in even the most difficult listening environments.

FRL Research Redmond is looking for experienced interns who are passionate about ground breaking research in audio signal processing, machine learning and audio visual learning to solve important audio-driven problems for AR/VR applications. We currently have multiple open positions for a range of projects in multimodal representation learning, audio visual scene analysis, egocentric audio visual learning, multi-sensory speech enhancement and acoustic activity localization. Come join us as we make AR and VR happen!


Currently has, or is in the process of obtaining, a PhD degree in the field of Computer Science, Artificial Intelligence, Signal Processing, Machine learning, Computer vision, Electrical Engineering, Applied Math, Acoustics Engineering or a related STEM field.

3+ years experience with machine learning software platforms such as PyTorch, TensorFlow, etc.

2+ years experience building novel computational models in audio or audio-visual or speech application domains using machine learning or signal processing.

Must obtain work authorization in country of employment at the time of hire, and maintain ongoing work authorization during employment.


Research, model, design, develop and test novel audio-visual/speech algorithms using machine learning, signal processing, and computer vision.

Collaborate with researchers and engineers across diverse disciplines.

Communication of research agenda, progress and results.

Additional Requirements(额外要求)

Demonstrated software engineer experience via an internship, work experience, coding competitions, or widely used contributions in open source repositories (e.g. Github).

Strong background in statistical modeling techniques and signal processing.

Interpersonal skills: cross-group and cross-culture collaboration.

Proven track record of achieving results as demonstrated in accepted papers at top computer vision and machine learning related conferences such as CVPR, ECCV, NIPS, ICASSP, InterSpeech etc.

Intent to return to degree-program after the completion of the internship/co-op.