Research Intern for Media Computing Group (Audio)

Date Posted: July 20, 2024

Number of Openings: 1-2

Location: Beijing

Group Introduction:

The Multimedia Computing Group at Microsoft Research Asia focuses on various technologies in real-time multimedia communication, encompassing low-level video and audio processing, compression, encoding/decoding, as well as high-level computer vision and speech processing and understanding. The group leverages artificial intelligence and other technologies to enhance user experience in real-time multimedia communication. With over 20 years of research and engineering expertise at Microsoft Research Asia, our technologies have been integrated into multiple product lines, including Office, Microsoft Teams, Surface, and Xbox. Additionally, our researchers have published numerous papers in leading conferences in these related fields, including CVPR, SIGGRAPH, ICCV, NIPS, AAAI, TPAMI, ACMMM, and more. Join the Multimedia Computing Group to research cutting-edge computer vision technologies, with goals of producing papers, participating in competitions, and developing algorithms to be implemented in Microsoft products.

Job Responsibilities:

Under the guidance of researchers, conduct pioneering research and experimental validation in the field of audio and speech, focusing on one of the following subfields:

  1. Audio signal processing in real-time audio and video communication, such as speech enhancement, echo cancellation, neural audio coding, packet loss concealment, etc.
  2. Audio and speech representation learning
  3. Sound landscape modeling, understanding and cross-modal technologies

Interns will participate in all steps of the research process, including data analysis, algorithm design, algorithm implementation, experimental research, and demonstrations.

Qualifications:

  1. Bachelor’s, Master’s, or PhD in Computer Science, Software Engineering, Electronic Engineering, or other related fields.
  2. Solid foundation in data structures and algorithms.
  3. Excellent programming skills or relevant project experience.
  4. Quick learning ability.
  5. Proficiency in using English as a working language.
  6. Strong communication skills and team collaboration spirit.
  7. Written consent from your academic advisor.

Internship Duration Requirements:

Must obtain permission from your academic advisor and commit to at least six months of internship.

Please be sure to download and complete the application form (Application form link: https://aka.ms/InternApplication ) and send it along with a complete English and Chinese resume (in PDF/Word format) to: MSRAih@microsoft.com & xipe@microsoft.com. Please include “Research Intern for Media Computing Group” in the email subject line.

 

 

岗位名称:多媒体计算组(音频方向)研究实习生

工作性质:全职实习生

招聘人数:1-2人

工作地点:北京

组别简介:

微软亚洲研究院多媒体计算组致力于多媒体实时通信中的各种技术,囊括底层的视频音频处理、压缩、编解码和上层的计算机视觉和语音处理与理解。利用人工智能等技术提高多媒体实时通信中的用户体验。在微软亚洲研究院有超过 20 年的研究与工程技术积累,技术转换到包括 Office, Microsoft Teams, Surface 和 XBox 等多条产品线。另外,研究员们在这些相关领域的顶会上发表过多篇论文,包括CVPR, SIGGRAPH, ICCV, NIPS, AAAI, TPAMI, ACMMM等。加入多媒体计算组,研究最前沿的计算机视觉技术,培养目标为论文、竞赛以及算法,并落地于微软产品。

工作职责:

在研究员的指导下,针对音频和语音方向开展前沿性研究、实验验证以及论文撰写,包括

  1. 实时音视频通信中的音频信号处理,比如语音去噪,回声消除,编解码,丢包补偿等
  2. 广义音频和语音的表征学习
  3. 音频场景的建模,理解以及跨模态技术

研究实习生将参与研究过程的所有步骤,包括数据分析、算法设计、算法实现、实验研究和演示。

任职要求:

  1. 计算机科学、软件工程、电子工程或其它相关专业(本科/硕士/博士)
  2. 扎实的数据结构/算法基础
  3. 具有优秀的编程能力或相关的项目经验
  4. 快速学习能力
  5. 可以熟练使用英文作为工作语言
  6. 具有良好的沟通能力和团队协作精神
  7. 能得到导师的书面同意

工作时间要求:

能获得导师许可并保证至少六个月的实习。

请务必下载并填写申请表(申请表链接:https://aka.ms/InternApplication )并将其与完整的中英文简历(PDF/Word形式)一同发送至:MSRAih@microsoft.com & xipe@microsoft.com,邮件标题中注明:多媒体计算组研究实习生