Audio and acoustics

Publication

Hearable devices with sound bubbles

Tuochao Chen, Malek Itani, Sefik Emre Eskimez, Takuya Yoshioka, Shyamnath Gollakota

Nature Electronics | November 2024, pp. 1-12

Publication

Whispering Wearables: Multimodal Approach to Silent Speech Recognition with Head-Worn Devices

Tanmay Srivastava, R. Michael Winters, Yu-Te Wang, Thomas M. Gable, Teresa LaScala, Ivan Tashev

International Conference on Multimodal Interaction | November 2024

Career Opportunity

Research Intern – Brain-Computer Interfaces

Posted: November 3, 2024

Location: Redmond, WA, US

Research Area(s): Audio and Acoustics, Human-computer interaction, Medical, health and genomics

The Brain-Computer Interfaces (BCI) project in Microsoft Research aims to enable BCI for the general population. This means non-intrusive methods, fewer electrodes, and custom-designed signal picking devices. We are moving toward interactive BCI, which…

Career Opportunity

Research Intern – Audio and Acoustics

Posted: November 3, 2024

Location: Redmond, WA, US

Research Area(s): Artificial intelligence, Audio and Acoustics

The Audio and Acoustics Research Group has several openings in the areas of generative audio, artificial intelligence (AI) for audio, speech enhancement, spatial audio, and audio devices for communication and interaction. The group actively publishes…

Career Opportunity

Research Intern – Applied Sciences Group (Audio/Vision/NLP/Multimodal)

Posted: October 28, 2024

Location: Redmond, WA, US

Research Area(s): Audio and Acoustics, Computer vision, Human language technologies

The Microsoft Applied Sciences Group incubates disruptive technologies for Microsoft’s next-gen Windows and Surface products. Operating as a startup within the company, this team works closely with several research and product teams to bring compelling…

Microsoft Research Blog

Research Focus: Week of October 7, 2024

October 9, 2024

Simplifying secure decision tree training; Improving accuracy of audio content detection; A novel neurosymbolic system for converting text to tables; New video series: AI for Business Transformation; TEE security protections for container workloads.

Group

Interactive Multimodal AI Systems (IMAIS)

The Interactive Multimodal AI Systems group focuses on creating interactive systems and experiences that blend the richness and complexity of people and their real, physical world with advanced technology. We seek to leverage multimodal generative AI…

Microsoft Research Blog

Research Focus: Week of September 9, 2024

September 12, 2024 | Sara Abdali, Sefik Emre Eskimez, Xiaofei Wang, Manthan Thakker, Jinyu Li, Sheng Zhao, Naoyuki Kanda, Carmen Badea, Christian Bird, Tom Zimmermann, Rob DeLine, Nicole Forsgren, Denae Ford Robinson, Xenofon Foukas

Investigating vulnerabilities in LLMs; A novel total-duration-aware (TDA) duration model for text-to-speech (TTS); Generative expert metric system through iterative prompt priming; Integrity protection in 5G fronthaul networks.