Projects
- Language Understanding: Don’t just recognize the words a user spoke, but understand what they mean.
- Noise Robustness: How do we make the system work when background noise is present?
- Voice Search: Users can search for information, such as a business, from their phone.
- Automatic Grammar Induction: How do we create grammars to ease the development of spoken language systems?
- (MiPad) Multimodal Interactive Pad: Our first multimodal prototype.
- SALT (Speech Enabled Language Tags): A markup language for the multimodal web.
- From Captions to Visual Concepts and Back: Image captioning and understanding.
- Intent Understanding: Don’t just recognize the words the user says, but understand what they mean.
- Multimodal Conversational User Interface
- Personalized Language Model for improved accuracy
- Recurrent Neural Networks for Language Processing
- Speech Technology for Computational Phonetics and Reading Assessment
- (Whisper) Speech Recognition: Our previous dictation-oriented speech recognition project is a state-of-the-art general-purpose speech recognizer.
- (WhisperID) Speaker Identification: Who is doing the talking?
- Speech Application Programming Interface (SAPI) Development Toolkit: Developers can use the Whisper speech recognizer to build applications that use speech recognition.