Belief Updating in Spoken Language Interfaces

Over the last decade, advances in natural language processing technologies have paved the way for the emergence of complex spoken language interfaces. A persistent and important problem in the development of these systems is their lack of robustness when confronted with understanding-errors. The problem stems mostly from the unreliability of current speech recognition technology, and is present across all domains and interaction types. My research addresses this problem by: (1) endowing spoken language interfaces with better error awareness, (2) constructing and evaluating a rich repertoire of error recovery strategies, and (3) developing data-driven, adaptive approaches for making error handling decisions.

In this talk, I focus on the first of these problems: error awareness.
Traditionally, spoken dialog systems rely on recognition confidence scores and simple heuristics to guard against potential misunderstandings. While confidence scores can provide an initial reliability assessment, ideally a system should leverage information from subsequent user turns in the conversation to continuously update and improve the accuracy of its beliefs.

I describe a scalable data-driven solution for this belief updating problem. The proposed approach relies on a compressed concept-level representation of beliefs and casts the belief updating problem as a multinomial regression task. Experimental results indicate that the constructed belief updating models significantly outperform typical heuristic rules used in current systems. Furthermore, a user study with a deployed mixed-initiative spoken dialog system shows that the proposed approach leads to large improvements in both the effectiveness and the efficiency of the interaction across a wide range of recognition error rates.

Speaker Details

Dan Bohus is a Ph.D. student working under the supervision of Dr. Alex Rudnicky and Prof. Roni Rosenfeld in the Computer Science Department at Carnegie Mellon University. He has received a B.S. in Computer Science from “Politehnica” University of Timisoara, Romania.Dan’s dissertation research is focused on developing adaptive techniques for error detection and recovery in spoken language interfaces. In previous work, Dan has developed RavenClaw – a task-independent dialog management framework that serves as a research platform in a number of projects and has been used to construct and successfully deploy several dialog systems spanning different domains and interaction types (www.ravenclaw-olympus.org). Dan is a founding member of the ‘Dialogs on Dialogs’ student reading group at CMU (www.cs.cmu.edu/~dod). Together with other student members of this group he organized the “1st Young Researchers’ Roundtable on Spoken Dialog Systems” held in conjunction with SIGdial/Interspeech-2005.

Date:
Speakers:
Dan Bohus
Affiliation:
Carnegie Mellon University