Online Supervised Learning of Non-understanding Recovery Policies
- Dan Bohus ,
- Brian Langner ,
- Antoine Raux ,
- Alan Black ,
- Maxine Eskenazi ,
- Alex Rudnicky
IEEE/ACL 2006 Workshop on Spoken Language Technology |
Spoken dialog systems typically use a limited number of nonunderstanding recovery strategies and simple heuristic policies to engage them (e.g. first ask user to repeat, then give help, then transfer to an operator). We propose a supervised, online method for learning a non-understanding recovery policy over a large set of recovery strategies. The approach consists of two steps: first, we construct runtime estimates for the likelihood of success of each recovery strategy, and then we use these estimates to construct a policy. An experiment with a publicly available spoken dialog system shows that the learned policy produced a 12.5% relative improvement in the non-understanding recovery rate.