The AT&T WATSON Speech Recognizer.

V. Goffin; C. Allauzen; E. Bocchieri; D. Hakkani-Tur; A. Ljolje; Sarangarajan Parthasarathy; M. Rahim; G. Riccardi; M. Saraclar

The AT&T WATSON Speech Recognizer.

V. Goffin ,
C. Allauzen ,
E. Bocchieri ,
D. Hakkani-Tur ,
A. Ljolje ,
Sarangarajan Parthasarathy ,
M. Rahim ,
G. Riccardi ,
M. Saraclar

ICASSP 2005 | March 2005

Download BibTex

This paper describes the AT&T WATSON real-time speech recognizer, the product of several decades of research at AT&T. The recognizer handles a wide range of vocabulary sizes and is based on continuous-density hidden Markov models for acoustic modeling and finite state networks for language modeling. The recognition network is optimized for efficient search. We identify the algorithms used for high-accuracy, real-time and low-latency recognition. We present results for small and large vocabulary tasks taken from the AT&T VoiceTone service, showing word accuracy improvement of about 5% absolute and real-time processing speed-up by a factor between 2 and 3.