{"id":169745,"date":"2004-01-29T16:47:15","date_gmt":"2004-01-29T16:47:15","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/project\/personalized-language-model-for-improved-accuracy\/"},"modified":"2019-08-19T09:36:42","modified_gmt":"2019-08-19T16:36:42","slug":"personalized-language-model-for-improved-accuracy","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/personalized-language-model-for-improved-accuracy\/","title":{"rendered":"Personalized Language Model for Improved Accuracy"},"content":{"rendered":"
Traditionally speech recognition systems are built with models that are an average of many different users. A speaker-independent model is provided that works reasonably well for a large percentage of users. But the accuracy can be improved if the acoustic model is personalized to the given user. We have built a service that constantly looks at the user’s sent emails to personalize the language model and we’ve observed a 30% reduction in error rate for the text dictated in the body of emails.<\/p>\n
Traditionally speech recognition systems are built with models that are an average of many different users. A speaker-independent model is provided that works reasonably well for a large percentage of users. But the accuracy can be improved if the acoustic model is personalized to the given user, i.e. if the system learns the voice characteristics of the user, and this is often done in dictation systems as part of an “enrollment phase” that typically lasts at least 10 minutes. We would like to also adapt the language model to the user but a large number of sentences written by the user is required for the error decrease to be significant. We have built a service that constantly looks at the user’s sent emails to personalize the language model and weve observed a 30% reduction in error rate for the text dictated in the body of emails.<\/p>\n