{"id":5145,"date":"2017-02-03T10:35:55","date_gmt":"2017-02-03T18:35:55","guid":{"rendered":"https:\/\/blogs.msdn.microsoft.com\/translation\/?p=5145"},"modified":"2017-02-03T10:35:55","modified_gmt":"2017-02-03T18:35:55","slug":"microsoft-translator-publicly-releases-speech-translation-corpus","status":"publish","type":"post","link":"https://www.microsoft.com\/en-us\/translator/blog\/2017\/02\/03\/microsoft-translator-publicly-releases-speech-translation-corpus\/","title":{"rendered":"Microsoft Translator publicly releases speech translation corpus"},"content":{"rendered":"\n\n\n\n
\"christian_federmann\"<\/a><\/span><\/td>\n<\/tr>\n
Christian Federmann, senior program manager<\/span><\/td>\n<\/tr>\n<\/tbody>\n<\/table>\n

As part of an ongoing effort within Microsoft to improve the accuracy of artificial intelligence (AI) systems, Microsoft Translator is publicly releasing a set of data that includes multiple conversations between bilingual speakers who are speaking French, German and English.<\/span><\/p>\n

This corpus, which was produced by Microsoft using bilingual speakers, aims to create a standard by which people can measure how well their conversational speech translation systems work. It can serve as a standardized data set for testing bilingual conversational speech translation systems such as the <\/span>Microsoft Translator live feature<\/span><\/a> and <\/span>Skype Translator<\/span><\/a>.<\/span><\/p>\n

Christian Federmann<\/span><\/a>, a senior program manager working with the Microsoft Translator team, said there aren\u2019t as many standardized data sets for testing bilingual conversational speech translation systems. \u201cYou need high-quality data in order to have high-quality testing,\u201d Federmann said.<\/span><\/p>\n

The Microsoft team hopes the corpus, which is freely available, will benefit the entire field of conversational translation and help to create more standardized benchmarks that researchers can use to measure their work against others.<\/span><\/p>\n

\u201cThis helps propel the field forward,\u201d said <\/span>Will Lewis<\/span><\/a>, a principal technical program manager with the Microsoft Translator team who also worked on the project.<\/span><\/p>\n

Download the Microsoft Speech Language Translation corpus <\/span>here<\/span><\/a>.<\/span><\/p>\n

Learn more about this release as well as other ways Microsoft is working to make AI smarter and more accurate in the <\/span>Microsoft Research blog<\/span><\/a>.<\/span><\/p>\n

\u00a0<\/span><\/p>\n

Learn More<\/span><\/p>\n