{"id":664548,"date":"2020-07-20T09:54:40","date_gmt":"2020-07-20T16:54:40","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-group&p=664548"},"modified":"2023-07-26T10:10:50","modified_gmt":"2023-07-26T17:10:50","slug":"cognitive-services-research","status":"publish","type":"msr-group","link":"https:\/\/www.microsoft.com\/en-us\/research\/group\/cognitive-services-research\/","title":{"rendered":"Azure Cognitive Services Research"},"content":{"rendered":"
\n\t
\n\t\t
\n\t\t\t\"Microsoft\t\t<\/div>\n\t\t\n\t\t
\n\t\t\t\n\t\t\t
\n\t\t\t\t\n\t\t\t\t
\n\t\t\t\t\t\n\t\t\t\t\t
\n\t\t\t\t\t\t
\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t\n\n

Azure Cognitive Services Research<\/h1>\n\n\t\t\t\t\t\t<\/div>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t<\/div>\n\t\t<\/div>\n\t<\/div>\n<\/section>\n\n\n\n\n\n

The mission of the Azure Cognitive Services Research group (CSR) is to make fundamental contributions to advancing the state of the art of the most challenging problems in speech, language, and vision\u2014both within Microsoft and the external research community. The CSR includes Computer Vision<\/a>, Knowledge and Language<\/a>, and Speech<\/a> teams.<\/p>\n\n\n\n

We conduct cutting edge research in all aspects of spoken language processing and computer vision. This includes audio-visual fusion; visual-semantic reasoning; federated learning; speech recognition; speech enhancement; speaker recognition and diarization; machine reading comprehension; text summarization; multilingual language modeling; and related topics in natural language processing, understanding, and generation; as well as face forgery detection; object detection and segmentation; dense pose, head, and mask tracking, action recognition; image and video captioning; and other topics in image and real-time video understanding. We leverage large-scale GPU and CPU clusters as well as internal and public data sets to develop world-leading deep learning technologies for forward-looking topics such as audio-visual far-field meeting transcription, automatic meeting minutes generation, and multi-modal dialog systems. We publish our research on public benchmarks, such as our breakthrough human parity performances on the Switchboard conversational speech recognition task<\/a>, CommonenseQA<\/a> and Stanford\u2019s Conversational Question Answering Challenge<\/a> (CoQA).<\/p>\n\n\n\n

In addition to expanding our scientific understanding of speech, language, and vision, our work finds outlets in Microsoft products such as Azure Cognitive Services (opens in new tab)<\/span><\/a>, HoloLens, Teams, Windows, Office, Bing, Cortana, Skype Translator, Xbox, and more.<\/p>\n\n\n\n

The Azure Cognitive Services Research group is managed by Michael Zeng<\/a>.<\/p>\n\n\n\n\n\n

The Knowledge and Language Team<\/a> is part of the Azure AI Cognitive Services Research (CSR) group, focusing on cutting edge research and the development of the next generation framework for knowledge and natural language processing.<\/p>\n\n\n\n

We are working on problems including knowledge-boosted language modeling, knowledge extraction, knowledge graph, summarization, language understanding and generation. We conduct large-scale pre-training and domain-specific fine-tuning on internal and public data sets to develop state-of-the-art deep learning technologies for core knowledge and language problems in various real applications.<\/p>\n\n\n\n

Our work has resulted in multiple publications in top NLP conferences and first place submissions to the CommonsenseQA and FEVER leaderboards.<\/p>\n\n\n\n

Our recent work covers:<\/p>\n\n\n\n