{"id":486827,"date":"2019-01-14T09:35:06","date_gmt":"2019-01-14T17:35:06","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=486827"},"modified":"2023-03-29T19:11:44","modified_gmt":"2023-03-30T02:11:44","slug":"newsqa-dataset","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/newsqa-dataset\/","title":{"rendered":"NewsQA Dataset"},"content":{"rendered":"

With massive volumes of written text being produced every second, how do we make sure that we have the most recent and relevant information available to us? Microsoft research Montreal is tackling this problem by building AI systems that can read and comprehend large volumes of complex text in real-time.<\/p>\n

The purpose of the NewsQA dataset is to help the research community build algorithms that are capable of answering questions requiring human-level comprehension and reasoning skills.<\/p>\n

Leveraging CNN articles from the DeepMind Q&A Dataset, we prepared a crowd-sourced machine reading comprehension dataset of 120K Q&A pairs.<\/p>\n