Downloads
Impact of Controlled Language on Machine-Translation Quality and Post-Editing Efforts
September 2007
Results from experiments conducted by Microsoft Research’s Machine Translation Incubation Team to investigate the impact of using good English (controlled language) on post-editing productivity—as well as on the overall quality of our statistical machine-translation system.
Size : 334104
Microsoft Research IME Corpus
December 2005
This download consists of data only: it provides a test data set for the task of Japanese character conversion for text input. The data set consists of: (1) reference files, which consist of Japanese sentences that are randomly extracted from…
Size : 4495451
Powergrading Short Answer Grading Corpus
October 2013
This corpus contains the original data analyzed in the following paper: Basu, Jacobs, and Vanderwende, “Powergrading: a Clustering Approach to Amplify Human Effort for Short Answer Grading,” Transactions of the ACL, 2013. It consists of responses from 100 + 698…
Search-based Neural Structured Learning for Sequential Question Answering
August 2018
This project contains the source code of the Dynamic Neural Semantic Parser (DynSP), based on DyNet, described in the paper paper “Search-based Neural Structured Learning for Sequential Question Answering”.