https:\/\/microsoft.sharepoint.com\/teams\/Data2Text\/DocLib1\/Data2Text%20Introduction.mp4<\/a><\/video><\/div>\nThe Data2Text (or Data-to-Text) project aims to automatically generate fluent and fact-based descriptions or utterances given data tables. Typical business applications for text generation include the generation of financial and sports news stories, the generation of product descriptions, the analysis and interpretation of business data, and the analysis and interpretation of Internet of Things data, etc. See below for a few Data2Text applications.<\/p>\n
Product Description Generation <\/p>\n
Writing Assistant <\/p>\n
Fact-based QA <\/p>\n
Fact-based Conversation <\/p>\n
Analytic Narrative Generation <\/p>\n
The mainstream methods of data-to-text generation include rule-based, template-based approaches and neural network-based approaches. Rule-based and template-based approaches are the mainstream approaches in the relevant applications, as they are clearly interpretable and controllable, making it easier to ensure the correctness of the generated text contents. However, how to create rules and extract high-quality templates require labor-intensive manual feature engineering. On the contrary, the neural network-based models are mainly data-driven, do not need too much human intervention, and can easily produce rich and smooth text description. However, users often can not directly manipulate the content generation and it is difficult to ensure that generated texts are faithful to their input data. <\/p>\n
Fact-based Data-to-Text Generation <\/p>\n
The Data2Text project aims to develop automated high-fidelity data-to-text generation technologies to address the shortcomings of template-based and the neural network-based approaches.<\/p>\n","protected":false},"excerpt":{"rendered":"
The Data2Text project aims to automatically generate fluent and fact-based descriptions or utterances given a data table. Typical business applications for text generation include the generation of financial and sports news stories, the generation of product descriptions, the analysis and interpretation of business data, and the analysis and interpretation of Internet of Things data, etc. Figure 1 gives an example of the automatic generation of weather forecasts. Figure 1a is a structured weather data collected by various sensors, the machine will be figure 1a data as input, output figure 1b weather forecast.<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13556,13563,13545],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-717085","msr-project","type-msr-project","status-publish","hentry","msr-research-area-artificial-intelligence","msr-research-area-data-platform-analytics","msr-research-area-human-language-technologies","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"2016-09-01","related-publications":[714370,714388,714415,714688,714694,714700,714730],"related-downloads":[],"related-videos":[],"related-groups":[],"related-events":[],"related-opportunities":[],"related-posts":[],"related-articles":[],"tab-content":[{"id":0,"name":"Articles","content":"
\r\n \t- Summary or recent data-to-text generation papers<\/a> (in Chinese, 2019-02-26, \"\u6570\u636e\u5230\u6587\u672c\u751f\u6210\u7684\u8fd1\u671f\u4f18\u8d28\u8bba\u6587\u89e3\u8bfb\")<\/a>.<\/li>\r\n \t
- Learning to write data-based articles automatically<\/a> (in Chinese, 2017-04-21, \"\u5982\u4f55\u8ba9\u4eba\u5de5\u667a\u80fd\u5b66\u4f1a\u7528\u6570\u636e\u8bf4\u8bdd\")<\/a>.<\/li>\r\n<\/ol>"}],"slides":[],"related-researchers":[{"type":"user_nicename","display_name":"Chin-Yew Lin","user_id":31493,"people_section":"Section name 0","alias":"cyl"}],"msr_research_lab":[199560],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/717085"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":19,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/717085\/revisions"}],"predecessor-version":[{"id":717541,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/717085\/revisions\/717541"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=717085"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=717085"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=717085"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=717085"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=717085"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}