{"id":486770,"date":"2018-05-17T12:15:19","date_gmt":"2018-05-17T19:15:19","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-project&p=486770"},"modified":"2019-02-06T10:22:55","modified_gmt":"2019-02-06T18:22:55","slug":"dci-distributed-causal-inference","status":"publish","type":"msr-project","link":"https:\/\/www.microsoft.com\/en-us\/research\/project\/dci-distributed-causal-inference\/","title":{"rendered":"DCI – Distributed Causal Inference"},"content":{"rendered":"
The Distributed Causal Inference (DCI)<\/strong> project explores ways to improve distributed systems technology and data query & storage functionality to enable and support answering causal inference questions from very-large-scale longitudinal and observational datasets, with the long-term goal to make data-driven exploration of outcomes as fast and common place as \u201cweb search\u201d.<\/p>\n <\/p>\n Everyone, at some point in their lives, finds themselves in an unfamiliar situation, considering what they should do, and trying to understand what to expect of the future. The answers to these questions are not readily available in Wikipedia or other knowledge bases powering modern web search engines. Exploring expectations on the Internet plays an important role in people\u2019s planning, decision-making, and forecasting for both every day and extraordinary scenarios. The DCI (Distributed Causal Inference) project is focused on providing the runtime substrate to make such causal inference scenarios work on huge datasets (such as the Twitter corpus). Overview and Vision The Distributed Causal Inference (DCI) project explores ways to improve distributed systems technology and data query & storage functionality to enable and support answering causal inference questions from very-large-scale longitudinal and observational datasets, with the long-term goal to make data-driven exploration of outcomes as fast and common place as \u201cweb search\u201d.<\/p>\n","protected":false},"featured_media":0,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"research-area":[13556,13563,13555,13547],"msr-locale":[268875],"msr-impact-theme":[],"msr-pillar":[],"class_list":["post-486770","msr-project","type-msr-project","status-publish","hentry","msr-research-area-artificial-intelligence","msr-research-area-data-platform-analytics","msr-research-area-search-information-retrieval","msr-research-area-systems-and-networking","msr-locale-en_us","msr-archive-status-active"],"msr_project_start":"2017-07-01","related-publications":[498320],"related-downloads":[],"related-videos":[],"related-groups":[470706,144672,144927],"related-events":[],"related-opportunities":[],"related-posts":[564300],"related-articles":[],"tab-content":[],"slides":[],"related-researchers":[{"type":"user_nicename","display_name":"Emre Kiciman","user_id":31739,"people_section":"Section name 1","alias":"emrek"}],"msr_research_lab":[199565],"msr_impact_theme":[],"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/486770"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/msr-project"}],"version-history":[{"count":3,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/486770\/revisions"}],"predecessor-version":[{"id":486779,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-project\/486770\/revisions\/486779"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=486770"}],"wp:term":[{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=486770"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=486770"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=486770"},{"taxonomy":"msr-pillar","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-pillar?post=486770"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}More Details<\/h2>\n
\nWe see such expectation exploration questions show up in web search queries, with people exploring possible consequences of their choices and the outcomes of situations.
\nThese explorations cover both consequential topics, such as life-changing education and career choices (e.g., \u201cShould I join the military?\u201d) or major financial and personal decisions (e.g., \u201cShould I move to California?\u201d) as well as more quotidian topics, such as the consequences of purchase decisions, athletic training regimens and dating rituals.<\/p>\n
\nBut, the information necessary to answer these questions is already being recorded on social media platforms such as Twitter, where hundreds of millions of individuals regularly and publicly report their personal experiences, including the situations they are in, the actions they take, and the experiences they have afterwards.<\/p>\n
\nThese explorations encompass a broad variety of tasks, including explorations of hypothetical, ongoing or past problems, or seeking informational support, emotional satisfaction, or preparation for a future event. In particular, decision-making processes about future unknowns depend critically on such information gathering (especially in unfamiliar situations) where the web augments more conventional information sources such as professional and friends\u2019 advice, training, etc.
\nAdvice-related searches were measured to make up around 2-5% of web search tasks in 2004, and even in pregnancy (a scenario with dedicated information infrastructures, related health professionals and care programs) over 80% of women used web search to help make decisions.<\/p>\nProject Goals<\/h2>\n
\nThis requires a carefully curated and constructed combination of technologies spanning distributed systems, databases, machine learning and computational statistics.<\/p>\n","protected":false},"excerpt":{"rendered":"