{"id":959091,"date":"2023-08-10T09:00:00","date_gmt":"2023-08-10T16:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=959091"},"modified":"2023-12-19T08:30:09","modified_gmt":"2023-12-19T16:30:09","slug":"microsoft-at-kdd-2023-advancing-health-at-the-speed-of-ai","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-at-kdd-2023-advancing-health-at-the-speed-of-ai\/","title":{"rendered":"Microsoft at KDD 2023: Advancing health at the speed of AI"},"content":{"rendered":"\n
This content was given as a keynote at the Workshop of Applied Data Science for Healthcare and covered during a tutorial at the <\/em><\/strong>29th<\/sup> ACM SIGKDD Conference on Knowledge Discovery and Data Mining<\/em><\/strong> (opens in new tab)<\/span><\/a>, a premier forum for advancement, education, and adoption of the discipline of knowledge discovering and data mining.<\/em><\/strong><\/p>\n\n\n\n Recent and noteworthy advancements in generative AI and large language models (LLMs) are leading to profound transformations in various domains. This blog explores how these breakthroughs can accelerate progress in precision health. In addition to the keynote I delivered, “Applications and New Fronters of Generative Models for Healthcare<\/a>,” it includes part of a tutorial<\/a> (LS-21) being given at KDD 2023<\/a>. This tutorial surveys the broader research area of \u201cPrecision Health at the Age of Large Language Models,\u201d delivered by Sheng Zhang<\/a>, Javier Gonz\u00e1lez Hern\u00e1ndez<\/a>, Tristan Naumann<\/a>, and myself. <\/p>\n\n\n\n A longstanding objective within precision health is the development of a continuous learning system capable of seamlessly integrating novel information to enhance healthcare delivery and expedite advancements in biomedicine. The National Academy of Medicine has gathered leading experts to explore this key initiative, as documented in its Learning Health System (opens in new tab)<\/span><\/a> series. However, the current state of health systems is far removed from this ideal. The burden of extensive unstructured data and labor-intensive manual processing hinder progress. This is evident, for instance, in the context of cancer treatment, where the traditional standard of care frequently falls short, leaving clinical trials as a last resort. Yet a lack of awareness renders these trials inaccessible, with only 3 percent of US patients finding a suitable trial. This enrollment deficiency contributes to nearly 40 percent of trial failures, as shown in Figure 1. Consequently, the process of drug discovery is exceedingly slow, demanding billions of dollars and a timeline of over a decade.<\/p>\n\n\n\n On an encouraging note, advances in generative AI provide unparalleled opportunities in harnessing real-world observational data to improve patient care\u2014a long-standing goal in the realm of real-world evidence (RWE), which the US Food and Drug Administration (FDA) relies on to monitor and evaluate post-market drug safety (opens in new tab)<\/span><\/a>. Large language models (LLMs) like GPT-4 have the capability of \u201cuniversal structuring,\u201d enabling efficient abstraction of patient information from clinical text at a large scale. This potential can be likened to the transformative impact LLMs are currently making in other domains, such as software development and productivity tools.<\/p>\n\n\n\n<\/figure>\n\n\n\n
\n\t\t