{"id":1039806,"date":"2024-05-29T09:00:00","date_gmt":"2024-05-29T16:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/research-focus-week-of-may-27-2024\/"},"modified":"2024-07-18T07:53:27","modified_gmt":"2024-07-18T14:53:27","slug":"research-focus-week-of-may-27-2024","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/research-focus-week-of-may-27-2024\/","title":{"rendered":"Research Focus: Week of May 27, 2024"},"content":{"rendered":"\n

Welcome to Research Focus, a series of blog posts that highlights notable publications, events, code\/datasets, new hires and other milestones from across the research community at Microsoft.<\/em><\/p><\/blockquote><\/figure>\n\n\n\n

\"Research<\/figure>\n\n\n\n

EVENT<\/h3>\n\n\n\n

Register now for Research Forum on June 4<\/h2>\n\n\n\n

Join us for Research Forum (opens in new tab)<\/span><\/a>, an event series that explores recent research advances, bold new ideas, and important discussions with the global research community in the era of general AI. <\/p>\n\n\n\n

In Episode 3, researchers at Microsoft emphasize the importance of globally equitable AI, and will share novel use cases, transformative applications from industry to material design, and provide updates on AutoGen and MatterGen. <\/p>\n\n\n\n

Your registration includes access to our live chat with researchers on the event day.\u00a0<\/p>\n\n\n\n

Episode 3 will air Tuesday, June 4 at 9:00 AM PT.<\/p>\n\n\n\n

\n
Register Now<\/a><\/div>\n<\/div>\n\n\n\n

NEW RESEARCH<\/h3>\n\n\n\n

Generative AI and the Politics of Visibility<\/h2>\n\n\n\n

Generative AI tools have a remarkable capacity to produce complicated and lengthy texts, with just simple direction from users. AI proponents assert they can help writers, providing creative suggestions, completing half-written sentences or story fragments, and inventing character backstories. But this raises questions about the politics of visibility: what kinds of stories do these tools tend to generate, and what do they generally leave out? Do these tools fully represent diverse or marginalized populations and non-normative communities?<\/p>\n\n\n\n

In a recent paper: Generative AI and the Politics of Visibility<\/a>, a researcher from Microsoft tested three widely available generative AI tools (Bing Chat, ChatGPT, and Google\u2019s Bard, now Gemini) with prompts designed to reveal their normative assumptions, prompting the tools multiple times with each to track the diversity of the outputs to the same query. His research demonstrates that, at least as currently designed and trained, generative AI tools tend to reproduce normative identities and narratives, rarely representing less common arrangements and perspectives unless specifically prompted. When they do generate variety, it is often narrow, maintaining deeper normative assumptions in what remains absent.<\/p>\n\n\n\n

\n
Read the paper<\/a><\/div>\n<\/div>\n\n\n\n
\n\n\n\n\t
\n\t\t\n\n\t\t

\n\t\tMicrosoft research podcast<\/span>\n\t<\/p>\n\t\n\t

\n\t\t\t\t\t\t
\n\t\t\t\t\n\t\t\t\t\t\"photo\n\t\t\t\t<\/a>\n\t\t\t<\/div>\n\t\t\t\n\t\t\t
\n\n\t\t\t\t\t\t\t\t\t

What\u2019s Your Story: Lex Story<\/h2>\n\t\t\t\t\n\t\t\t\t\t\t\t\t

Model maker and fabricator Lex Story helps bring research to life through prototyping. He discusses his take on failure; the encouragement and advice that has supported his pursuit of art and science; and the sabbatical that might inspire his next career move.<\/p>\n\t\t\t\t\n\t\t\t\t\t\t\t\t

\n\t\t\t\t\t
\n\t\t\t\t\t\t\n\t\t\t\t\t\t\tListen now\t\t\t\t\t\t<\/a>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<\/div>\n\t<\/div>\n\t<\/div>\n\t\n\n\n

NEW RESEARCH<\/h3>\n\n\n\n

ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge<\/h2>\n\n\n\n

Videoconferencing has become indispensable for everything from global business operations to accessible education, transforming the way people communicate across physical barriers and geographical divides. The quality of experience (QoE) delivered by video conferencing systems depends in part on correctly estimating the capacity of the bottleneck link between the sender and the receiver over time. Bandwidth estimation for real-time communications (RTC) remains a significant challenge, primarily due to the continuously evolving heterogeneous network architectures and technologies. From the first bandwidth estimation challenge hosted by Microsoft at ACM MMSys 2021<\/a>, researchers learned that bandwidth estimation models trained with reinforcement learning (RL) in simulations to maximize network-based reward functions may not be optimal, due to the sim-to-real gap and the difficulty of aligning network-based rewards with user-perceived QoE. In this year\u2019s ACM MMSys 2024 Bandwidth Estimation in Real Time Communications Challenge<\/a>, researchers from Microsoft aim to align reward maximization with user-perceived QoE optimization using offline RL and a real-world dataset released by Microsoft Teams. The challenge received enthusiastic participation from both academia and industry. All models submitted to the grand challenge underwent initial evaluation, and top models were further evaluated on a geographically distributed testbed. Challenge results show that by leveraging real-world data and integrating objective audio\/video quality scores as rewards, offline RL can facilitate the development of competitive bandwidth estimators for RTC.<\/p>\n\n\n\n

\n
Read the paper<\/a><\/div>\n<\/div>\n\n\n\n
\n\n\n\n

NEW RESEARCH<\/h3>\n\n\n\n

Player-Driven Emergence in LLM-Driven Game Narrative<\/h2>\n\n\n\n

Game creation is a labor-intensive process, with limited automation of non-graphic game elements related to dialogue and narrative structure. These elements are typically hand-coded and rigidly deterministic, with few options presented to the player. Large language models (LLMs) are beginning to show potential in the creation of richer and more expansive narrative spaces. <\/p>\n\n\n\n

In a recent paper: Player-Driven Emergence in LLM-Driven Game Narrative<\/a>, accepted for presentation at the IEEE Conference on Games 2024, researchers from Microsoft in collaboration with members of the Xbox organization explore how interaction with LLMs can empower players to participate in the evolution of game narratives. As a testbed, they created a text-adventure game in which players attempt to solve a mystery under a fixed narrative premise but can freely interact with non-player characters generated by GPT-4, a state-of-the-art LLM. They recruited 28 gamers to play the game and used GPT-4 to automatically convert the game logs into a node-graph representing the narrative in the player\u2019s gameplay. Through their interactions with the non-deterministic behavior of the LLM, players were able to discover interesting new emergent nodes that were not a part of the original narrative but have potential for being fun and engaging. Players that created the most emergent nodes tended to be those that often enjoy games that facilitate discovery, exploration and experimentation.<\/p>\n\n\n\n

\n
Read the paper<\/a><\/div>\n<\/div>\n\n\n\n
\n\n\n\n

NEW RESEARCH<\/h3>\n\n\n\n

Segmentation using large language models: A new typology of American neighborhoods<\/h2>\n\n\n\n

The U.S. Census Bureau\u2019s American Community Survey (ACS) is the country\u2019s primary source of social and economic data. But much of the data is low quality, especially at the highest levels of geographic detail (Block Groups). As one zooms in geographically on a map, the resolution of social and economic data decreases, which is counterintuitive. Typically, zooming in generates more detail, not less. Recent changes in the U.S. statistical system have amplified this geographic-demographic resolution trade-off.<\/p>\n\n\n\n

In a recent paper: Segmentation using large language models: A new typology of American neighborhoods<\/a>, researchers from Microsoft present a solution to this problem in the form of an AI-based open and reproducible geodemographic classification system using small area estimates from the ACS. They employ a partitioning clustering algorithm to a range of socio-economic, demographic, and built environment variables. Using an open-source software pipeline ensures adaptability to future data updates. One key innovation is the integration of GPT-4, to generate intuitive cluster descriptions and names. This represents a novel application of natural language processing in geodemographic research and showcases the potential for human-AI collaboration within the geospatial domain.<\/p>\n\n\n\n

\n
Read the paper<\/a><\/div>\n<\/div>\n\n\n\n
\n\n\n\n

NEW RESEARCH<\/h3>\n\n\n\n

From Local to Global: A Graph RAG Approach to Query-Focused Summarization <\/h2>\n\n\n\n

The use of retrieval-augmented generation (RAG) to retrieve relevant information from an external knowledge source enables LLMs to answer questions over private and\/or previously unseen document collections. However, RAG fails on global questions directed at an entire text corpus, such as: \u201cWhat are the main themes in the dataset?\u201d, since this is inherently a query-focused summarization (QFS) task, rather than an explicit retrieval task. Prior QFS methods fail to scale to the quantities of text indexed by typical RAG systems.<\/p>\n\n\n\n

In a recent preprint: From Local to Global: A Graph RAG Approach to Query-Focused Summarization<\/a>, researchers from Microsoft propose combining the strengths of these contrasting methods through a Graph RAG approach to question answering over private text corpora that scales with both the generality of user questions and the quantity of source text to be indexed. This approach uses an LLM to build a graph-based text index in two stages: first to derive an entity knowledge graph from the source documents, then to pre-generate community summaries for all groups of closely-related entities. Given a question, each community summary is used to generate a partial response, before all partial responses are again summarized in a final response to the user. For a class of global sensemaking questions over datasets in the 1 million token range, Graph RAG leads to substantial improvements over a na\u00efve RAG baseline for both the comprehensiveness and diversity of generated answers.<\/p>\n\n\n\n

\n
Read the paper<\/a><\/div>\n<\/div>\n\n\n\n
\n\t\n\t
\n\t\t
\n\t\t\t
\n\t
\n\n\t\t\n\t\t
\n\t\t\t\t\t\t\n\t\t\t\t\t<\/div>\n\t<\/div>\n<\/div>\t\t<\/div>\n\t<\/div>\n\n\t<\/div>\n","protected":false},"excerpt":{"rendered":"

How can generative AI tools represent less common identities and narratives; Can LLMs help players participate in game narratives; Using LLMs to improve geospatial demographic data; A Graph RAG Approach to Query-Focused Summarization; and more.<\/p>\n","protected":false},"author":42735,"featured_media":1039827,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"footnotes":""},"categories":[1],"tags":[],"research-area":[13556,13545,13559,13547],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[243984],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-1039806","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-blog","msr-research-area-artificial-intelligence","msr-research-area-human-language-technologies","msr-research-area-social-sciences","msr-research-area-systems-and-networking","msr-locale-en_us","msr-post-option-blog-homepage-featured"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[199563,199565],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[144736,332906,715138,901101],"related-projects":[1027041,170498],"related-events":[],"related-researchers":[{"type":"user_nicename","value":"Tarleton Gillespie","user_id":33877,"display_name":"Tarleton Gillespie","author_link":"Tarleton Gillespie<\/a>","is_active":false,"last_first":"Gillespie, Tarleton","people_section":0,"alias":"tarleton"},{"type":"user_nicename","value":"Sami Khairy","user_id":42906,"display_name":"Sami Khairy","author_link":"Sami Khairy<\/a>","is_active":false,"last_first":"Khairy, Sami","people_section":0,"alias":"samikhairy"},{"type":"user_nicename","value":"Vishak Gopal","user_id":39624,"display_name":"Vishak Gopal","author_link":"Vishak Gopal<\/a>","is_active":false,"last_first":"Gopal, Vishak","people_section":0,"alias":"vigopal"},{"type":"user_nicename","value":"Francis Y. Yan","user_id":39558,"display_name":"Francis Y. Yan","author_link":"Francis Y. Yan<\/a>","is_active":false,"last_first":"Yan, Francis Y.","people_section":0,"alias":"francisy"},{"type":"user_nicename","value":"Zhixiong Niu","user_id":38118,"display_name":"Zhixiong Niu","author_link":"Zhixiong Niu<\/a>","is_active":false,"last_first":"Niu, Zhixiong","people_section":0,"alias":"zhniu"},{"type":"user_nicename","value":"Ross Cutler","user_id":40660,"display_name":"Ross Cutler","author_link":"Ross Cutler<\/a>","is_active":false,"last_first":"Cutler, Ross","people_section":0,"alias":"rcutler"},{"type":"user_nicename","value":"Sudha Rao","user_id":38341,"display_name":"Sudha Rao","author_link":"Sudha Rao<\/a>","is_active":false,"last_first":"Rao, Sudha","people_section":0,"alias":"sudhra"},{"type":"user_nicename","value":"Weijia Xu","user_id":42636,"display_name":"Weijia Xu","author_link":"Weijia Xu<\/a>","is_active":false,"last_first":"Xu, Weijia","people_section":0,"alias":"weijiaxu"},{"type":"user_nicename","value":"Chris Brockett","user_id":31423,"display_name":"Chris Brockett","author_link":"Chris Brockett<\/a>","is_active":false,"last_first":"Brockett, Chris","people_section":0,"alias":"chrisbkt"},{"type":"user_nicename","value":"Nebojsa Jojic","user_id":32384,"display_name":"Nebojsa Jojic","author_link":"Nebojsa Jojic<\/a>","is_active":false,"last_first":"Jojic, Nebojsa","people_section":0,"alias":"jojic"},{"type":"user_nicename","value":"Michael Xu","user_id":41979,"display_name":"Michael Xu","author_link":"Michael Xu<\/a>","is_active":false,"last_first":"Xu, Michael","people_section":0,"alias":"michaelxu"},{"type":"user_nicename","value":"JORGE LEANDRO","user_id":42261,"display_name":"Jorge Leandro","author_link":"Jorge Leandro<\/a>","is_active":false,"last_first":"Leandro, Jorge","people_section":0,"alias":"jorgeleandro"},{"type":"user_nicename","value":"Bill Dolan","user_id":31229,"display_name":"Bill Dolan","author_link":"Bill Dolan<\/a>","is_active":false,"last_first":"Dolan, Bill","people_section":0,"alias":"billdol"},{"type":"user_nicename","value":"Seth Spielman","user_id":43314,"display_name":"Seth Spielman","author_link":"Seth Spielman<\/a>","is_active":false,"last_first":"Spielman, Seth","people_section":0,"alias":"sethspielman"},{"type":"user_nicename","value":"Darren Edge","user_id":31509,"display_name":"Darren Edge","author_link":"Darren Edge<\/a>","is_active":false,"last_first":"Edge, Darren","people_section":0,"alias":"daedge"},{"type":"user_nicename","value":"Ha Trinh","user_id":43245,"display_name":"Ha Trinh","author_link":"Ha Trinh<\/a>","is_active":false,"last_first":"Trinh, Ha","people_section":0,"alias":"trinhha"},{"type":"user_nicename","value":"Joshua Bradley","user_id":43272,"display_name":"Joshua Bradley","author_link":"Joshua Bradley<\/a>","is_active":false,"last_first":"Bradley, Joshua","people_section":0,"alias":"joshbradley"},{"type":"user_nicename","value":"Alex Chao","user_id":43254,"display_name":"Alex Chao","author_link":"Alex Chao<\/a>","is_active":false,"last_first":"Chao, Alex","people_section":0,"alias":"achao"},{"type":"user_nicename","value":"Apurva Mody","user_id":43257,"display_name":"Apurva Mody","author_link":"Apurva Mody<\/a>","is_active":false,"last_first":"Mody, Apurva","people_section":0,"alias":"moapurva"},{"type":"user_nicename","value":"Steven Truitt","user_id":43143,"display_name":"Steven Truitt","author_link":"Steven Truitt<\/a>","is_active":false,"last_first":"Truitt, Steven","people_section":0,"alias":"steventruitt"},{"type":"user_nicename","value":"Jonathan Larson","user_id":32385,"display_name":"Jonathan Larson","author_link":"Jonathan Larson<\/a>","is_active":false,"last_first":"Larson, Jonathan","people_section":0,"alias":"jolarso"}],"msr_type":"Post","featured_image_thumbnail":"\"Research","byline":"","formattedDate":"May 29, 2024","formattedExcerpt":"How can generative AI tools represent less common identities and narratives; Can LLMs help players participate in game narratives; Using LLMs to improve geospatial demographic data; A Graph RAG Approach to Query-Focused Summarization; and more.","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1039806"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/42735"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=1039806"}],"version-history":[{"count":19,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1039806\/revisions"}],"predecessor-version":[{"id":1058262,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1039806\/revisions\/1058262"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1039827"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1039806"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=1039806"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=1039806"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1039806"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=1039806"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=1039806"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1039806"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1039806"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=1039806"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=1039806"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=1039806"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}