{"id":1133598,"date":"2025-03-10T09:00:00","date_gmt":"2025-03-10T16:00:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=1133598"},"modified":"2025-06-25T09:35:24","modified_gmt":"2025-06-25T16:35:24","slug":"semantic-telemetry-understanding-how-users-interact-with-ai-systems","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/semantic-telemetry-understanding-how-users-interact-with-ai-systems\/","title":{"rendered":"Semantic Telemetry: Understanding how users interact with AI systems"},"content":{"rendered":"\n<figure class=\"wp-block-image size-large\"><img loading=\"lazy\" decoding=\"async\" width=\"1024\" height=\"576\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1024x576.jpg\" alt=\"Semantic Telemetry blog | diagram showing relationships between chat, LLM prompt, and labeled data\" class=\"wp-image-1133588\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1024x576.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-300x169.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-768x432.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1066x600.jpg 1066w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-655x368.jpg 655w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-240x135.jpg 240w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-640x360.jpg 640w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-960x540.jpg 960w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1280x720.jpg 1280w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1.jpg 1400w\" sizes=\"auto, (max-width: 1024px) 100vw, 1024px\" \/><\/figure>\n\n\n\n<p>AI tools are proving useful across a range of applications, from helping to drive the new era of business transformation to helping artists craft songs. But which applications are providing the most value to users? We\u2019ll dig into that question in a series of blog posts that introduce the <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/project\/semantic-telemetry\/\">Semantic Telemetry<\/a> project at Microsoft Research. In this initial post, we will introduce a new data science approach that we will use to analyze topics and task complexity of Copilot in Bing usage.<\/p>\n\n\n\n<p>Human-AI interactions can be iterative and complex, requiring a new data science approach to understand user behavior to build and support increasingly high value use cases. Imagine the following chat:<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"408\" height=\"232\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure0_example_chat.png\" alt=\"Example chat between user and AI\" class=\"wp-image-1133589\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure0_example_chat.png 408w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure0_example_chat-300x171.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure0_example_chat-240x136.png 240w\" sizes=\"auto, (max-width: 408px) 100vw, 408px\" \/><\/figure>\n\n\n\n<p>Here we see that chats can be complex and span multiple topics, such as event planning, team building, and logistics. Generative AI has ushered in a two-fold paradigm shift. First, LLMs give us a new thing to measure, that is, how people interact with AI systems. Second, they give us a new way to measure those interactions, that is, they give us the capability to understand and make inferences on these interactions, at scale. The Semantic Telemetry project has created new measures to classify human-AI interactions and understand user behavior, contributing to efforts in developing new approaches for <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/news.microsoft.com\/source\/features\/ai\/measurement-is-the-key-to-helping-keep-ai-on-track\/\" target=\"_blank\" rel=\"noopener noreferrer\">measuring generative AI<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> across various use cases.<\/p>\n\n\n\n<p>Semantic Telemetry is a rethink of traditional telemetry&#8211;in which data is collected for understanding systems&#8211;designed for analyzing chat-based AI. We employ an innovative data science methodology that uses a large language model (LLM) to generate meaningful categorical labels, enabling us to gain insights into chat log data.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"650\" height=\"280\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure1_llm_clasifier.png\" alt=\"Flow chart illustrating the LLM classification process starting with chat input, then prompting LLM with chat using generated label taxonomy, and output is the labeled chat.\" class=\"wp-image-1133590\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure1_llm_clasifier.png 650w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure1_llm_clasifier-300x129.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure1_llm_clasifier-240x103.png 240w\" sizes=\"auto, (max-width: 650px) 100vw, 650px\" \/><figcaption class=\"wp-element-caption\">Figure 1: Prompting an LLM to classify a conversation based on LLM generated label taxonomy<\/figcaption><\/figure>\n\n\n\n<p>This process begins with developing a set of classifications and definitions. We create these classifications by instructing an LLM to generate a short summary of the conversation, and then iteratively prompting the LLM to generate, update, and review classification labels on a batched set of summaries. This process is outlined in the paper: <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/tnt-llm-text-mining-at-scale-with-large-language-models\/\">TnT-LLM: Text Mining at Scale with Large Language Models<\/a>. We then prompt an LLM with these generated classifiers to label new unstructured (and unlabeled) chat log data.<\/p>\n\n\n\n<figure class=\"wp-block-embed aligncenter is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"yt-consent-placeholder\" role=\"region\" aria-label=\"Video playback requires cookie consent\" data-video-id=\"9O2UaMCtj5c\" data-poster=\"https:\/\/img.youtube.com\/vi\/9O2UaMCtj5c\/maxresdefault.jpg\"><iframe aria-hidden=\"true\" tabindex=\"-1\" title=\"Semantic Telemetry - taxonomy generation\" width=\"500\" height=\"281\" data-src=\"https:\/\/www.youtube-nocookie.com\/embed\/9O2UaMCtj5c?feature=oembed&rel=0&enablejsapi=1\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><div class=\"yt-consent-placeholder__overlay\"><button class=\"yt-consent-placeholder__play\"><svg width=\"42\" height=\"42\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\"><g fill=\"none\" fill-rule=\"evenodd\"><circle fill=\"#000\" opacity=\".556\" cx=\"21\" cy=\"21\" r=\"21\"\/><path stroke=\"#FFF\" d=\"M27.5 22l-12 8.5v-17z\"\/><\/g><\/svg><span class=\"yt-consent-placeholder__label\">Video playback requires cookie consent<\/span><\/button><\/div><\/div>\n<\/div><figcaption class=\"wp-element-caption\">Description of LLM generated label taxonomy process<\/figcaption><\/figure>\n\n\n\n<p>With this approach, we have analyzed how people interact with Copilot in Bing. In this blog, we examine insights into how people are using Copilot in Bing, including how that differs from traditional search engines. Note that all analyses were conducted on anonymous Copilot interactions containing no personal information.<\/p>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"topics\">Topics<\/h2>\n\n\n\n<p>To get a clear picture of how people are using Copilot in Bing, we need to first classify sessions into topical categories. To do this, we developed a topic <strong>classifier<\/strong>. We used the LLM classification approach described above to label the primary topic (domain) for the entire content of the chat. Although a single chat can cover multiple topics, for this analysis, we generated a single label for the primary topic of the conversation. We sampled five million anonymized Copilot in Bing chats during August and September 2024, and found that globally, 21% of all chats were about <em>technology<\/em>, with a high concentration of these chats in <em>programming and scripting and computers and electronics<\/em>.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"564\" height=\"467\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure2_topic_bubble_chart.png\" alt=\"Bubble chart showing topics based on percentage of sample. Primary topics shown are Technology (21%), Entertainment (12.8%), Health (11%), Language, Writing, & Editing (11.6%), Lifestyle (9.2%), Money (8.5%), History, Events, & Law (8.5%), Career (7.8%), Science (6.3%)\" class=\"wp-image-1133591\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure2_topic_bubble_chart.png 564w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure2_topic_bubble_chart-300x248.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure2_topic_bubble_chart-217x180.png 217w\" sizes=\"auto, (max-width: 564px) 100vw, 564px\" \/><figcaption class=\"wp-element-caption\">Figure 2: Top Copilot in Bing topics based on anonymized data (August-September 2024)<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"346\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure3_technology_topic_bubble_chart.png\" alt=\"Bubble chart of Technology topic showing subtopics: Programming & scripting, Computers & electronics, Engineering & design, Data analysis, and ML & AI.\" class=\"wp-image-1133592\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure3_technology_topic_bubble_chart.png 600w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure3_technology_topic_bubble_chart-300x173.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure3_technology_topic_bubble_chart-240x138.png 240w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption class=\"wp-element-caption\">Figure 3: Frequent topic summaries in Technology<\/figcaption><\/figure>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"430\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure4_entertainment_topic_bubble_chart.png\" alt=\"Bubble chart of Entertainment showing subtopics: Entertainment, Sports & fitness, Travel & tourism, Small talk & chatbot, and Gaming\" class=\"wp-image-1133593\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure4_entertainment_topic_bubble_chart.png 600w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure4_entertainment_topic_bubble_chart-300x215.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure4_entertainment_topic_bubble_chart-240x172.png 240w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption class=\"wp-element-caption\">Figure 4: Frequent topic summaries in Entertainment<\/figcaption><\/figure>\n\n\n\n<p>Diving into the technology category, we find a lot of professional tasks in <em>programming and scripting<\/em>, where users request problem-specific assistance such as fixing a SQL query syntax error. In <em>computers and electronics<\/em>, we observe users getting help with tasks like adjusting screen brightness and troubleshooting internet connectivity issues. We can compare this with our second most common topic, <em>entertainment<\/em>, in which we see users seeking information related to personal activities like hiking and game nights.<\/p>\n\n\n\n<p>We also note that top topics differ by platform. The figure below depicts topic popularity based on mobile and desktop usage. Mobile device users tend to use the chat for more personal-related tasks such as helping to plant a garden or understanding medical symptoms whereas desktop users conduct more professional tasks like revising an email.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"501\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure5_sankey_platforms.png\" alt=\"Sankey visual showing top topics for Desktop and Mobile users\" class=\"wp-image-1133594\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure5_sankey_platforms.png 600w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure5_sankey_platforms-300x251.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure5_sankey_platforms-216x180.png 216w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption class=\"wp-element-caption\">Figure 5: Top topics for desktop users and mobile users<\/figcaption><\/figure>\n\n\n\n\t<div class=\"border-bottom border-top border-gray-300 mt-5 mb-5 msr-promo text-center text-md-left alignwide\" data-bi-aN=\"promo\" data-bi-id=\"999693\">\n\t\t\n\n\t\t<p class=\"msr-promo__label text-gray-800 text-center text-uppercase\">\n\t\t<span class=\"px-4 bg-white display-inline-block font-weight-semibold small\">Spotlight: Event Series<\/span>\n\t<\/p>\n\t\n\t<div class=\"row pt-3 pb-4 align-items-center\">\n\t\t\t\t\t\t<div class=\"msr-promo__media col-12 col-md-5\">\n\t\t\t\t<a class=\"bg-gray-300 display-block\" href=\"https:\/\/www.microsoft.com\/en-us\/research\/event\/microsoft-research-forum\/past-episodes\/?OCID=msr_researchforum_MCR_Blog_Promo\" aria-label=\"Microsoft Research Forum\" data-bi-cN=\"Microsoft Research Forum\" target=\"_blank\">\n\t\t\t\t\t<img decoding=\"async\" class=\"w-100 display-block\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/05\/Research-Forum-hero_1400x788.jpg\" alt=\"Research Forum | abstract background with colorful hexagons\" \/>\n\t\t\t\t<\/a>\n\t\t\t<\/div>\n\t\t\t\n\t\t\t<div class=\"msr-promo__content p-3 px-5 col-12 col-md\">\n\n\t\t\t\t\t\t\t\t\t<h2 class=\"h4\">Microsoft Research Forum<\/h2>\n\t\t\t\t\n\t\t\t\t\t\t\t\t<p id=\"microsoft-research-forum\" class=\"large\">Join us for a continuous exchange of ideas about research in the era of general AI. Watch the latest episodes on demand.<\/p>\n\t\t\t\t\n\t\t\t\t\t\t\t\t<div class=\"wp-block-buttons justify-content-center justify-content-md-start\">\n\t\t\t\t\t<div class=\"wp-block-button\">\n\t\t\t\t\t\t<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/event\/microsoft-research-forum\/past-episodes\/?OCID=msr_researchforum_MCR_Blog_Promo\" aria-describedby=\"microsoft-research-forum\" class=\"btn btn-brand glyph-append glyph-append-chevron-right\" data-bi-cN=\"Microsoft Research Forum\" target=\"_blank\">\n\t\t\t\t\t\t\tWatch on-demand\t\t\t\t\t\t<\/a>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<\/div><!--\/.msr-promo__content-->\n\t<\/div><!--\/.msr-promo__inner-wrap-->\n\t<\/div><!--\/.msr-promo-->\n\t\n\n\n<h2 class=\"wp-block-heading\" id=\"search-versus-copilot\">Search versus Copilot<\/h2>\n\n\n\n<p>Beyond analyzing topics, we compared Copilot in Bing usage to that of traditional search. Chat extends beyond traditional online search by enabling users to summarize, generate, compare, and analyze information. Human-AI interactions are conversational and more complex than traditional search (Figure 6).<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"3535\" height=\"2069\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2.png\" alt=\"Venn diagram showing differences between Bing Search and Copilot in Bing, with intersection in information lookup.\" class=\"wp-image-1133922\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2.png 3535w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-300x176.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-1024x599.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-768x450.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-1536x899.png 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-2048x1199.png 2048w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-480x280.png 480w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure6_venn_diagram_bing_vs_copilot_v2-240x140.png 240w\" sizes=\"auto, (max-width: 3535px) 100vw, 3535px\" \/><figcaption class=\"wp-element-caption\">Figure 6: Bing Search Query compared to Copilot in Bing Conversation<\/figcaption><\/figure>\n\n\n\n<p>A major differentiation between search and chat is the ability to ask more complex questions, but how can we measure this? We think of complexity as a scale ranging from simply asking chat to look up information to evaluating several ideas. We aim to understand the difficulty of a task if performed by a human without the assistance of AI. To achieve this, we developed the <strong>task complexity classifier<\/strong>, which assesses task difficulty using <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.quincycollege.edu\/wp-content\/uploads\/Anderson-and-Krathwohl_Revised-Blooms-Taxonomy.pdf\" target=\"_blank\" rel=\"noopener noreferrer\">Anderson and Krathwohl\u2019s Taxonomy of Learning Objectives<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. For our analysis, we have grouped the learning objectives into two categories: <em>low complexity<\/em> and <em>high complexity<\/em>. Any task more complicated than information lookup is classified as <em>high complexity<\/em>. Note that this would be very challenging to classify using traditional data science techniques.<\/p>\n\n\n\n<figure class=\"wp-block-embed aligncenter is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"yt-consent-placeholder\" role=\"region\" aria-label=\"Video playback requires cookie consent\" data-video-id=\"T-Rt46aozu4\" data-poster=\"https:\/\/img.youtube.com\/vi\/T-Rt46aozu4\/maxresdefault.jpg\"><iframe aria-hidden=\"true\" tabindex=\"-1\" title=\"Semantic Telemetry - task complexity classifier\" width=\"500\" height=\"281\" data-src=\"https:\/\/www.youtube-nocookie.com\/embed\/T-Rt46aozu4?feature=oembed&rel=0&enablejsapi=1\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><div class=\"yt-consent-placeholder__overlay\"><button class=\"yt-consent-placeholder__play\"><svg width=\"42\" height=\"42\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\"><g fill=\"none\" fill-rule=\"evenodd\"><circle fill=\"#000\" opacity=\".556\" cx=\"21\" cy=\"21\" r=\"21\"\/><path stroke=\"#FFF\" d=\"M27.5 22l-12 8.5v-17z\"\/><\/g><\/svg><span class=\"yt-consent-placeholder__label\">Video playback requires cookie consent<\/span><\/button><\/div><\/div>\n<\/div><figcaption class=\"wp-element-caption\">Description of task complexity and 6 categories of the Anderson and Krathwohl&#8217;s Taxonomy of Learning Objectives<\/figcaption><\/figure>\n\n\n\n<p>Comparing <em>low<\/em> versus <em>high complexity<\/em> tasks, most chat interactions were categorized as <em>high complexity<\/em> (78.9%), meaning that they were more complex than looking up information. <em>Programming and scripting, marketing and sales, and creative and professional writing<\/em> are topics in which users engage in higher complexity tasks (Figure 7) such as learning a skill, troubleshooting a problem, or writing an article.<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"600\" height=\"356\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure7_high_complexity_tasks.png\" alt=\"Highest and lowest complexity topics based on percent of high complexity chats\" class=\"wp-image-1133596\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure7_high_complexity_tasks.png 600w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure7_high_complexity_tasks-300x178.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry_figure7_high_complexity_tasks-240x142.png 240w\" sizes=\"auto, (max-width: 600px) 100vw, 600px\" \/><figcaption class=\"wp-element-caption\">Figure 7: Most and least complex topics based on percentage of high complexity tasks.<\/figcaption><\/figure>\n\n\n\n<p><em>Travel and tourism and history and culture <\/em>scored lowest in complexity, with users looking up information like flight times and latest news updates.<\/p>\n\n\n\n<figure class=\"wp-block-embed aligncenter is-type-video is-provider-youtube wp-block-embed-youtube wp-embed-aspect-16-9 wp-has-aspect-ratio\"><div class=\"wp-block-embed__wrapper\">\n<div class=\"yt-consent-placeholder\" role=\"region\" aria-label=\"Video playback requires cookie consent\" data-video-id=\"7ucGpDLbv-U\" data-poster=\"https:\/\/img.youtube.com\/vi\/7ucGpDLbv-U\/maxresdefault.jpg\"><iframe aria-hidden=\"true\" tabindex=\"-1\" title=\"Semantic Telemetry - Task Complexity Dashboard\" width=\"500\" height=\"281\" data-src=\"https:\/\/www.youtube-nocookie.com\/embed\/7ucGpDLbv-U?feature=oembed&rel=0&enablejsapi=1\" frameborder=\"0\" allow=\"accelerometer; autoplay; clipboard-write; encrypted-media; gyroscope; picture-in-picture; web-share\" referrerpolicy=\"strict-origin-when-cross-origin\" allowfullscreen><\/iframe><div class=\"yt-consent-placeholder__overlay\"><button class=\"yt-consent-placeholder__play\"><svg width=\"42\" height=\"42\" xmlns=\"http:\/\/www.w3.org\/2000\/svg\" aria-hidden=\"true\" focusable=\"false\"><g fill=\"none\" fill-rule=\"evenodd\"><circle fill=\"#000\" opacity=\".556\" cx=\"21\" cy=\"21\" r=\"21\"\/><path stroke=\"#FFF\" d=\"M27.5 22l-12 8.5v-17z\"\/><\/g><\/svg><span class=\"yt-consent-placeholder__label\">Video playback requires cookie consent<\/span><\/button><\/div><\/div>\n<\/div><figcaption class=\"wp-element-caption\">Demo of task complexity and topics on anonymous Copilot interactions<\/figcaption><\/figure>\n\n\n\n<p>When should you use chat instead of search? A 2024 Microsoft Research study: <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/the-use-of-generative-search-engines-for-knowledge-work-and-complex-tasks\/\">The Use of Generative Search Engines for Knowledge Work and Complex Tasks<\/a>, suggests that people are seeing value in technical, complex tasks such as web development and data analysis. Bing Search contained more queries with lower complexity focused on non-professional areas, like <em>gaming and entertainment<\/em>, <em>travel and tourism<\/em>, and <em>fashion and beauty<\/em>, while chat had a greater distribution of complex technical tasks. (Figure 8).<\/p>\n\n\n\n<figure class=\"wp-block-image aligncenter size-full\"><img loading=\"lazy\" decoding=\"async\" width=\"2622\" height=\"1628\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2.png\" alt=\"Comparison of Bing Search and Copilot in Bing topics based on complexity and knowledge work. Copilot in Bing trends greater complexity and greater knowledge work than Bing Search.\" class=\"wp-image-1133909\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2.png 2622w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2-300x186.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2-1024x636.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2-768x477.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2-1536x954.png 1536w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2-2048x1272.png 2048w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/figure8_bing_vs_copilot_v2-240x149.png 240w\" sizes=\"auto, (max-width: 2622px) 100vw, 2622px\" \/><figcaption class=\"wp-element-caption\">Figure 8: Comparison of Bing Search and Copilot in Bing for anonymized sample data (May-June 2023)<\/figcaption><\/figure>\n\n\n\n<h2 class=\"wp-block-heading\" id=\"conclusion\">Conclusion<\/h2>\n\n\n\n<p>LLMs have enabled a new era of high-quality human-AI interaction, and with it, the capability to analyze those same interactions with high fidelity, at scale, and in near real-time. We are now able to obtain actionable insight from complex data that is not possible with traditional data science pattern-matching methods. LLM-generated classifications are pushing research into new directions that will ultimately improve user experience and satisfaction when using chat and other user-AI interaction tools.<\/p>\n\n\n\n<p>This analysis indicates that Copilot in Bing is enabling users to do more complex work, specifically in areas such as technology. In our next post, we will explore how Copilot in Bing is supporting professional knowledge work and how we can use these measures as indicators for retention and engagement.<\/p>\n\n\n\n<hr class=\"wp-block-separator has-alpha-channel-opacity\"\/>\n\n\n\n<p>FOOTNOTE: This research was conducted at the time the feature Copilot in Bing was available as part of the Bing service; since October 2024 Copilot in Bing has been deprecated in favor of the standalone Microsoft Copilot service.<\/p>\n\n\n\n<p><em>References:<\/em><\/p>\n\n\n\n<ol class=\"wp-block-list\">\n<li>Krathwohl, D. R. (2002). A Revision of Bloom\u2019s Taxonomy: An Overview.\u202f<em>Theory Into Practice<\/em>,\u202f41(4), 212\u2013218. <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/doi.org\/10.1207\/s15430421tip4104_2\" target=\"_blank\" rel=\"noopener noreferrer\">https:\/\/doi.org\/10.1207\/s15430421tip4104_2<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/li>\n<\/ol>\n","protected":false},"excerpt":{"rendered":"<p>AI interactions can be iterative and complex. Learn how the Semantic Telemetry project at Microsoft Research is developing a new data science approach to understand human-AI interactions and their value.<\/p>\n","protected":false},"author":38004,"featured_media":1133588,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr-author-ordering":[{"type":"user_nicename","value":"Amber Hoak","user_id":"37992"},{"type":"user_nicename","value":"Scott Counts","user_id":"31471"},{"type":"user_nicename","value":"Kate Lytvynets","user_id":"38073"},{"type":"user_nicename","value":"David Tittsworth","user_id":"38064"},{"type":"user_nicename","value":"Siddharth Suri","user_id":"33766"},{"type":"user_nicename","value":"Ben Cutler","user_id":"31188"},{"type":"user_nicename","value":"Weiwei Yang","user_id":"40138"}],"msr_hide_image_in_river":null,"footnotes":""},"categories":[1],"tags":[],"research-area":[13556,13555],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[269148,243984,269142],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-1133598","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-blog","msr-research-area-artificial-intelligence","msr-research-area-search-information-retrieval","msr-locale-en_us","msr-post-option-approved-for-river","msr-post-option-blog-homepage-featured","msr-post-option-include-in-river"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[199565],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[1119417],"related-events":[],"related-researchers":[{"type":"user_nicename","value":"Amber Hoak","user_id":37992,"display_name":"Amber Hoak","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/amhoak\/\" aria-label=\"Visit the profile page for Amber Hoak\">Amber Hoak<\/a>","is_active":false,"last_first":"Hoak, Amber","people_section":0,"alias":"amhoak"},{"type":"user_nicename","value":"Scott Counts","user_id":31471,"display_name":"Scott Counts","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/counts\/\" aria-label=\"Visit the profile page for Scott Counts\">Scott Counts<\/a>","is_active":false,"last_first":"Counts, Scott","people_section":0,"alias":"counts"},{"type":"user_nicename","value":"Kate Lytvynets","user_id":38073,"display_name":"Kate Lytvynets","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/kalytv\/\" aria-label=\"Visit the profile page for Kate Lytvynets\">Kate Lytvynets<\/a>","is_active":false,"last_first":"Lytvynets, Kate","people_section":0,"alias":"kalytv"},{"type":"user_nicename","value":"David Tittsworth","user_id":38064,"display_name":"David Tittsworth","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/datittsw\/\" aria-label=\"Visit the profile page for David Tittsworth\">David Tittsworth<\/a>","is_active":false,"last_first":"Tittsworth, David","people_section":0,"alias":"datittsw"},{"type":"user_nicename","value":"Siddharth Suri","user_id":33766,"display_name":"Siddharth Suri","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/suri\/\" aria-label=\"Visit the profile page for Siddharth Suri\">Siddharth Suri<\/a>","is_active":false,"last_first":"Suri, Siddharth","people_section":0,"alias":"suri"},{"type":"user_nicename","value":"Weiwei Yang","user_id":40138,"display_name":"Weiwei Yang","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/weiwya\/\" aria-label=\"Visit the profile page for Weiwei Yang\">Weiwei Yang<\/a>","is_active":false,"last_first":"Yang, Weiwei","people_section":0,"alias":"weiwya"}],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-960x540.jpg\" class=\"img-object-cover\" alt=\"Semantic Telemetry blog | diagram showing relationships between chat, LLM prompt, and labeled data\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-960x540.jpg 960w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-300x169.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1024x576.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-768x432.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1066x600.jpg 1066w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-655x368.jpg 655w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-240x135.jpg 240w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-640x360.jpg 640w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1-1280x720.jpg 1280w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2025\/03\/Semantic-Telemetry-BlogHeroFeature-1400x788-1.jpg 1400w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/>","byline":"","formattedDate":"March 10, 2025","formattedExcerpt":"AI interactions can be iterative and complex. Learn how the Semantic Telemetry project at Microsoft Research is developing a new data science approach to understand human-AI interactions and their value.","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1133598","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/38004"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=1133598"}],"version-history":[{"count":26,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1133598\/revisions"}],"predecessor-version":[{"id":1133974,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1133598\/revisions\/1133974"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1133588"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1133598"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=1133598"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=1133598"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1133598"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=1133598"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=1133598"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1133598"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1133598"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=1133598"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=1133598"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=1133598"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}