{"id":566577,"date":"2019-02-08T09:00:05","date_gmt":"2019-02-08T17:00:05","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=566577"},"modified":"2020-10-15T14:52:15","modified_gmt":"2020-10-15T21:52:15","slug":"email-overload-using-machine-learning-to-manage-messages-commitments","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/email-overload-using-machine-learning-to-manage-messages-commitments\/","title":{"rendered":"Email overload: Using machine learning to manage messages, commitments"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter size-large wp-image-566580\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-1024x576.png\" alt=\"\" width=\"1024\" height=\"576\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-1024x576.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-300x169.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-768x432.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-1066x600.png 1066w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-655x368.png 655w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-343x193.png 343w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788.png 1400w\" sizes=\"auto, (max-width: 1024px) 100vw, 
1024px\" \/><\/p>\n<p>As email continues to be not only an important means of communication but also an official record of information and a tool for managing tasks, schedules, and collaborations, making sense of everything moving in and out of our inboxes will only get more difficult. The good news is there\u2019s a method to the madness of staying on top of your email, and Microsoft researchers are drawing on this behavior to create tools to support users. Two teams working in the space will be presenting papers at this year\u2019s <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/www.wsdm-conference.org\/2019\/\">ACM International Conference on Web Search and Data Mining<\/a> February 11\u201315 in Melbourne, Australia.<\/p>\n<p>\u201cIdentifying the emails you need to pay attention to is a challenging task,\u201d says <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/ryenw\/\">Partner Researcher and Research Manager Ryen White of Microsoft Research<\/a>, who manages a team of about a dozen scientists and engineers and typically receives 100 to 200 emails a day. \u201cRight now, we end up doing a lot of that on our own.\u201d<\/p>\n<p><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.mckinsey.com\/~\/media\/McKinsey\/Industries\/High%20Tech\/Our%20Insights\/The%20social%20economy\/MGI_The_social_economy_Full_report.ashx\">According to the McKinsey Global Institute, professionals spend 28 percent of their time on email<\/a>, so thoughtful support tools have the potential to make a tangible difference.<\/p>\n<p>\u201cWe\u2019re trying to bring in machine learning to make sense of a huge amount of data to make you more productive and efficient in your work,\u201d says <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/hassanam\/\">Senior Researcher and Research Manager Ahmed Hassan Awadallah<\/a>. 
\u201cEfficiency could come from a better ability to handle email, getting back to people faster, not missing things you would have missed otherwise. If we\u2019re able to save some of that time so you could use it for your actual work function, that would be great.\u201d<\/p>\n<h3>Email deferral: Deciding now or later<\/h3>\n<p>Awadallah has been studying the relationship between individuals and their email for years, exploring <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2017\/04\/sigir17a.pdf\">how machine learning can better support users in their email responses<\/a> and <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2018\/04\/EmailQA_SIGIR18.pdf\">help make information in inboxes more accessible<\/a>. During these studies, he and fellow researchers began noticing varying behavior among users. Some tackled email-related tasks immediately, while others returned to messages multiple times before acting. The observations led them to wonder: How <em>do<\/em> users manage their messages, and how can we help them make the process more efficient?<\/p>\n<p>\u201cThere\u2019s this term called \u2018email overload,\u2019 where you have a lot of information flowing into your inbox and you are struggling to keep up with all the incoming messages,\u201d explains Awadallah, \u201cand different people come up with different strategies to cope.\u201d<\/p>\n<p>In <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/characterizing-and-predicting-email-deferral-behaviour\/\">\u201cCharacterizing and Predicting Email Deferral Behavior,\u201d<\/a> Awadallah and his coauthors reveal the inner workings of one such common strategy: email deferral, which they define as seeing an email but waiting until a later time to address it.<\/p>\n<p>The team\u2019s goal was twofold: to gain a deep understanding of deferral behavior and to build a predictive model that could help users in their deferral decisions and 
follow-up responses. The team\u2014a collaboration between Microsoft Research\u2019s Awadallah, <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/sdumais\/\">Susan Dumais<\/a>, and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/uwaterloo.academia.edu\/BaharehSarrafzadeh\">Bahareh Sarrafzadeh<\/a>, lead author on the paper and an intern at the time, and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/christopherhlin.com\/\">Christopher Lin<\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/people.cs.umass.edu\/~cjlee\/\">Chia-Jung Lee<\/a>, and <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/milads\/\">Milad Shokouhi<\/a> of the Microsoft Search, Assistant and Intelligence group\u2014dedicated a significant amount of resources to the former.<\/p>\n<p>\u201cAI and machine learning should be inspired by the behavior people are doing right now,\u201d says Awadallah.<\/p>\n<div id=\"attachment_566583\" style=\"width: 985px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-566583\" class=\"size-full wp-image-566583\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/figure-1-chart.png\" alt=\"\" width=\"975\" height=\"708\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/figure-1-chart.png 975w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/figure-1-chart-300x218.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/figure-1-chart-768x558.png 768w\" sizes=\"auto, (max-width: 975px) 100vw, 975px\" \/><p id=\"caption-attachment-566583\" class=\"wp-caption-text\">The probability of deferring an email based on the workload of the user as measured by the number of 
unhandled emails. The number of unhandled emails is one of many features Awadallah and his coauthors used in training their deferral prediction model.<\/p><\/div>\n<p>The team interviewed 15 subjects and analyzed the email logs of 40,000 anonymous users, finding that people defer for several reasons: They need more time and resources to respond than they have in that moment, or they\u2019re juggling more immediate tasks. They also factor in who the sender is and how many others have been copied. The researchers found that some of the more interesting reasons revolved around perception and boundaries: users delay a reply, or choose not to, in order to set expectations about how quickly they respond to messages.<\/p>\n<p>The researchers used this information to create a dataset of features\u2014such as the message length, the number of unanswered emails in an inbox, and whether a message was human- or machine-generated\u2014to train a model to predict whether a message will be deferred. The model has the potential to significantly improve the email experience, says Awadallah. 
For example, email clients could use such a model to remind users about emails they\u2019ve deferred or even forgotten about, saving them the effort they would have spent searching for those emails and reducing the likelihood of missing important ones.<\/p>\n<p>\u201cIf you have decided to leave an email for later, in many cases, you either just rely on memory or more primitive controls that your mail client provides like flagging your message or marking the message unread, and while these are useful strategies, we found that they do not provide enough support for users,\u201d says Awadallah.<\/p>\n<h3>Commitment detection: A promise is a promise<\/h3>\n<p>Among the deluge of incoming emails are outgoing messages containing promises we make\u2014promises to provide information, set up meetings, or follow up with coworkers\u2014and losing track of them has ramifications.<\/p>\n<p>\u201cMeeting your commitments is incredibly important in collaborative settings and helps build your reputation and establish trust,\u201d says <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/ryenw\/\">Ryen White<\/a>.<\/p>\n<p>Current commitment detection tools, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/blogs.windows.com\/windowsexperience\/2017\/03\/06\/windows-10-tip-cortana-can-automatically-remind-commitments\/\">such as those available in Cortana<\/a>, are pretty effective, but there\u2019s room for further advancement. 
White, lead author <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.linkedin.com\/in\/hosein-azarbonyad-11028141\/\">Hosein Azarbonyad<\/a>, who was interning with Microsoft at the time of the work, and coauthor <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/rsim\/\">Microsoft Research Principal Applied Scientist Robert Sim<\/a> seek to tackle one particular obstacle in their paper <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/publication\/domain-adaptation-for-commitment-detection-in-email\/\">\u201cDomain Adaptation for Commitment Detection in Email\u201d<\/a>: bias in the datasets available to train commitment detection models.<\/p>\n<p>Researcher access is generally limited to public corpora, which tend to be specific to the industry they come from. In this case, the team used public datasets of email from the energy company Enron and an unspecified tech startup referred to as \u201cAvocado.\u201d They found a significant disparity between models trained and evaluated on the same collection of emails and models trained on one collection and applied to another; the latter models did not perform as well.<\/p>\n<p>\u201cWe want to learn transferable models,\u201d explains White. \u201cThat\u2019s the goal\u2014to learn algorithms that can be applied to problems, scenarios, and corpora that are related but different to those used during training.\u201d<\/p>\n<p>To accomplish this, the group turned to transfer learning, which has been effective in other scenarios where datasets aren\u2019t representative of the environments in which they\u2019ll ultimately be deployed. 
In their paper, the researchers train their models to remove bias by identifying and devaluing certain information using three approaches: feature-level adaptation, sample-level adaptation, and an adversarial deep learning approach that uses an autoencoder.<\/p>\n<p>Emails contain a wide variety of words and phrases, some more likely to be related to a commitment\u2014\u201cI will,\u201d \u201cI shall,\u201d \u201clet you know\u201d\u2014than others. In the Enron corpus, domain-specific words like \u201cEnron,\u201d \u201cgas,\u201d and \u201cenergy\u201d may be overweighted in any model trained on it. Feature-level adaptation attempts to replace or transform these domain-specific terms, or <em>features<\/em>, with similar domain-specific features in the target domain, explains Sim. For instance, \u201cEnron\u201d might be replaced with \u201cAvocado,\u201d and \u201cenergy forecast\u201d might be replaced with a relevant tech industry term. Sample-level adaptation, meanwhile, aims to elevate emails in the training dataset that resemble emails in the target domain, downgrading those that aren\u2019t very similar. 
So if an Enron email is \u201cAvocado-like,\u201d the researchers will give it more weight while training.<\/p>\n<div id=\"attachment_566586\" style=\"width: 743px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-566586\" class=\"size-full wp-image-566586\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/general-schema-of-the-proposed.png\" alt=\"\" width=\"733\" height=\"778\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/general-schema-of-the-proposed.png 733w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/general-schema-of-the-proposed-283x300.png 283w\" sizes=\"auto, (max-width: 733px) 100vw, 733px\" \/><p id=\"caption-attachment-566586\" class=\"wp-caption-text\">General schema of the proposed neural autoencoder model used for commitment detection.<\/p><\/div>\n<p>The most novel\u2014and successful\u2014of the three techniques is the adversarial deep learning approach, which in addition to training the model to recognize commitments <em>also<\/em> trains the model to perform poorly at distinguishing between the emails it\u2019s being trained on and the emails it will evaluate; this is the <em>adversarial<\/em> aspect. Essentially, the network receives negative feedback when it indicates an email source, training it to be <em>bad<\/em> at recognizing which domain a particular email comes from. 
This has the effect of minimizing or removing domain-specific features from the model.<\/p>\n<p>\u201cThere\u2019s something counterintuitive to trying to train the network to be really bad at a classification problem, but it\u2019s actually the nudge that helps steer the network to do the right thing for our main classification task, which is, is this a commitment or not,\u201d says Sim.<\/p>\n<h3>Empowering users to do more<\/h3>\n<p>The two papers are aligned with the greater Microsoft goal of empowering individuals to do more, tapping into an ability to be more productive in a space full of opportunity for increased efficiency.<\/p>\n<p>Reflecting on his own email usage, which finds him interacting with his email frequently throughout the day, White questions the cost-benefit of some of the behavior.<\/p>\n<p>\u201cIf you think about it rationally, it\u2019s like, \u2018Wow, this is a thing that occupies a lot of our time and attention. Do we really get the return on that investment?\u2019\u201d he says.<\/p>\n<p>He and other Microsoft researchers are confident they can help users feel better about the answer with the continued exploration of the tools needed to support them.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>As email continues to be not only an important means of communication but also an official record of information and a tool for managing tasks, schedules, and collaborations, making sense of everything moving in and out of our inboxes will only get more difficult. The good news is there\u2019s a method to the madness of staying on top of your email, and Microsoft researchers are drawing on this behavior to create tools to support users. 
<\/p>\n","protected":false},"author":37074,"featured_media":566580,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"categories":[194455],"tags":[],"research-area":[13556,13545],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-566577","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-machine-learning","msr-research-area-artificial-intelligence","msr-research-area-human-language-technologies","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[643845,644373],"related-projects":[],"related-events":[558867],"related-researchers":[],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788.png\" class=\"img-object-cover\" alt=\"a laptop computer sitting on top of a table\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788.png 1400w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-300x169.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-768x432.png 768w, 
https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-1024x576.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-1066x600.png 1066w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-655x368.png 655w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2019\/02\/WSDM-Characterizing-Predicting-Email_Site_01_2019_1400x788-343x193.png 343w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/>","byline":"","formattedDate":"February 8, 2019","formattedExcerpt":"As email continues to be not only an important means of communication but also an official record of information and a tool for managing tasks, schedules, and collaborations, making sense of everything moving in and out of our inboxes will only get more difficult. 
The&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/566577","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/37074"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=566577"}],"version-history":[{"count":6,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/566577\/revisions"}],"predecessor-version":[{"id":698302,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/566577\/revisions\/698302"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/566580"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=566577"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=566577"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=566577"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=566577"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=566577"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=566577"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/ms
r-locale?post=566577"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=566577"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=566577"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=566577"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=566577"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}