{"id":5962,"date":"2020-12-08T13:13:09","date_gmt":"2020-12-08T21:13:09","guid":{"rendered":"https:\/\/www.microsoft.com\/insidetrack\/blog\/?p=5962"},"modified":"2023-06-08T12:55:30","modified_gmt":"2023-06-08T19:55:30","slug":"microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/","title":{"rendered":"Microsoft adopts proactive method for preventing and mitigating failures"},"content":{"rendered":"<div class=\"m-alert f-error\" role=\"alert\">\n<div>\n<div class=\"c-glyph glyph-incident-triangle\" aria-label=\"Error message\"><\/div>\n<p class=\"c-paragraph\">This content has been archived, and while it was correct at time of publication, it may no longer be accurate or reflect the current situation at Microsoft.<\/p>\n<\/div>\n<\/div>\n<p>System crashes and other failures have long been a fact of life in the IT world.<\/p>\n<p>Microsoft has always worked hard to ensure that its customers have reliable software tools, but it experiences problems as well. To reduce the severity and length of system failures, Microsoft Digital core\u2014the engineering organization at Microsoft that builds and manages the products, processes, and services that Microsoft runs on\u2014has created a standard engineering set of tools to identify where failures might occur and how to address them.<\/p>\n<p>This approach is called Failure Mode and Effects Analysis (FMEA). Microsoft uses FMEA to recognize potential failure risks, understand their impact, and mitigate them before they occur, rather than reacting after a failure occurs.<\/p>\n<p>FMEA is combined with Service Quality Portal (SQP), a tool that combines Microsoft SharePoint, Microsoft Visio, and Microsoft Azure-based cloud applications. 
Built by an engineering team within Microsoft, it provides a full capability of risk management using the FMEA framework. SQP makes it much easier to track and mitigate the complex sets of failures that can occur in the cloud.<\/p>\n<p>\u201cIt\u2019s a matter of combining people, processes, and technology,\u201d says Harsh Sharma, a senior program manager for Microsoft. \u201cSQP and FMEA provide the framework we need to make our systems more secure and stable.\u201d<\/p>\n<p>[<a href=\"https:\/\/www.microsoft.com\/en-us\/itshowcase\/monitoring-endtoend-enterprise-health-with-azure\"><em>Learn how Microsoft created a telemetry platform to uncover information about end-to-end enterprise health with Microsoft Azure.<\/em><\/a> <a href=\"https:\/\/www.microsoft.com\/en-us\/itshowcase\/monitoring-sap-end-to-end-on-azure\"><em>Find out how Microsoft monitors SAP end to end on Microsoft Azure.<\/em><\/a> <a href=\"https:\/\/www.microsoft.com\/en-us\/itshowcase\/driving-effective-data-governance-for-improved-quality-and-analytics\"><em>Learn about how Microsoft has created a modern data-governance strategy.<\/em><\/a>]<\/p>\n<p>Once risks are identified, engineers can set up auto-detect to identify and mitigate failures, allowing them to replace the human decision-making used previously.<\/p>\n<p>This new approach addresses how computing faults have evolved over the past 20 years.<\/p>\n<p>Back then, an IT failure was invariably a local event, limited to a handful of PCs or an enterprise\u2019s central servers. But today, with most computing taking place in the cloud, IT infrastructure is widely distributed. It\u2019s also often built with commodity hardware that depends on an array of third-party and partner services.<\/p>\n<p>\u201cCloud services are complex,\u201d Sharma says. \u201cThey have a lot of moving pieces and need a lot of scalability. 
Owners of services may even have to rebuild their cloud architecture on occasion.\u201d<\/p>\n<p>Several years back, faults were outlined and tracked on a Microsoft Excel spreadsheet. Extrapolate this to more than 2,000 service components in Microsoft Digital, and that\u2019s a lot of spreadsheets.<\/p>\n<p>To get around this, engineers use the SQP tool and FMEA method to store and maintain architecture diagrams and perform risk management prior to every major release.<\/p>\n<p>Some failures are minor\u2014perhaps a set of users can\u2019t sign into an app, or the app has limited functionality. Others are more serious, such as widespread outages that shut down critical services for multiple cloud customers, such as email or access to important data.<\/p>\n<p>The causes of these failures can vary widely as well, from natural disasters such as hurricanes, to human error or hardware or software errors.<\/p>\n<blockquote class=\"quote-body\"><p>Today we just can\u2019t be in firefighting mode all the time. We\u2019re committed to 99.9 percent reliability\u2014sometimes 99.999 percent. So, it\u2019s a requirement to proactively identify potential failures.<\/p>\n<p class=\"source\">&#8211; Harsh Sharma, senior program manager<\/p>\n<\/blockquote>\n<p><strong>A new way to manage failures<\/strong><\/p>\n<p>The old ways of managing failures simply don\u2019t work any longer. And for good reason.<\/p>\n<p>In the past, \u201cfailure management\u201d meant listing every possible error imaginable in the aforementioned Microsoft Excel spreadsheet, with information about how to respond. It\u2019s as if a fire department listed potential fire hazards in every house, then did nothing until the smoke alarms went off.<\/p>\n<p>\u201cToday we just can\u2019t be in firefighting mode all the time,\u201d Sharma says. \u201cWe\u2019re committed to 99.9 percent reliability\u2014sometimes 99.999 percent. 
So, it\u2019s a requirement to proactively identify potential failures.\u201d<\/p>\n<p>That has meant a shift in what successful fault mitigation looks like.<\/p>\n<p>Rather than focusing on extending the time between failures, Microsoft Digital aims to reduce the time to recover. Complex systems are prone to a wide range of failures, so the best strategy is to cope with them in a way that minimizes impact on customers.<\/p>\n<p>That\u2019s where FMEA comes in.<\/p>\n<p>It prioritizes work in areas such as detection, mitigation, and recovery from failures\u2014all factors in reducing the time needed to correct a failure. Using this approach, engineering teams think through potential reliability weaknesses and are prepared when failures occur, greatly reducing impact on users.<\/p>\n<p>One big part of that is using the design phase of a new service or product to understand how it might fail.<\/p>\n<p>\u201cWe want to identify potential problems and poke holes in the service before something is deployed in production,\u201d Sharma says. \u201cYou want to understand the types of failures that could occur, how it would impact a business, and what could be the cause. And you want to have telemetry in place so that you can be alerted to a failure before a customer tells you about it.\u201d<\/p>\n<p>Not that doing so is easy.<\/p>\n<p>Microsoft engineers working with FMEA principles need to diagram a myriad of dependencies, determine how much redundancy a particular system has or needs, and map how different parts of a cloud service interact. Interestingly, hardware components such as disks, processors, and routers are not given substantial attention. Their potential faults are already well understood and relatively easy to trace and fix.<\/p>\n<p>Moving into the cloud has pluses and minuses for managing failures.<\/p>\n<p>The cloud certainly presents a more complicated IT picture than the days of centralized IT. 
But it also creates its own backup.<\/p>\n<blockquote class=\"quote-body\"><p>We\u2019re getting in front of issues before they become a big deal. Now we\u2019re able to have conversations based on identifying failure points where we want to invest in new designs.<\/p>\n<p class=\"source\">&#8211; Dale Voth, site reliability engineering manager<\/p>\n<\/blockquote>\n<figure id=\"attachment_5967\" aria-describedby=\"caption-attachment-5967\" style=\"width: 300px\" class=\"wp-caption alignright\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-5967 size-medium\" src=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_secondary-300x225.jpg\" alt=\"Dale Voth works on his laptop in his home office. \" width=\"300\" height=\"225\" srcset=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_secondary-300x225.jpg 300w, https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_secondary-768x576.jpg 768w, https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_secondary.jpg 800w\" sizes=\"auto, (max-width: 300px) 100vw, 300px\" \/><figcaption id=\"caption-attachment-5967\" class=\"wp-caption-text\"><em>Dale Voth, site reliability engineering manager for Microsoft, has seen new approaches to fault mitigation make cloud services more robust and reliable. (Photo submitted by Dale Voth)<\/em><\/figcaption><\/figure>\n<p>\u201cMoving into the cloud allows us to provide resiliency in a more expedient fashion,\u201d says Dale Voth, a site reliability engineering (SRE) manager for Microsoft. \u201cBut it can add cost. Moving legacy systems into the cloud can be challenging, but provides an ecosystem with more reliability and resilience options.\u201d<\/p>\n<p>Voth says that when he began his current role three years ago, service outages were common. 
But identifying weak points and improving reporting telemetry has greatly reduced major problems.<\/p>\n<p>\u201cWe\u2019re getting in front of issues before they become a big deal,\u201d he says. \u201cNow we\u2019re able to have conversations based on identifying failure points where we want to invest in new designs.\u201d<\/p>\n<p><strong>FMEA helps connect sellers and customers<\/strong><\/p>\n<p>Microsoft marketers have plenty of ways to connect with their customers. But they have to be careful\u2014they don\u2019t want to overwhelm customers, or ignore them. They want to connect with customers at the right time: when they\u2019re ready to buy.<\/p>\n<p>\u201cGiven the vast number of marketing assets where our customers engage with us, that\u2019s a challenge,\u201d says Saravanan Arumugam Subramani, a senior system engineer at Microsoft. \u201cA failure here might mean a sales contact is lost, or a seller reaches out at an inopportune time.\u201d<\/p>\n<p>A systematic approach such as FMEA helps ensure service reliability.<\/p>\n<p>FMEA helps identify failure points across the ecosystem, quantify the risks, and mitigate them with appropriate design. FMEA provides a framework for engineers to collaborate and critique design and solutions constructively, promoting a culture of learning and growing. It also helps identify where to set up the appropriate alerts so Microsoft Digital can proactively detect imminent failures without spamming engineers with false alarms.<\/p>\n<p>Failures happen. But with the right engineering approach, they happen less often and with reduced impact. 
Microsoft Digital is working to ensure its customers have the apps and services they need when they need them.<\/p>\n<p>&nbsp;<\/p>\n<figure id=\"attachment_5968\" aria-describedby=\"caption-attachment-5968\" style=\"width: 1048px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"wp-image-5968 size-full\" src=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_infograph.png\" alt=\"A diagram shows the multiple steps taken to process new cloud applications to ensure they are robust and fault-resistant. \" width=\"1048\" height=\"536\" srcset=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_infograph.png 1048w, https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_infograph-300x153.png 300w, https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_infograph-1024x524.png 1024w, https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_infograph-768x393.png 768w\" sizes=\"auto, (max-width: 1048px) 100vw, 1048px\" \/><figcaption id=\"caption-attachment-5968\" class=\"wp-caption-text\"><em>Microsoft\u2019s new approach to identifying risk and mitigating failures focuses on heading off faults before they occur.<\/em><\/figcaption><\/figure>\n<p>As Microsoft Digital teams and Microsoft lean toward \u201cshifting quality left\u201d in the DevOps cycle, it\u2019s becoming a necessity to identify potential risks in the design and architecture phase of DevOps and mitigate them prior to deploying in production.<\/p>\n<p>As someone wisely said, \u201cPrevention is better than cure.\u201d Let\u2019s practice it.<\/p>\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/itshowcase\/monitoring-endtoend-enterprise-health-with-azure\">Learn how Microsoft created a telemetry platform to uncover information about end-to-end enterprise health with Microsoft Azure.<\/a><\/p>\n<p><a 
href=\"https:\/\/www.microsoft.com\/en-us\/itshowcase\/monitoring-sap-end-to-end-on-azure\">Find out how Microsoft monitors SAP end to end on Microsoft Azure.<\/a><\/p>\n<p><a href=\"https:\/\/www.microsoft.com\/en-us\/itshowcase\/driving-effective-data-governance-for-improved-quality-and-analytics\">Learn about how Microsoft has created a modern data-governance strategy.<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>This content has been archived, and while it was correct at time of publication, it may no longer be accurate or reflect the current situation at Microsoft. System crashes and other failures have long been a fact of life in the IT world. Microsoft has always worked hard to ensure that its customers have reliable [&hellip;]<\/p>\n","protected":false},"author":146,"featured_media":5965,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_hide_featured_on_single":false,"_show_featured_caption_on_single":true,"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[1],"tags":[],"coauthors":[674],"class_list":["post-5962","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-uncategorized","m-blog-post"],"yoast_head":"<!-- This site is optimized with the Yoast SEO plugin v24.8.1 - https:\/\/yoast.com\/wordpress\/plugins\/seo\/ -->\n<title>Microsoft adopts proactive method for preventing and mitigating failures - Inside Track Blog<\/title>\n<meta name=\"robots\" content=\"index, follow, max-snippet:-1, max-image-preview:large, max-video-preview:-1\" \/>\n<link rel=\"canonical\" href=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/\" \/>\n<meta property=\"og:locale\" content=\"en_US\" \/>\n<meta 
property=\"og:type\" content=\"article\" \/>\n<meta property=\"og:title\" content=\"Microsoft adopts proactive method for preventing and mitigating failures - Inside Track Blog\" \/>\n<meta property=\"og:description\" content=\"This content has been archived, and while it was correct at time of publication, it may no longer be accurate or reflect the current situation at Microsoft. System crashes and other failures have long been a fact of life in the IT world. Microsoft has always worked hard to ensure that its customers have reliable [&hellip;]\" \/>\n<meta property=\"og:url\" content=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/\" \/>\n<meta property=\"og:site_name\" content=\"Inside Track Blog\" \/>\n<meta property=\"article:published_time\" content=\"2020-12-08T21:13:09+00:00\" \/>\n<meta property=\"article:modified_time\" content=\"2023-06-08T19:55:30+00:00\" \/>\n<meta property=\"og:image\" content=\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg\" \/>\n\t<meta property=\"og:image:width\" content=\"2300\" \/>\n\t<meta property=\"og:image:height\" content=\"1294\" \/>\n\t<meta property=\"og:image:type\" content=\"image\/jpeg\" \/>\n<meta name=\"author\" content=\"Inside Track \u2013 retired stories\" \/>\n<meta name=\"twitter:card\" content=\"summary_large_image\" \/>\n<meta name=\"twitter:label1\" content=\"Written by\" \/>\n\t<meta name=\"twitter:data1\" content=\"Inside Track \u2013 retired stories\" \/>\n\t<meta name=\"twitter:label2\" content=\"Est. 
reading time\" \/>\n\t<meta name=\"twitter:data2\" content=\"7 minutes\" \/>\n<script type=\"application\/ld+json\" class=\"yoast-schema-graph\">{\"@context\":\"https:\/\/schema.org\",\"@graph\":[{\"@type\":\"WebPage\",\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/\",\"url\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/\",\"name\":\"Microsoft adopts proactive method for preventing and mitigating failures - Inside Track Blog\",\"isPartOf\":{\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/#website\"},\"primaryImageOfPage\":{\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#primaryimage\"},\"image\":{\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#primaryimage\"},\"thumbnailUrl\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg\",\"datePublished\":\"2020-12-08T21:13:09+00:00\",\"dateModified\":\"2023-06-08T19:55:30+00:00\",\"author\":{\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/#\/schema\/person\/59e5f7b07dae629412c990cc1a63b575\"},\"breadcrumb\":{\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#breadcrumb\"},\"inLanguage\":\"en-US\",\"potentialAction\":[{\"@type\":\"ReadAction\",\"target\":[\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/\"]}]},{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#primaryimage\",\"url\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod
\/2020\/12\/10116_wordpress_hero.jpg\",\"contentUrl\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg\",\"width\":2300,\"height\":1294,\"caption\":\"Harsh Sharma, a Microsoft senior program manager, is improving how Microsoft identifies and responds to potential failures. (Photo submitted by Harsh Sharma)\"},{\"@type\":\"BreadcrumbList\",\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#breadcrumb\",\"itemListElement\":[{\"@type\":\"ListItem\",\"position\":1,\"name\":\"Home\",\"item\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/\"},{\"@type\":\"ListItem\",\"position\":2,\"name\":\"Microsoft adopts proactive method for preventing and mitigating failures\"}]},{\"@type\":\"WebSite\",\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/#website\",\"url\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/\",\"name\":\"Inside Track Blog\",\"description\":\"How Microsoft does IT\",\"potentialAction\":[{\"@type\":\"SearchAction\",\"target\":{\"@type\":\"EntryPoint\",\"urlTemplate\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/?s={search_term_string}\"},\"query-input\":{\"@type\":\"PropertyValueSpecification\",\"valueRequired\":true,\"valueName\":\"search_term_string\"}}],\"inLanguage\":\"en-US\"},{\"@type\":\"Person\",\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/#\/schema\/person\/59e5f7b07dae629412c990cc1a63b575\",\"name\":\"Inside Track \u2013 retired stories\",\"image\":{\"@type\":\"ImageObject\",\"inLanguage\":\"en-US\",\"@id\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/#\/schema\/person\/image\/ee0de87c339052d5d84852473bd7f213\",\"url\":\"https:\/\/secure.gravatar.com\/avatar\/24a8c329ab32afd1bc23fd1658d1acc2?s=96&d=mm&r=g\",\"contentUrl\":\"https:\/\/secure.gravatar.com\/avatar\/24a8c329ab32afd1bc23fd1658d1acc2?s=96&d=mm&r=g\",\"caption\":\"Inside Track \u2013 retired 
stories\"},\"description\":\"The content on this page was crafted to highlight a specific moment in time or the solutions that have led us to where we are today. It offers valuable insights into our journey and the progress made over the years. Check out the Inside Track blog page for our up-to-date stories around Microsoft.\",\"url\":\"https:\/\/www.microsoft.com\/insidetrack\/blog\/author\/insidetrackarchive\/\"}]}<\/script>\n<!-- \/ Yoast SEO plugin. -->","yoast_head_json":{"title":"Microsoft adopts proactive method for preventing and mitigating failures - Inside Track Blog","robots":{"index":"index","follow":"follow","max-snippet":"max-snippet:-1","max-image-preview":"max-image-preview:large","max-video-preview":"max-video-preview:-1"},"canonical":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/","og_locale":"en_US","og_type":"article","og_title":"Microsoft adopts proactive method for preventing and mitigating failures - Inside Track Blog","og_description":"This content has been archived, and while it was correct at time of publication, it may no longer be accurate or reflect the current situation at Microsoft. System crashes and other failures have long been a fact of life in the IT world. Microsoft has always worked hard to ensure that its customers have reliable [&hellip;]","og_url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/","og_site_name":"Inside Track Blog","article_published_time":"2020-12-08T21:13:09+00:00","article_modified_time":"2023-06-08T19:55:30+00:00","og_image":[{"width":2300,"height":1294,"url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg","type":"image\/jpeg"}],"author":"Inside Track \u2013 retired stories","twitter_card":"summary_large_image","twitter_misc":{"Written by":"Inside Track \u2013 retired stories","Est. 
reading time":"7 minutes"},"schema":{"@context":"https:\/\/schema.org","@graph":[{"@type":"WebPage","@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/","url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/","name":"Microsoft adopts proactive method for preventing and mitigating failures - Inside Track Blog","isPartOf":{"@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/#website"},"primaryImageOfPage":{"@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#primaryimage"},"image":{"@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#primaryimage"},"thumbnailUrl":"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg","datePublished":"2020-12-08T21:13:09+00:00","dateModified":"2023-06-08T19:55:30+00:00","author":{"@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/#\/schema\/person\/59e5f7b07dae629412c990cc1a63b575"},"breadcrumb":{"@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#breadcrumb"},"inLanguage":"en-US","potentialAction":[{"@type":"ReadAction","target":["https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/"]}]},{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#primaryimage","url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg","contentUrl":"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg","width":2300,"height":1294,"caption":"Harsh 
Sharma, a Microsoft senior program manager, is improving how Microsoft identifies and responds to potential failures. (Photo submitted by Harsh Sharma)"},{"@type":"BreadcrumbList","@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/microsoft-adopts-proactive-method-for-preventing-and-mitigating-failures\/#breadcrumb","itemListElement":[{"@type":"ListItem","position":1,"name":"Home","item":"https:\/\/www.microsoft.com\/insidetrack\/blog\/"},{"@type":"ListItem","position":2,"name":"Microsoft adopts proactive method for preventing and mitigating failures"}]},{"@type":"WebSite","@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/#website","url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/","name":"Inside Track Blog","description":"How Microsoft does IT","potentialAction":[{"@type":"SearchAction","target":{"@type":"EntryPoint","urlTemplate":"https:\/\/www.microsoft.com\/insidetrack\/blog\/?s={search_term_string}"},"query-input":{"@type":"PropertyValueSpecification","valueRequired":true,"valueName":"search_term_string"}}],"inLanguage":"en-US"},{"@type":"Person","@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/#\/schema\/person\/59e5f7b07dae629412c990cc1a63b575","name":"Inside Track \u2013 retired stories","image":{"@type":"ImageObject","inLanguage":"en-US","@id":"https:\/\/www.microsoft.com\/insidetrack\/blog\/#\/schema\/person\/image\/ee0de87c339052d5d84852473bd7f213","url":"https:\/\/secure.gravatar.com\/avatar\/24a8c329ab32afd1bc23fd1658d1acc2?s=96&d=mm&r=g","contentUrl":"https:\/\/secure.gravatar.com\/avatar\/24a8c329ab32afd1bc23fd1658d1acc2?s=96&d=mm&r=g","caption":"Inside Track \u2013 retired stories"},"description":"The content on this page was crafted to highlight a specific moment in time or the solutions that have led us to where we are today. It offers valuable insights into our journey and the progress made over the years. 
Check out the Inside Track blog page for our up-to-date stories around Microsoft.","url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/author\/insidetrackarchive\/"}]}},"jetpack_featured_media_url":"https:\/\/www.microsoft.com\/insidetrack\/blog\/uploads\/prod\/2020\/12\/10116_wordpress_hero.jpg","jetpack_sharing_enabled":true,"jetpack_shortlink":"https:\/\/wp.me\/p9hcZA-1ya","_links":{"self":[{"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/posts\/5962","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/users\/146"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/comments?post=5962"}],"version-history":[{"count":7,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/posts\/5962\/revisions"}],"predecessor-version":[{"id":11289,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/posts\/5962\/revisions\/11289"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/media\/5965"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/media?parent=5962"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/categories?post=5962"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/tags?post=5962"},{"taxonomy":"author","embeddable":true,"href":"https:\/\/www.microsoft.com\/insidetrack\/blog\/wp-json\/wp\/v2\/coauthors?post=5962"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}