In addition to the physical and data redundancy, as one of our core strengths we build Office clients to provide functional redundancy to enable you to be productive using offline functionality when there is no network connectivity.<\/li>\n<\/ul>\nResiliency.<\/strong> Active load balancing and constant recovery testing across failure domains:<\/p>\n\nWe actively balance load to provide end users the best possible experiences in an automated manner. These mechanisms also dynamically prioritize, performing low priority tasks during low activity periods and deferring them during high load.<\/li>\n We have both automated and manual failover to healthy resources during hardware or software failures and monitoring alerts.<\/li>\n We routinely perform recovery across failure domains to ensure readiness for circumstances require failovers.<\/li>\n<\/ul>\nDistributed Services.<\/strong> Functionally distributed component services:<\/p>\n\nThe component services in Office 365 like Exchange, SharePoint, Lync and Office Web Apps are functionally distributed, ensuring that the scope and impact of failure in one area is limited to that area alone and not impact others.<\/li>\n We replicate directory data across these component services so that if one service is experiencing an issue, users are able to login and use other services seamlessly.<\/li>\n Our operations and deployment teams benefit from the distributed nature of our service, simplifying all aspects of maintenance and deployment, diagnostics, repair and recovery.<\/li>\n<\/ul>\nMonitoring. <\/strong>Extensive monitoring, recovery and diagnostic tools:<\/p>\n\nOur internal monitoring systems continuously monitor the service for any failure and are built to drive automated recovery of the service.<\/li>\n Our systems analyze any deviations in service behavior to alert on-call engineers to take proactive measures.<\/li>\n We also have Outside-In monitoring constantly executing from multiple locations around the world both from trusted third party services (for independent SLA verification) and our own worldwide datacenters to raise alerts.<\/li>\n For diagnostics, we have extensive logging, auditing, and tracing. Granular tracing and monitoring helps us isolate issues to root cause.<\/li>\n<\/ul>\nSimplification. <\/strong>Reduced complexity drives predictability:<\/p>\n\nWe use standardized components<\/em> wherever possible. This leads to fewer deployment and issue isolation complexities as well as predictable failures and recovery.<\/li>\nWe use standardized process<\/em> wherever possible. The focus is not only on automation but making sure that critical processes are repeated and repeatable.<\/li>\nWe have architected the software components to be loosely coupled so that their deployment and ongoing health don’t require complex orchestration.<\/li>\n Our change management goes through progressive, staged, instrumented rings of scope and validation before being deployed worldwide.<\/li>\n<\/ul>\nHuman back-up. <\/strong>24\/7 on-call support:<\/p>\n\nWhile we have automated recovery actions where possible, we also have a team of on-call professionals standing by 24×7 to support you. This team includes support engineers, product developers, program managers, product managers and senior leadership.<\/li>\n With an entire team on call, we have the ability to provide rapid response and information collection towards problem resolution.<\/li>\n Our on-call professionals while providing back-up, also improve the automated systems every time they are called to help.<\/li>\n<\/ul>\nContinuous learning<\/h3>\n We understand that there will be times when you may experience service interruptions. We do a thorough post-incident review every time an incident occurs regardless of the magnitude of impact. A post-incident review consists of an analysis of what happened, how we responded and how we prevent similar incidents in the future. In the interest of transparency and accountability, we share post-incident review for any major service incidents if your organization was affected. As a large enterprise, we also “eat our own dogfood,” i.e., use our own pre-production service to conduct day-to-day business here at Microsoft. Continuous improvement is a key component to provide a highly available, world-class service.<\/p>\n
Consistent communication<\/h3>\n Transparency requires consistent communication, especially when you are using online productivity services to conduct your business. We have a number of communication channels such as email, RSS feeds and the Service Health Dashboard. As an Office 365 customer, you get a detailed view into the availability of services that\u00a0are relevant to your organization. The Office 365 Service Health Dashboard is\u00a0your window into the current status of your services and your licenses. We continue to drive improvements into the Service Health Dashboard including tracking\u00a0timeliness of updates\u00a0to ensure so that you have full insight into your services health.<\/p>\n
<\/p>\n
We also have some exciting new tools to improve your ability to stay up to date with the service.\u00a0 Last week we\u00a0released a new feature in the administration portal called “Message Center.” Message Center is a central hub for service communications, tenant reporting and actions required by administrators.\u00a0 Also, by the end of this year, administrators can expect a new mobile app that will provide service health information as well as other communications regarding their service.<\/p>\n
Running a comprehensive and evolving service at ever increasing scale is a challenge and there will be service interruptions despite our efforts. We want to assure you that we\u00a0are continually learning and are relentless in our\u00a0commitment to provide you with a reliable highly available service that meets your expectations.\u00a0 Service continuity is more than\u00a0an engineering principle it is a commitment to customers in our SLA and as one of the key pillars of Office 365 Trust Center (the other four pillars being Privacy, Security, Compliance and Transparency).\u00a0This public disclosure of Office 365 uptime is evidence of our ongoing commitment to both Service Continuity and Transparency.<\/p>\n","protected":false},"excerpt":{"rendered":"
“Your complete office in the cloud” is how we think of Microsoft Office 365. While it gives us enormous pride that one billion people use Office, we deeply appreciate the responsibility we have to meet and exceed our customers’ expectations every day. We recognize that productivity apps are mission critical; using them is how work gets done. It is imperative for us to ensure our service is trustworthy and reliable while we continue to add new capabilities to Office 365. Our measure for this is service availability.<\/p>\n","protected":false},"author":10,"featured_media":0,"comment_status":"open","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"content-type":[4034],"product":[4047,4048,4036,4055,4063,4066,4075],"audience":[4087,4081,4091],"tags":[],"coauthors":[4458],"class_list":["post-258","post","type-post","status-publish","format-standard","hentry","content-type-news","product-excel","product-exchange","product-microsoft-365","product-mobile-apps","product-powerpoint","product-sharepoint","product-word","audience-education","audience-enterprise","audience-government"],"yoast_head":"\n
Cloud services you can trust: Office 365 availability | Microsoft 365 Blog<\/title>\n \n \n \n \n \n \n \n \n \n \n \n \n \n \n \n\t \n\t \n\t \n