{"id":642939,"date":"2020-03-23T10:37:52","date_gmt":"2020-03-23T17:37:52","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=642939"},"modified":"2020-04-30T15:54:48","modified_gmt":"2020-04-30T22:54:48","slug":"coyote-making-it-easier-for-developers-to-build-reliable-asynchronous-software","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/coyote-making-it-easier-for-developers-to-build-reliable-asynchronous-software\/","title":{"rendered":"Coyote: Making it easier for developers to build reliable asynchronous software"},"content":{"rendered":"<h3><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-645027 \" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_Coyote-v2.gif\" alt=\"\" width=\"808\" height=\"455\" \/><\/h3>\n<p>For developers, writing bug-free software that doesn\u2019t crash is getting difficult in an increasingly competitive world where software needs to ship before it becomes obsolete. This challenge is especially apparent with online cloud services, which are often dictated by aggressive shipping deadlines. Cloud services are distributed programs comprising multiple back-end systems that continuously exchange asynchronous signals while responding to incoming web requests. They are complex by nature, hard to get right, and require protection from failures that could jeopardize client data or halt key services.<\/p>\n<p>Such a programming environment is full of <em><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/core\/non-determinism\">non-determinism<span class=\"sr-only\"> (opens in new tab)<\/span><\/a><\/em>, or scenarios outside developers\u2019 control. For example, there\u2019s non-determinism in the scheduling of concurrent operations, the order in which messages are received, the random system failures, and the random firing of timers, either for retry logic or timeouts from other services that have become unresponsive. Non-deterministic systems exist in all software domains, not just cloud services, and best practices for building and testing these systems fall short. Techniques such as failure injection and stress testing can either be too complicated to set up or time-consuming with no guarantees that found bugs can be reproduced. Consider a cloud service that, let\u2019s say, implements the Raft consensus protocol among a group of machines in an effort to provide a highly reliable fault-tolerant cluster to clients. Such a system will have hundreds of messages flying back and forth between the machines. You do stress testing and don\u2019t find any bugs, but can you really be confident that you\u2019re ready to ship?<\/p>\n<p>We\u2019re excited to announce the release of Coyote, an <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/github.com\/microsoft\/coyote\">open-source .NET framework<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> from Microsoft Research that guides developers toward designing, implementing, and testing code in a way that embraces non-determinism and asynchrony and helps them create asynchronous systems quickly and confidently. Instead of trying to hide non-determinism, Coyote helps explicitly model non-determinism in a system and uses the information to provide a state-of-the-art testing tool. This advanced testing tool can control every source of non-determinism defined, including the exact order of every asynchronous operation, which allows it to systematically explore all the possibilities. The tool runs very quickly and reaches unheard-of levels of coverage of all non-deterministic choices in code, enabling it to find most of the tricky bugs in a way that\u2019s also trivial to reproduce and debug.<\/p>\n<p>A result of years of investment from Microsoft Research in the space of program verification and testing, Coyote is being used to build various components of <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/azure.microsoft.com\/en-us\/product-categories\/compute\/\">Microsoft Azure Compute<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, such as <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/docs.microsoft.com\/en-us\/azure\/batch\/\">Azure Batch<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/azure.microsoft.com\/en-us\/services\/blockchain-service\/\">Microsoft Azure Blockchain<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. The framework has received <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/case-studies\/azure-batch-service\">positive feedback<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> from the Azure teams using it. One engineer said, \u201cFeatures developed in Coyote test mode worked perfectly in production first time,\u201d while another noted, \u201cA feature that took six months without Coyote was developed in one month using Coyote.\u201d Engineers expressed experiencing a \u201csignificant confidence boost\u201d as a result, allowing them to \u201cchurn [out] code much faster than before.\u201d<\/p>\n<h3>Coyote programming models<\/h3>\n<p>Coyote, which evolved from a previous Microsoft Research project called P#, is a combination of a programming model, a lightweight runtime, and a testing infrastructure all packaged as a portable library with minimal dependencies. The framework supports two main programming models: an <em>asynchronous tasks<\/em> programming model (in preview) and an <em>asynchronous actors<\/em> programming model.<\/p>\n<p>If you\u2019re happy developing your code using C# <em>async\/await<\/em> construct for asynchronous tasks, then Coyote can add value on top of that. If you switch to the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/programming-models\/async\/overview\">Coyote task library<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, the Coyote testing tool will look for bugs by systematically exploring the concurrency between your tasks. However, while the C# <em>async\/await<\/em> feature is wonderful, it sometimes yields code that is too parallel, resulting in a lot of complexity. For example, when performing two or more concurrent tasks, you may need to guard private data with locks, and then you have to worry about deadlocks. Coyote offers an alternative that solves this with the more advanced <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/programming-models\/actors\/overview\">asynchronous actors programming model<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n<p>Actors constrain your parallelism so that a given actor receives messages in a serialized order via an inbox. <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/en.wikipedia.org\/wiki\/Actor_model\">Actor models<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> have gained a lot of popularity, especially in the area of distributed systems, precisely because they help manage the complexity of a system. Actors essentially embrace asynchrony by making every message between actors an <em>async<\/em> operation. Coyote fully understands the semantics of actors and can do a world-class job of testing them and finding even the most subtle bugs. The framework goes one step further, providing a type of actor called a <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/programming-models\/actors\/state-machines\">state machine<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, which it knows how to fully test, ensuring every state is covered and every state transition is tested.<\/p>\n<p>&nbsp;<\/p>\n<div id=\"attachment_645066\" style=\"width: 630px\" class=\"wp-caption aligncenter\"><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/programming-models\/actors\/state-machine-demo\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-645066\" class=\"wp-image-645066 size-full\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Coyote-12.jpg\" alt=\"\" width=\"620\" height=\"533\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Coyote-12.jpg 620w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Coyote-12-300x258.jpg 300w\" sizes=\"auto, (max-width: 620px) 100vw, 620px\" \/><p id=\"caption-attachment-645066\" class=\"wp-caption-text\"><span class=\"sr-only\"> (opens in new tab)<\/span><\/a> The above animation shows Coyote testing in action on a five-server Raft implementation that was written using Coyote. The Coyote testing tool controls message ordering intelligently and, in this case, finds a bug in the implementation where two leaders get elected. For better visualization, the animation has been slowed down from the actual testing speed.<\/p><\/div>\n<h3>Building blocks of Coyote applications<\/h3>\n<p>The Coyote programming models are easy to use, so even with minimal investment, you get the huge upside of a powerful testing tool that automatically finds bugs in your code. And the more time and resources you invest in Coyote, the greater the benefits. Coyote provides the following building blocks for more reliable software:<\/p>\n<ul>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/ref\/Microsoft.Coyote.Tasks\/TaskType\">Task<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>: a wrapper on .NET tasks that allows the Coyote testing tool to take control of scheduling<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/ref\/Microsoft.Coyote.Actors\/ActorType\">Actor<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/ref\/Microsoft.Coyote.Actors\/StateMachineType\">StateMachine<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, and Event: base classes for the Coyote actors programming model<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/ref\/Microsoft.Coyote.Specifications\/SpecificationType\">Specification<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/ref\/Microsoft.Coyote.Specifications\/MonitorType\">Monitor<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>: ways to embed checks into code that can be verified at test time; this also includes easy ways of monitoring <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/core\/liveness-checking\">liveness<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, ensuring that code doesn\u2019t get stuck spinning its wheels<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/programming-models\/actors\/timers\">Timers<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>: a way to model timing activities in a system, which is especially useful in the design of mocks that model external systems<\/li>\n<li><a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/core\/logging\">Logging<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>: a feature that allows you to see debug messages in context with decisions being made during a Coyote test run, including nice ways to visualize what\u2019s happening<\/li>\n<\/ul>\n<p>In addition to the above constructs, Coyote allows you to use the full power of the C# programming language. To get the best test performance, we recommend mocking all the systems outside your control. This allows the Coyote testing tool to test code locally on a laptop. The following example\u2014a shopping cart system with all external services written as Coyote mock actors\u2014shows a typical test setup:<\/p>\n<div style=\"width: 1817px\" class=\"wp-caption alignnone\"><img loading=\"lazy\" decoding=\"async\" class=\"size-medium\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/Coyote-8.jpg\" alt=\"flow chart \" width=\"1807\" height=\"356\" \/><p class=\"wp-caption-text\">To get the best test performance from the Coyote framework, it\u2019s recommended that developers mock all the systems outside their control. Above is a typical test setup, a shopping cart system with all external services written as Coyote mock actors using the asynchronous actors programming model.<\/p><\/div>\n<p>Larger teams can share their Coyote mocks for improved code reuse in testing. In fact, you can publish your Coyote mocks as a precise protocol definition of your public services. The Coyote testing tool can then be used to fully certify that new customer code is working properly with the mock model of the service before customers even attempt to use your production APIs.<\/p>\n<p>Coyote mocks can be more sophisticated than normal mocks. They not only specify the asynchronous API required to talk to a service, but they can also serve as a rich model of how the service is expected to behave. Most teams are already building mocks, so switching that over to work with Coyote usually requires minimal effort.<\/p>\n<h3>Learn more and contribute<\/h3>\n<p>The Coyote package is available on <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.nuget.org\/packages\/Microsoft.Coyote\/\">NuGet<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, so <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/get-started\/install\">getting started<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> with Coyote is very simple. Coyote is also <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"http:\/\/github.com\/microsoft\/coyote\">open source on GitHub<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and available to all who want to provide feedback and suggestions. We\u2019d love to see your pull requests if you have specific ideas on how to improve Coyote. You can learn more about the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/microsoft.github.io\/coyote\/learn\/resources\/publications\">research behind Coyote<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> or register to watch the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/note.microsoft.com\/MSR-Webinar-Coyote-Registration-On-Demand.html\">Coyote webinar<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>\u00a0hosted by Chris Lovett.<\/p>\n<p>We hope you, too, can benefit from more confident coding of asynchronous systems using Coyote!<\/p>\n","protected":false},"excerpt":{"rendered":"<p>For developers, writing bug-free software that doesn\u2019t crash is getting difficult in an increasingly competitive world where software needs to ship before it becomes obsolete. This challenge is especially apparent with online cloud services, which are often dictated by aggressive shipping deadlines. Cloud services are distributed programs comprising multiple back-end systems that continuously exchange asynchronous [&hellip;]<\/p>\n","protected":false},"author":38838,"featured_media":647541,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"categories":[1],"tags":[],"research-area":[13560],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-642939","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-blog","msr-research-area-programming-languages-software-engineering","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[615984],"related-events":[],"related-researchers":[{"type":"user_nicename","value":"Akash Lal","user_id":30905,"display_name":"Akash Lal","author_link":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/akashl\/\" aria-label=\"Visit the profile page for Akash Lal\">Akash Lal<\/a>","is_active":false,"last_first":"Lal, Akash","people_section":0,"alias":"akashl"}],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_ProjectCoyote-v3-1-960x540.png\" class=\"img-object-cover\" alt=\"\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_ProjectCoyote-v3-1-960x540.png 960w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_ProjectCoyote-v3-1-1066x600.png 1066w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_ProjectCoyote-v3-1-655x368.png 655w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_ProjectCoyote-v3-1-343x193.png 343w, https:\/\/www.microsoft.com\/en-us\/research\/wp-content\/uploads\/2020\/03\/MSR_ProjectCoyote-v3-1-640x360.png 640w\" sizes=\"auto, (max-width: 960px) 100vw, 960px\" \/>","byline":"<a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/akashl\/\" title=\"Go to researcher profile for Akash Lal\" aria-label=\"Go to researcher profile for Akash Lal\" data-bi-type=\"byline author\" data-bi-cN=\"Akash Lal\">Akash Lal<\/a>","formattedDate":"March 23, 2020","formattedExcerpt":"For developers, writing bug-free software that doesn\u2019t crash is getting difficult in an increasingly competitive world where software needs to ship before it becomes obsolete. This challenge is especially apparent with online cloud services, which are often dictated by aggressive shipping deadlines. Cloud services are&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/642939","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/38838"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=642939"}],"version-history":[{"count":21,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/642939\/revisions"}],"predecessor-version":[{"id":655206,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/642939\/revisions\/655206"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/647541"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=642939"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=642939"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=642939"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=642939"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=642939"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=642939"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=642939"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=642939"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=642939"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=642939"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=642939"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}