{"id":572415,"date":"2019-03-13T09:58:54","date_gmt":"2019-03-13T16:58:54","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=572415"},"modified":"2019-03-13T09:58:54","modified_gmt":"2019-03-13T16:58:54","slug":"researchers-seek-to-simplify-the-complex-in-cloud-computing","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/researchers-seek-to-simplify-the-complex-in-cloud-computing\/","title":{"rendered":"Researchers seek to simplify the complex in cloud computing"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-large wp-image-572418\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-1024x576.png\" alt=\"\" width=\"1024\" height=\"576\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-1024x576.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-300x169.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-768x432.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-1066x600.png 1066w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-655x368.png 655w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-343x193.png 343w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788.png 1400w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<p>From February 26\u201328, researchers gathered in Boston for the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/www.usenix.org\/conference\/nsdi19\">16th USENIX Symposium on Networked Systems Design and Implementation (NSDI)<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, one of the top conferences in the networking and systems field. <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/event\/nsdi-19\/\">Microsoft, a silver sponsor of the event, was represented by researchers serving on the program committee, as well as those presenting papers<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, including two research teams using novel abstractions to empower and better serve cloud users.<\/p>\n<p>\u201cBoth papers describe new ways to cope with the ever-increasing scale and complexity of what it means to do state-of-the-art computing in the cloud,\u201d said <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/moscitho\/\">Thomas Moscibroda, Microsoft Partner Research Scientist, Azure Compute<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n<p>With their respective work, the teams seek to simplify the underlying operations\u2014or what <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/kokarana\/\">Microsoft Principal Scientist Konstantinos Karanasos<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, co-author on the other paper, calls \u201cthe magic\u201d\u2014to deliver a more efficient and seamless user experience.<\/p>\n<h3>Direct Universal Access: A communications architecture<\/h3>\n<p>Field programmable gate arrays (FPGAs) are <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/blogs.microsoft.com\/ai\/project_brainwave_catapult_moonshot\/\">becoming widely used in today\u2019s data centers<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. These reprogrammable circuits combine the advantages of hardware speed while offering some of the flexibility that makes software ideal for programming. But taking advantage of their full potential at cloud computing scale has been extremely challenging for several reasons, and researchers in the <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/group\/networking-research-group-2\/\">Networking Research Group at Microsoft Research Asia<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, in collaboration with engineering leaders in <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/azure.microsoft.com\/en-us\/\">Microsoft Azure<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>, are hoping to change that by addressing one such obstacle: the absence of an efficient, reliable, easy-to-use communications layer.<\/p>\n<p>In their paper <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2018\/10\/nsdi19spring-final64.pdf\">\u201cDirect Universal Access: Making Data Center Resources Available to FPGA,\u201d<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> they present a new communications architecture, one that <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/pengc\/\">Microsoft Researcher Peng Cheng<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> and his co-authors liken to the Internet Protocol or the operating system of a computer.<\/p>\n<p>\u201cOur challenge has been, how do we provide a software-like IP layer inside this hardware-based platform,\u201d said Cheng, adding that the goal is a unified platform.<\/p>\n<p>Currently, communication between pairs of FPGAs and other data center resources, such as CPUs, GPUs, memory, and storage, is complex, making programming large-scale heterogenous applications impractical and, at times, nearly impossible.<\/p>\n<p>There are several reasons for this, the researchers explain in their paper: First, the communications paradigms used for connecting resources that are local to a server and resources that are remote\u2014that is, located on a different server in the data center\u2014are different and use vastly different communications stacks. Secondly, resources are named in a way that is specific to the server they live on. And lastly, current FPGA architecture is inefficient when it comes to multiplexing multiple diverse communications links to different local and remote resources.<\/p>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"aligncenter wp-image-572424 size-large\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-1024x575.jpg\" alt=\"\" width=\"1024\" height=\"575\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-1024x575.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-300x168.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-768x431.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-1066x600.jpg 1066w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-655x368.jpg 655w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06-343x193.jpg 343w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-06.jpg 1304w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><\/p>\n<div id=\"attachment_572433\" style=\"width: 1034px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-572433\" class=\"wp-image-572433 size-large\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-08-1024x456.jpg\" alt=\"\" width=\"1024\" height=\"456\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-08-1024x456.jpg 1024w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-08-300x134.jpg 300w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-08-768x342.jpg 768w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-08.jpg 1210w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><p id=\"caption-attachment-572433\" class=\"wp-caption-text\">Current FPGA communications architecture (top) compared to an ideal FPGA communications architecture. Deploying a common communications interface, a global unified naming scheme, and an underlying network service providing routing and multiplexing, the ideal architecture captured by DUA will allow designers and developers to build large-scale heterogenous FPGA-based applications.<\/p><\/div>\n<p>Direct Universal Access (DUA) makes communication among data center resources possible and easier by providing a common communications interface, a global unified naming scheme, and an underlying network service that provides routing and resource multiplexing, creating a common resource pool that can be accessed uniformly and efficiently. The architecture is implemented as an overlay network\u2014a layer between the developer and the various data center communications stacks and resources\u2014and supports systems and communications protocols currently in place. This is critical because it means that no manufacturing overhaul of existing devices is required; DUA can be deployed as is on existing frameworks.<\/p>\n<p>\u201cDUA connects all resources in a data center regardless of location and type of resource,\u201d explained <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/people\/rashu\/#!publications\">Microsoft Associate Researcher Ran Shu<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>. \u201cAll these resources are in a unified naming space and unified IP-based networking scheme, so each application can access different resources with the same code, so it is easy for developers to port their code, and it greatly reduces application development time.\u201d<\/p>\n<p>The researchers hope DUA will allow developers to build large-scale, diverse, and novel FPGA-based applications that, before this, haven\u2019t been within reach.<\/p>\n<p>\u201cThe proliferation of FPGAs in the cloud is a reality and offers gigantic promise because if we can make it easy for developers to use and connect the different types of data center resources in an efficient way, they can build novel types of applications that are inconceivable otherwise,\u201d said Moscibroda.<\/p>\n<p>To demonstrate this potential, the research team has built two large-scale FPGA applications\u2014regular expression matching for packet inspection and deep crossing, a machine learning algorithm\u2014on top of DUA.<\/p>\n<p>The team is in the process of making DUA open-source, and it will be available on GitHub soon.<\/p>\n<h3>Hydra: A resource management framework<\/h3>\n<p>Cloud services for storing, analyzing, and managing big data can process thousands of jobs for thousands of users in a single day. No small order. And it\u2019s the responsibility of the service\u2019s resource manager to make sure these jobs go off without a hitch. The resource management infrastructure determines where a particular job and its tasks should run and what share of resources each user should get to accomplish said job. For smaller-scale services, the challenges of task placement and share determination can generally be tackled together. But Microsoft is no small-scale operation.<\/p>\n<p>Serving 10,000 users and running half a million jobs daily across hundreds of thousands of machines, Microsoft was in need of a new approach, so its researchers set out to deliver a resource manager capable of offering the scalability and utilization of its existing infrastructure while also meeting several additional key requirements: It needed to be able to handle not only a high volume of work but also a diverse workload, including both internal Microsoft applications and open-source frameworks; it needed to allocate resources in a more principled and efficient way; and it needed to make the testing of new features easier.<\/p>\n<p>The result of their work is Hydra, the main resource manager behind the big-data analytics clusters of Microsoft today. The infrastructure has actually been in place for a few years now, the team migrating 99 percent of users over in real-time while continuing services. \u201cThis is what we call changing airplane engines mid-flight,\u201d Karanasos said with a laugh.<\/p>\n<p>In their paper <a href=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2018\/12\/NSDI19_paper159_CR.pdf\">\u201cHydra: A Federated Resource Manager for Data-Center Scale Analytics,\u201d<span class=\"sr-only\"> (opens in new tab)<\/span><\/a> Karanasos and his co-authors unveil the newest\u2014and arguably one of the most important components\u2014to Hydra, the federated architecture it leverages to divide and conquer task placement and share determination.<\/p>\n<p>\u201cWith our requirements, we had to split the problem,\u201d explained Karanasos. \u201cAnything else would not scale or would give up quality.\u201d<\/p>\n<div id=\"attachment_572436\" style=\"width: 1034px\" class=\"wp-caption aligncenter\"><img loading=\"lazy\" decoding=\"async\" aria-describedby=\"caption-attachment-572436\" class=\"size-large wp-image-572436\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-07-1024x618.png\" alt=\"\" width=\"1024\" height=\"618\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-07-1024x618.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-07-300x181.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-07-768x464.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/pic-07.png 1037w\" sizes=\"(max-width: 1024px) 100vw, 1024px\" \/><p id=\"caption-attachment-572436\" class=\"wp-caption-text\">With its federated architecture (above), Hydra can scale to individual clusters of over 50,000 machines and perform scheduling decisions at rates that are 10 times to 100 times higher than existing resource managers.<\/p><\/div>\n<p>A federated architecture offers a middle-of-the-road design solution, falling in between a centralized architecture, which is effective at share determination but harder to scale, and a distributed architecture, which can scale well but makes it difficult to impose strong scheduling guarantees.<\/p>\n<p>With a federated architecture, each cluster of machines is separated into loosely coordinating subclusters, allowing Hydra to determine placement and resource sharing separately. Placement is handled locally in each subcluster while share determination is based on an aggregate view of cluster resources. With this architecture, Hydra can scale to individual clusters of over 50,000 machines and perform scheduling decisions at rates that are 10 times to 100 times higher than existing resource managers. Hydra\u2019s decisions are determined by a set of policies that can be dynamically adjusted based on the cluster conditions and the user needs, providing great flexibility to the cluster operators.<\/p>\n<p>\u201cHydra\u2019s carefully designed architecture and policies hide from the users the existence of multiple subclusters, providing them with the illusion of a single massive cluster,\u201d said Karanasos. \u201cUsers simply submit their jobs to the cluster, and Hydra decides the machines that the jobs&#8217; tasks will run on, possibly including machines spread across subclusters.\u201d<\/p>\n<p>Since its deployment, Hydra has scheduled a trillion tasks and processed over a zettabyte of data. The good news is Hydra is open-source as an extension of the <a class=\"msr-external-link glyph-append glyph-append-open-in-new-tab glyph-append-xsmall\" href=\"https:\/\/hadoop.apache.org\/\">Apache Hadoop YARN resource management project<span class=\"sr-only\"> (opens in new tab)<\/span><\/a>.<\/p>\n","protected":false},"excerpt":{"rendered":"<p>From February 26\u201328, researchers gathered in Boston for the 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), one of the top conferences in the networking and systems field. Microsoft, a silver sponsor of the event, was represented by researchers serving on the program committee, as well as those presenting papers, including two research [&hellip;]<\/p>\n","protected":false},"author":38022,"featured_media":572418,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"footnotes":""},"categories":[194485,194463],"tags":[],"research-area":[13547],"msr-region":[256048],"msr-event-type":[197941],"msr-locale":[268875],"msr-post-option":[],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-572415","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-networking","category-systems","msr-research-area-systems-and-networking","msr-region-global","msr-event-type-conferences","msr-locale-en_us"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[199560],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[],"related-projects":[],"related-events":[564666],"related-researchers":[],"msr_type":"Post","featured_image_thumbnail":"<img width=\"960\" height=\"540\" src=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788.png\" class=\"img-object-cover\" alt=\"\" decoding=\"async\" loading=\"lazy\" srcset=\"https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788.png 1400w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-300x169.png 300w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-768x432.png 768w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-1024x576.png 1024w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-1066x600.png 1066w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-655x368.png 655w, https:\/\/www.microsoft.com\/en-us\/research\/uploads\/prod\/2019\/03\/NSDI_Computing-In-The-Loud_Blog_Site_03_2019_1400x788-343x193.png 343w\" sizes=\"(max-width: 960px) 100vw, 960px\" \/>","byline":"","formattedDate":"March 13, 2019","formattedExcerpt":"From February 26\u201328, researchers gathered in Boston for the 16th USENIX Symposium on Networked Systems Design and Implementation (NSDI), one of the top conferences in the networking and systems field. Microsoft, a silver sponsor of the event, was represented by researchers serving on the program&hellip;","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/572415"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/38022"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=572415"}],"version-history":[{"count":5,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/572415\/revisions"}],"predecessor-version":[{"id":572745,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/572415\/revisions\/572745"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/572418"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=572415"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=572415"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=572415"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=572415"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=572415"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=572415"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=572415"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=572415"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=572415"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=572415"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=572415"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}