{"id":500648,"date":"2018-08-21T08:04:36","date_gmt":"2018-08-21T15:04:36","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=500648"},"modified":"2018-08-21T08:04:36","modified_gmt":"2018-08-21T15:04:36","slug":"microsoft-research-bringing-its-best-to-sigcomm-2018","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-research-bringing-its-best-to-sigcomm-2018\/","title":{"rendered":"Microsoft Research bringing its best to SIGCOMM 2018"},"content":{"rendered":"
<\/p>\n
Microsoft Research is actively developing technologies as we continually strive to make our network and online services the most performant and efficient on the planet and this includes openly sharing our progress in advancing the state of the art with the research community. At the upcoming SIGCOMM 2018 (opens in new tab)<\/span><\/a> \u2013 the annual flagship conference organized by Association for Computing Machinery’s Special Interest Group on Data Communications, held this year in Budapest, August 20-24 \u2013 Microsoft Research will present multiple papers that cover a wide spectrum of its network, from data center networks, to wide area networks, to the large-scale service that analyzes video streams. We are particularly proud of our accomplishments this year and looking forward to sharing our knowledge and experience with you in person.<\/p>\n This post previews several papers that we will be presenting at SIGCOMM.<\/p>\n Let\u2019s begin with data center networks. Storage systems in data centers are an important component of large-scale online services. They typically perform replicated transactional operations for high data availability and integrity. Today, however, such operations suffer from high tail latency even with recent kernel bypass and storage optimizations and thus affect the predictability of end-to-end performance of these services. We observe that the root cause of the problem is the involvement of the CPU, a precious commodity in multi-tenant settings, in the critical path of replicated transactions.<\/p>\n In our paper, Group-Based NIC-Offloading to Accelerate Replicated Transactions in Multi-Tenant Storage Systems<\/a>, we present a new framework that removes the CPU from the critical path of replicated transactions in storage systems by offloading them to commodity RDMA NICs with non-volatile memory as the storage medium. To achieve this, we develop new and general NIC offloading primitives that can perform memory operations on all nodes in a replication group while guaranteeing ACID properties without CPU involvement. We demonstrate that popular storage applications can be easily optimized using our primitives. Our evaluation results show that HyperLoop can reduce 99th percentile latency \u2248 800\u00d7 with close to 0% CPU consumption on replicas.<\/p>\n Figure 1: Hyperloop reduces 99th percentile latency of group-based storage replication by up to 800x.<\/p><\/div>\n Effective Traffic Engineering for Optical Links in Wide Area Network<\/p>\n Outside data centers, Microsoft researchers also have achieved a breakthrough. Fiber optic cables connecting different data centers are an expensive resource acquired by large organizations with significant monetary investment. Their importance has driven a conservative deployment approach with redundancy and reliability baked in at multiple layers.<\/p>\nLow-latency Storage System Enabled by Our Novel RDMA Networks<\/h3>\n
