{"id":1024353,"date":"2024-04-16T09:00:00","date_gmt":"2024-04-16T16:00:00","guid":{"rendered":""},"modified":"2024-04-18T12:48:01","modified_gmt":"2024-04-18T19:48:01","slug":"microsoft-at-ndsi-2024-discoveries-and-implementations-in-networked-systems","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/microsoft-at-ndsi-2024-discoveries-and-implementations-in-networked-systems\/","title":{"rendered":"Microsoft at NSDI 2024: Discoveries and implementations in networked systems"},"content":{"rendered":"\n
\"nsdi'24<\/figure>\n\n\n\n

Networked systems and their applications are essential in building the reliable, scalable, secure, and innovative infrastructure required to meet society\u2019s evolving needs. One of the premier events in this field, the 21st<\/sup> USENIX Symposium on Networked Systems Design and Implementation (opens in new tab)<\/span><\/a> (NSDI \u201824), provides a platform for researchers and experts to share insights, present research findings, and collaborate on the latest advances in the design, implementation, and evaluation of networked and distributed systems.<\/p>\n\n\n\n

Microsoft is honored to support NSDI \u201824 as a returning sponsor. This partnership underscores our commitment to fostering innovation and research within the field. Additionally, members of our team have taken on key roles in organizing the event, including contributions to the program committee and leadership as conference co-chair.<\/p>\n\n\n\n

We are pleased to announce that 19 papers from Microsoft researchers and their partners have been accepted to the conference, including a paper on Autothrottle, a resource management framework, which won the Outstanding Paper Award. These papers represent a broad spectrum of research topics, ranging from 5G, space, datacenters, and wide-area networking to applications in artificial intelligence, security, video conferencing, and gaming. They encompass both early-stage research and systems already deployed in production. This post highlights some of this work.<\/p>\n\n\n\n\t

\n\t\t\n\n\t\t

\n\t\tSpotlight: Blog post<\/span>\n\t<\/p>\n\t\n\t

\n\t\t\t\t\t\t
\n\t\t\t\t\n\t\t\t\t\t\"MedFuzz\n\t\t\t\t<\/a>\n\t\t\t<\/div>\n\t\t\t\n\t\t\t
\n\n\t\t\t\t\t\t\t\t\t

MedFuzz: Exploring the robustness of LLMs on medical challenge problems<\/h2>\n\t\t\t\t\n\t\t\t\t\t\t\t\t

Medfuzz tests LLMs by breaking benchmark assumptions, exposing vulnerabilities to bolster real-world accuracy.<\/p>\n\t\t\t\t\n\t\t\t\t\t\t\t\t

\n\t\t\t\t\t
\n\t\t\t\t\t\t\n\t\t\t\t\t\t\tRead more\t\t\t\t\t\t<\/a>\n\t\t\t\t\t<\/div>\n\t\t\t\t<\/div>\n\t\t\t\t\t\t\t<\/div>\n\t<\/div>\n\t<\/div>\n\t\n\n\n

Paper highlights<\/h2>\n\n\n\n

Autothrottle: A Practical Bi-Level Approach to Resource Management for SLO-Targeted Microservices<\/a><\/h4>\n\n\n\n

Outstanding Paper Award<\/em><\/p>\n\n\n\n

As cloud applications increasingly adopt microservices, resource managers face two distinct levels of system behavior: end-to-end application latency and per-service resource usage. To coordinate them, this research introduces Autothrottle, a bi-level resource management framework for microservices with latency service-level objectives (SLOs). Autothrottle employs an application-wide learning-based controller to periodically set performance targets\u2014expressed as CPU throttle ratios\u2014for per-service heuristic controllers to achieve. When tested using production workloads, Autothrottle demonstrated higher CPU savings and fewer SLO violations than the best performing baseline from Kubernetes. <\/p>\n\n\n\n

Application-Level Service Assurance with 5G RAN Slicing<\/a><\/h3>\n\n\n\n

This paper presents Zipper, an innovative Radio Access Network (RAN) slicing system that provides application-specific throughput and latency. Traditional methods focus on overall slice performance, often neglecting individual app needs, leading to optimization challenges. Zipper addresses this by adopting a model-predictive control approach, precisely managing network use per user with an advanced algorithm for optimal bandwidth allocation. Additionally, it offers a tool for network operators to assess the feasibility of new applications without exceeding capacity.<\/p>\n\n\n\n

Spectrumize: Spectrum-efficient Satellite Networks for the Internet of Things<\/a><\/h3>\n\n\n\n

Low Earth Orbit satellite networks offer a promising way to connect low-power IoT devices globally without terrestrial gateways, using cost-effective pico-satellites. This paper addresses the communication challenges these networks face, such as limited link budgets, satellite movement, and signal interference. It introduces a novel technique using the Doppler shift-from-satellite motion to improve packet detection and decoding, even with low signal-to-noise ratios or during packet collisions. This method, called Spectrumize, achieves a threefold increase in packet detection and over 80 percent accuracy in decoding, significantly outperforming conventional methods.<\/p>\n\n\n\n

Solving Max-Min Fair Resource Allocations Quickly on Large Graphs<\/a><\/h3>\n\n\n\n

This paper tackles the problem of max-min fair resource allocation, crucial for WAN traffic engineering and cluster scheduling, especially as scale increases. This research streamlines the process into a single rapid optimization task to accommodate multi-path scenarios. Tests show that these algorithms surpass previous methods, delivering quicker, fairer, and more efficient allocations. Implemented in Azure’s WAN traffic engineering, these methods not only retain solution quality but also achieve about a threefold increase in processing speed.<\/p>\n\n\n\n

Finding Adversarial Inputs for Heuristics using Multi-level Optimization<\/a><\/h3>\n\n\n\n

Production systems often employ heuristics because they are faster and scale better than their optimal counterparts. However, practitioners may not be aware of the performance differences between a heuristic and the optimal solution, or between two heuristics, in real-world scenarios. MetaOpt addresses this by enabling direct comparison of heuristics against optimal solutions or other heuristics. It efficiently processes inputs for a solver to identify performance gaps and generate adversarial inputs that expose these differences, scaling effectively to real-world problems.<\/p>\n\n\n\n

NetVigil: Robust and Low-Cost Anomaly Detection for East-West Data Center Security<\/a><\/h3>\n\n\n\n

This research introduces NetVigil, an advanced anomaly-detection system that monitors east-west traffic in datacenters. It uses graph-based features from network flows and utilizes graph neural networks (GNNs) along with contrastive learning to improve its effectiveness against both common and sophisticated threats. When tested across multiple attack scenarios and real-world data, NetVigil significantly outperformed existing anomaly detection solutions in accuracy, cost-efficiency, and speed, offering a viable addition to safeguard data center traffic.<\/p>\n\n\n\n

Complete list of accepted publications by Microsoft researchers<\/h2>\n\n\n\n

ADR-X: ANN-Assisted Wireless Link Rate Adaptation for Compute-Constrained Embedded Gaming Devices
<\/a><\/strong>Hao Yin, University of Washington; Murali Ramanujam, Princeton University; Joe Schaefer, Stan Adermann, Srihari Narlanka, and Perry Lea, Microsoft; Ravi Netravali, Princeton University; 
Krishna Chintalapudi<\/a>, Microsoft Research<\/p>\n\n\n\n

Application-Level Service Assurance with 5G RAN Slicing<\/a>
<\/strong>Arjun Balasingam, MIT CSAIL; 
Manikanta Kotaru<\/a> and Victor Bahl<\/a>, Microsoft Research<\/p>\n\n\n\n

Autothrottle: A Practical Bi-Level Approach to Resource Management for SLO-Targeted Microservices<\/a>
<\/strong>Zibo Wang, University of Science and Technology of China and Microsoft Research; Pinghe Li, ETH Zurich; 
Chieh-Jan Mike Liang<\/a>, Microsoft Research; Feng Wu, University of Science and Technology of China; Francis Y. Yan<\/a>, Microsoft Research<\/p>\n\n\n\n

CHISEL: An optical slice of the wide-area network<\/a>
<\/strong>Abhishek Vijaya Kumar, Cornell University; Bill Owens, NYSERnet; 
Nikolaj Bj\u00f8rner<\/a>, Binbin Guan, Yawei Yin, and Victor Bahl<\/a>, Microsoft; Rachee Singh, Cornell University<\/p>\n\n\n\n

Cloud-LoRa: Enabling Cloud Radio Access LoRa Networks Using Reinforcement Learning Based Bandwidth-Adaptive Compression<\/a>
<\/strong>Muhammad Osama Shahid, Daniel Koch, Jayaram Raghuram, and Bhuvana Krishnaswamy, University of Wisconsin-Madison; 
Krishna Chintalapudi<\/a>, Microsoft Research; Suman Banerjee, University of Wisconsin-Madison<\/p>\n\n\n\n

Cyclops: A Nanomaterial-based, Battery-Free Intraocular Pressure (IOP) Monitoring System inside Contact Lens<\/a>
<\/strong>Liyao Li, University at Buffalo SUNY and Northwest University; Bozhao Shang and Yun Wu, Northwest University and Shaanxi International Joint Research Centre for the Battery-Free Internet of Things; 
Jie Xiong<\/a>, University of Massachusetts Amherst and Microsoft Research Asia; Xiaojiang Chen, Northwest University and Shaanxi International Joint Research Centre for the Battery-Free Internet of Things; Yaxiong Xie, University at Buffalo SUNY<\/p>\n\n\n\n

ExChain: Exception Dependency Analysis for Root Cause Diagnosis<\/a>
<\/strong>Ao Li, Carnegie Mellon University; 
Shan Lu<\/a>, Microsoft Research and University of Chicago; Suman Nath<\/a>, Microsoft Research; Rohan Padhye and Vyas Sekar, Carnegie Mellon University<\/p>\n\n\n\n

Finding Adversarial Inputs for Heuristics using Multi-level Optimization<\/u><\/a>
<\/strong>Pooria Namyar, Microsoft and University of Southern California; 
Behnaz Arzani<\/a> and Ryan Beckett<\/a>, Microsoft; Santiago Segarra, Microsoft and Rice University; Himanshu Raj<\/a> and Umesh Krishnaswamy, Microsoft; Ramesh Govindan, University of Southern California; Srikanth Kandula<\/a>, Microsoft<\/p>\n\n\n\n

Gemino: Practical and Robust Neural Compression for Video Conferencing<\/a>
<\/strong>Vibhaalakshmi Sivaraman, Pantea Karimi, Vedantha Venkatapathy, and Mehrdad Khani, Massachusetts Institute of Technology; 
Sadjad Fouladi<\/a>, Microsoft Research; Mohammad Alizadeh, Fr\u00e9do Durand, and Vivienne Sze, Massachusetts Institute of Technology<\/p>\n\n\n\n

GRACE: Loss-Resilient Real-Time Video through Neural Codecs<\/a>
<\/strong>Yihua Cheng, Ziyi Zhang, Hanchen Li, Anton Arapin, and Yue Zhang, The University of Chicago; Qizheng Zhang, Stanford University; Yuhan Liu, Kuntai Du, and Xu Zhang, The University of Chicago; 
Francis Y. Yan<\/a>, Microsoft; Amrita Mazumdar, NVIDIA; Nick Feamster and Junchen Jiang, The University of Chicago<\/p>\n\n\n\n

LitePred: Transferable and Scalable Latency Prediction for Hardware-Aware Neural Architecture Search<\/a>
<\/strong>Chengquan Feng, University of Science and Technology of China; 
Li Lyna Zhang<\/a>, Microsoft Research; Yuanchi Liu, University of Science and Technology of China; Jiahang Xu<\/a> and Chengruidong Zhang<\/a>, Microsoft Research; Zhiyuan Wang, University of Science and Technology of China; Ting Cao<\/a> and Mao Yang<\/a>, Microsoft Research; Haisheng Tan, University of Science and Technology of China<\/p>\n\n\n\n

Making Kernel Bypass Practical for the Cloud with Junction<\/u><\/a>
<\/strong>Joshua Fried and Gohar Irfan Chaudhry, MIT CSAIL; 
Enrique Saurez<\/a>, Esha Choukse<\/a>, and \u00cd\u00f1igo Goiri<\/a>, Azure Research \u2013 Systems; Sameh Elnikety<\/a>, Microsoft Research; Rodrigo Fonseca<\/a>, Azure Research \u2013 Systems; Adam Belay, MIT CSAIL<\/p>\n\n\n\n

MESSI: Behavioral Testing of BGP Implementations<\/a>
<\/strong>Rathin Singha and Rajdeep Mondal, University of California Los Angeles; 
Ryan Beckett<\/a>, Microsoft; Siva Kesava Reddy Kakarla<\/a>, Microsoft Research; Todd Millstein and George Varghese, University of California Los Angeles<\/p>\n\n\n\n

NetVigil: Robust and Low-Cost Anomaly Detection for East-West Data Center Security<\/a>
<\/strong>
Kevin Hsieh<\/a>, Microsoft; Mike Wong, Princeton University and Microsoft; Santiago Segarra, Microsoft and Rice University; Sathiya Kumaran Mani<\/a>, Trevor Eberl<\/a>, and Anatoliy Panasyuk, Microsoft; Ravi Netravali, Princeton University; Ranveer Chandra<\/a> and Srikanth Kandula<\/a>, Microsoft<\/p>\n\n\n\n

OPPerTune: Post-Deployment Configuration Tuning of Services Made Easy<\/a>
<\/strong>Gagan Somashekar, Stony Brook University; 
Karan Tandon<\/a> and Anush Kini<\/a>, Microsoft Research; Chieh-Chun Chang and Petr Husak, Microsoft; Ranjita Bhagwan, Google; Mayukh Das<\/a>, Microsoft365 Research; Anshul Gandhi, Stony Brook University; Nagarajan Natarajan<\/a>, Microsoft Research<\/p>\n\n\n\n

Sequence Abstractions for Flexible, Line-Rate Network Monitoring<\/a>
<\/strong>Andrew Johnson, Princeton University; 
Ryan Beckett<\/a>, Microsoft Research; Xiaoqi Chen, Princeton University; Ratul Mahajan, University of Washington; David Walker, Princeton University<\/p>\n\n\n\n

Solving Max-Min Fair Resource Allocations Quickly on Large Graphs<\/u><\/a>
<\/strong>Pooria Namyar, Microsoft and University of Southern California; 
Behnaz Arzani<\/a> and Srikanth Kandula<\/a>, Microsoft; Santiago Segarra, Microsoft and Rice University; Daniel Crankshaw and Umesh Krishnaswamy, Microsoft; Ramesh Govindan, University of Southern California; Himanshu Raj<\/a>, Microsoft<\/p>\n\n\n\n

Spectrumize: Spectrum-efficient Satellite Networks for the Internet of Things<\/a>
<\/strong>Vaibhav Singh,
Tusher Chakraborty<\/a>, and Suraj Jog<\/a>, Microsoft Research; Om Chabra and Deepak Vasisht, UIUC; Ranveer Chandra<\/a>, Microsoft Research<\/p>\n\n\n\n

Vulcan: Automatic Query Planning for Live ML Analytics<\/a>
<\/strong>Yiwen Zhang and Xumiao Zhang, University of Michigan; 
Ganesh Ananthanarayanan<\/a>, Microsoft; Anand Iyer, Georgia Institute of Technology; Yuanchao Shu, Zhejiang University; Victor Bahl<\/a>, Microsoft Research; Z. Morley Mao, University of Michigan and Google; Mosharaf Chowdhury, University of Michigan<\/p>\n\n\n\n

Symposium organizers from Microsoft <\/h2>\n\n\n\n

Program Committee Co-Chair<\/h3>\n\n\n\n

Irene Zhang<\/a><\/p>\n\n\n\n

Program Committee<\/h3>\n\n\n\n

Paolo Costa
<\/a>
Anuj Kalia
<\/a>
Amar Phanishayee
<\/a>
Dan Ports
<\/a>
Francis Yan<\/a> <\/p>\n\n\n\n

Mentoring Co-Chair<\/h3>\n\n\n\n

Jay Lorch<\/a><\/p>\n\n\n\n

Test of Time Awards Committee<\/h3>\n\n\n\n

Jay Lorch
<\/a>
Amar Phanishayee<\/a><\/p>\n\n\n\n

Steering Committee<\/h3>\n\n\n\n

Jay Lorch<\/a>
Amar Phanishayee<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"

Microsoft at NDSI 2024: Discoveries and implementations in networked systems<\/p>\n

Topics range from 5G, space, datacenters, and wide-area networking to applications in artificial intelligence, security, video conferencing, and gaming. Learn more about the discoveries and advances we’re making with networked systems.<\/p>\n","protected":false},"author":37583,"featured_media":1025892,"comment_status":"closed","ping_status":"closed","sticky":false,"template":"","format":"standard","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","footnotes":""},"categories":[1],"tags":[],"research-area":[13547],"msr-region":[],"msr-event-type":[],"msr-locale":[268875],"msr-post-option":[243984],"msr-impact-theme":[],"msr-promo-type":[],"msr-podcast-series":[],"class_list":["post-1024353","post","type-post","status-publish","format-standard","has-post-thumbnail","hentry","category-research-blog","msr-research-area-systems-and-networking","msr-locale-en_us","msr-post-option-blog-homepage-featured"],"msr_event_details":{"start":"","end":"","location":""},"podcast_url":"","podcast_episode":"","msr_research_lab":[199565],"msr_impact_theme":[],"related-publications":[],"related-downloads":[],"related-videos":[],"related-academic-programs":[],"related-groups":[144899,714067],"related-projects":[881235,982158,785224,171281],"related-events":[1024173],"related-researchers":[{"type":"user_nicename","value":"Ranveer Chandra","user_id":33344,"display_name":"Ranveer Chandra","author_link":"Ranveer Chandra<\/a>","is_active":false,"last_first":"Chandra, Ranveer","people_section":0,"alias":"ranveer"}],"msr_type":"Post","featured_image_thumbnail":"\"nsdi'24","byline":"Ranveer Chandra<\/a>","formattedDate":"April 16, 2024","formattedExcerpt":"Microsoft at NDSI 2024: Discoveries and implementations in networked systems Topics range from 5G, space, datacenters, and wide-area networking to applications in artificial intelligence, security, video conferencing, and gaming. Learn more about the discoveries and advances we're making with networked systems.","locale":{"slug":"en_us","name":"English","native":"","english":"English"},"_links":{"self":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1024353"}],"collection":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/users\/37583"}],"replies":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/comments?post=1024353"}],"version-history":[{"count":49,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1024353\/revisions"}],"predecessor-version":[{"id":1025949,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/posts\/1024353\/revisions\/1025949"}],"wp:featuredmedia":[{"embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media\/1025892"}],"wp:attachment":[{"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/media?parent=1024353"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/categories?post=1024353"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/tags?post=1024353"},{"taxonomy":"msr-research-area","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/research-area?post=1024353"},{"taxonomy":"msr-region","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-region?post=1024353"},{"taxonomy":"msr-event-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-event-type?post=1024353"},{"taxonomy":"msr-locale","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-locale?post=1024353"},{"taxonomy":"msr-post-option","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-post-option?post=1024353"},{"taxonomy":"msr-impact-theme","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-impact-theme?post=1024353"},{"taxonomy":"msr-promo-type","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-promo-type?post=1024353"},{"taxonomy":"msr-podcast-series","embeddable":true,"href":"https:\/\/www.microsoft.com\/en-us\/research\/wp-json\/wp\/v2\/msr-podcast-series?post=1024353"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}