{"id":761911,"date":"2021-11-09T07:59:00","date_gmt":"2021-11-09T15:59:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-group&p=761911"},"modified":"2024-06-20T05:20:13","modified_gmt":"2024-06-20T12:20:13","slug":"privacy-preserving-machine-learning-innovation","status":"publish","type":"msr-group","link":"https:\/\/www.microsoft.com\/en-us\/research\/group\/privacy-preserving-machine-learning-innovation\/","title":{"rendered":"Privacy Preserving Machine Learning Innovation"},"content":{"rendered":"
\n\t
\n\t\t
\n\t\t\t\"abstract\t\t<\/div>\n\t\t\n\t\t
\n\t\t\t\n\t\t\t
\n\t\t\t\t\n\t\t\t\t
\n\t\t\t\t\t\n\t\t\t\t\t
\n\t\t\t\t\t\t
\n\t\t\t\t\t\t\t\n\t\t\t\t\t\t\t\n\n

# Privacy Preserving Machine Learning Innovation
## A holistic approach to PPML
Recent research has shown that deploying ML models can, in some cases, implicate privacy in unexpected ways. For example, pretrained public language models that are fine-tuned on private data can be misused to recover private information, and very large language models have been shown to memorize training examples, potentially encoding personally identifying information (PII). Finally, inferring that a specific user was part of the training data can also impact privacy. At Microsoft Research, we believe it's critical to apply multiple techniques to achieve privacy and confidentiality; no single method can address all aspects alone. This is why we developed the Privacy Preserving Machine Learning (PPML) initiative to preserve the privacy and confidentiality of customer information while enabling next-generation productivity scenarios. With PPML, we take a three-pronged approach: first, we work to understand the risks and requirements around privacy and confidentiality; next, we work to measure the risks; and finally, we work to mitigate the potential for breaches of privacy. We explain the details of this multi-faceted approach below as well as in this blog post.
**Understand:** We work to understand the risk of customer data leakage and potential privacy attacks in a way that helps determine confidentiality properties of ML pipelines. In addition, we believe it's critical to proactively align with policy makers. We take into account local and international laws and guidance regulating data privacy, such as the General Data Protection Regulation (GDPR) and the EU's policy on trustworthy AI. We then map these legal principles, our contractual obligations, and responsible AI principles to our technical requirements and develop tools to communicate with policy makers how we meet these requirements.
**Measure:** Once we understand the risks to privacy and the requirements we must adhere to, we define metrics that can quantify the identified risks and track success towards mitigating them.
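One concrete way to quantify leakage risk is the success rate of a membership-inference attack against a trained model. The sketch below, a minimal illustration rather than the metric used in any particular Microsoft pipeline, scores the classic loss-threshold attack with AUC: the closer the score is to 0.5, the less the model reveals about whether a record was part of its training set. The model and data loaders are assumed to exist; all names are illustrative.

```python
# Minimal membership-inference metric: loss-threshold attack scored with AUC.
# Hypothetical example; `model`, `train_loader`, and `test_loader` are assumed to exist.
import numpy as np
import torch
import torch.nn.functional as F
from sklearn.metrics import roc_auc_score

@torch.no_grad()
def per_example_losses(model, loader):
    """Return the cross-entropy loss of each example under the model."""
    model.eval()
    losses = []
    for inputs, labels in loader:
        logits = model(inputs)
        losses.append(F.cross_entropy(logits, labels, reduction="none"))
    return torch.cat(losses).cpu().numpy()

def membership_inference_auc(model, train_loader, test_loader):
    """AUC of an attacker who predicts 'member' when the loss is low.
    0.5 means the attack is no better than guessing; 1.0 means total leakage."""
    member_losses = per_example_losses(model, train_loader)       # records seen in training
    non_member_losses = per_example_losses(model, test_loader)    # held-out records
    scores = -np.concatenate([member_losses, non_member_losses])  # lower loss -> higher membership score
    labels = np.concatenate([np.ones_like(member_losses), np.zeros_like(non_member_losses)])
    return roc_auc_score(labels, scores)
```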

**Mitigate:** We then develop and apply mitigation strategies, such as differential privacy (DP), described in more detail in this blog post. After we apply mitigation strategies, we measure their success and use our findings to refine our PPML approach.
Several different technologies and processes contribute to PPML, and we implement them for a number of different use cases, including threat modeling and preventing the leakage of training data. PPML strives to provide a holistic approach to unlock the full potential of customer data for intelligent features while honoring our commitment to privacy and confidentiality.
#### Confidential AI
Our goal is to make Azure the most trustworthy cloud platform for AI. The platform we envisage offers confidentiality and integrity against privileged attackers, including attacks on the code, data, and hardware supply chains; performance close to that offered by GPUs; and programmability with state-of-the-art ML frameworks.
#### Privacy in AI (PAI)
The M365 Research Privacy in AI group explores questions related to user privacy and confidentiality in machine learning. Our workstreams consider problems in modeling privacy threats, measuring privacy loss in AI systems, and mitigating identified risks, including applications of differential privacy, federated learning, and secure multi-party computation.
#### Differential Privacy: Project Laplace
Differential Privacy (DP) is the gold standard of privacy protection, with a vast body of academic literature and a growing number of large-scale deployments across industry and government. In machine learning scenarios, DP works by adding small amounts of statistical random noise during training, the purpose of which is to conceal the contributions of individual parties. When DP is employed, a mathematical proof ensures that the final ML model learns only general trends in the data without acquiring information specific to individual parties. To expand the scope of scenarios where DP can be successfully applied, we push the boundaries of the state of the art in DP training algorithms to address the issues of scalability, efficiency, and privacy/utility trade-offs.
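To make the noise-addition idea concrete, here is a minimal sketch of a single DP-SGD step: per-example gradients are clipped to a norm bound, and Gaussian noise scaled to that bound is added before the parameter update. This is an illustrative toy (plain NumPy, logistic regression), not Project Laplace's training code; `clip_norm` and `noise_multiplier` are simply the standard DP-SGD hyperparameters.

```python
# Toy DP-SGD step for logistic regression (illustrative only).
import numpy as np

def dp_sgd_step(w, X_batch, y_batch, lr=0.1, clip_norm=1.0, noise_multiplier=1.1, rng=None):
    """One differentially private SGD step: clip per-example gradients, then add Gaussian noise."""
    rng = rng or np.random.default_rng()
    per_example_grads = []
    for x, y in zip(X_batch, y_batch):
        p = 1.0 / (1.0 + np.exp(-x @ w))                  # sigmoid prediction
        g = (p - y) * x                                   # per-example gradient
        norm = np.linalg.norm(g)
        g = g * min(1.0, clip_norm / (norm + 1e-12))      # clip to bound each example's influence
        per_example_grads.append(g)
    grad_sum = np.sum(per_example_grads, axis=0)
    # Gaussian noise calibrated to the clipping bound conceals individual contributions.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=w.shape)
    noisy_mean_grad = (grad_sum + noise) / len(X_batch)
    return w - lr * noisy_mean_grad
```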

#### Project FLUTE
The goal of FLUTE is to create technologies that allow model training on private data without central curation. We apply techniques from federated learning, differential privacy, and high-performance computing to enable cross-silo model training with strong experimental results. We have released FLUTE as an open-source toolkit on GitHub.
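At the core of this setting is the federated-averaging idea: each silo trains locally on data that never leaves it, and only model updates are aggregated centrally. The sketch below is a bare-bones FedAvg round in NumPy, not the FLUTE API; `local_sgd` and the client data structures are hypothetical stand-ins.

```python
# Bare-bones federated averaging round (illustrative; not the FLUTE API).
import numpy as np

def local_sgd(weights, X, y, lr=0.05, epochs=1):
    """Hypothetical local training loop: a few SGD epochs of least-squares regression."""
    w = weights.copy()
    for _ in range(epochs):
        grad = X.T @ (X @ w - y) / len(y)
        w -= lr * grad
    return w

def fedavg_round(global_weights, clients):
    """One round: each client trains locally; the server averages updates weighted by data size."""
    updates, sizes = [], []
    for X, y in clients:                          # raw data stays on the client
        updates.append(local_sgd(global_weights, X, y))
        sizes.append(len(y))
    return np.average(updates, axis=0, weights=np.asarray(sizes, dtype=float))

# Usage with synthetic silos.
rng = np.random.default_rng(0)
clients = [(rng.normal(size=(50, 3)), rng.normal(size=50)) for _ in range(4)]
w = np.zeros(3)
for _ in range(10):
    w = fedavg_round(w, clients)
```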

## Tools
### Privacy Random Variable (PRV) Accountant
A fast algorithm to optimally compose privacy guarantees of differentially private (DP) mechanisms to arbitrary accuracy.
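The accountant answers questions such as: after N noisy training steps at a given noise multiplier and sampling rate, what (ε, δ) guarantee holds? The snippet below sketches how the open-source `prv_accountant` package can be used for this; the class and argument names follow the public repository as we recall them but may differ between releases, so treat them as an approximation and consult the README.

```python
# Sketch: composing many DP-SGD steps with the PRV accountant.
# Class/argument names follow the public prv_accountant package but may vary by version.
from prv_accountant import Accountant

accountant = Accountant(
    noise_multiplier=1.1,            # std of Gaussian noise relative to the clipping norm
    sampling_probability=256 / 50000,  # batch size / dataset size (Poisson subsampling)
    delta=1e-5,
    eps_error=0.1,                   # target accuracy of the epsilon estimate
    max_compositions=10000,
)

# Lower bound, estimate, and upper bound on epsilon after 10,000 steps.
eps_low, eps_est, eps_up = accountant.compute_epsilon(num_compositions=10000)
print(f"epsilon ~ {eps_est:.2f} (in [{eps_low:.2f}, {eps_up:.2f}]) at delta=1e-5")
```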

### DP-Transformers
Motivated by our recent work, we are releasing a repository for training transformer models with differential privacy. Our GitHub repository is based on integrating the Opacus library with the Hugging Face platform. We aim to serve the privacy-preserving ML community in utilizing state-of-the-art models while respecting the privacy of the individuals whose data these models learn from.
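For readers who want a feel for what this integration involves, the snippet below shows the general pattern of wrapping a Hugging Face model and its optimizer with Opacus's `PrivacyEngine` so that fine-tuning runs DP-SGD. It is a simplified sketch of the underlying libraries, not the dp-transformers API itself; the tiny toy dataset, hyperparameters, and omitted epsilon-accounting loop are illustrative assumptions.

```python
# Simplified sketch of the pattern: DP fine-tuning of a Hugging Face model with Opacus.
# This is not the dp-transformers API; it only illustrates the underlying integration.
import torch
from torch.utils.data import DataLoader, TensorDataset
from transformers import AutoModelForSequenceClassification, AutoTokenizer
from opacus import PrivacyEngine

model = AutoModelForSequenceClassification.from_pretrained("distilbert-base-uncased", num_labels=2)
tokenizer = AutoTokenizer.from_pretrained("distilbert-base-uncased")

# Tiny toy dataset standing in for real (private) training data.
texts = ["meeting notes", "public announcement", "draft contract", "press release"]
labels = torch.tensor([1, 0, 1, 0])
enc = tokenizer(texts, padding=True, return_tensors="pt")
train_loader = DataLoader(TensorDataset(enc["input_ids"], enc["attention_mask"], labels), batch_size=2)

optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)
privacy_engine = PrivacyEngine()
model, optimizer, train_loader = privacy_engine.make_private(
    module=model,
    optimizer=optimizer,
    data_loader=train_loader,
    noise_multiplier=1.0,     # scale of Gaussian noise added to clipped gradients
    max_grad_norm=1.0,        # per-example gradient clipping bound
    poisson_sampling=False,   # keep the toy loader's fixed batches for this sketch
)

model.train()
for input_ids, attention_mask, batch_labels in train_loader:
    optimizer.zero_grad()
    out = model(input_ids=input_ids, attention_mask=attention_mask, labels=batch_labels)
    out.loss.backward()       # Opacus computes, clips, and noises per-example gradients
    optimizer.step()
```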

## Related research
### EzPC (Easy Secure Multi-party Computation)
The EzPC project focuses on providing a scalable, performant, and usable system for secure Multi-Party Computation (MPC). MPC, through cryptographic protocols, allows multiple parties with sensitive information to compute joint functions on their data without sharing the data in the clear with any entity. In the context of machine learning, an example of such a task is secure inference, where a model owner can offer inference as a service to a data owner without either entity seeing any data in the clear. The EzPC system automatically generates MPC protocols for this task from standard TensorFlow/ONNX code.
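The building block behind such protocols is secret sharing: each private value is split into random shares so that no single party learns anything, yet the parties can still compute on the shares. The sketch below illustrates two-party additive secret sharing over a prime field for an addition and a public-constant multiplication; it is a didactic toy, not the protocol EzPC generates (which also handles products of secret values, comparisons, and full neural networks).

```python
# Toy two-party additive secret sharing over a prime field (didactic only).
import secrets

P = 2**61 - 1  # public prime modulus

def share(x):
    """Split x into two random shares that sum to x mod P."""
    s0 = secrets.randbelow(P)
    s1 = (x - s0) % P
    return s0, s1   # give s0 to party 0, s1 to party 1

def reconstruct(s0, s1):
    return (s0 + s1) % P

# Each party secret-shares its value; neither ever sees the other's input in the clear.
a0, a1 = share(42)   # party A's private value
b0, b1 = share(17)   # party B's private value

# Addition and multiplication by a public constant work locally on the shares.
sum0, sum1 = (a0 + b0) % P, (a1 + b1) % P
scaled0, scaled1 = (3 * a0) % P, (3 * a1) % P

assert reconstruct(sum0, sum1) == 59        # 42 + 17
assert reconstruct(scaled0, scaled1) == 126  # 3 * 42
```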

\"diagram<\/a><\/figure><\/div>\n\n\n","protected":false},"excerpt":{"rendered":"

## People

**Executive sponsors:** Jim Kleewein, Jaime Teevan

**Leadership team:** Saravan Rajmohan, Ryen W. White, Manuel Costa, Morris Kabuage, Kieran McDonald, danah boyd

**Team:** Victor Ruehle, Kim Laine, Melissa Chase, Boris Köpf, Nishanth Chandran, Robert Sim, Sergey Yekhanin, Daniel Jones, Lukas Wutschitz, Shruti Tople, Santiago Zanella-Béguelin, Andrew Paverd, Janardhan (Jana) Kulkarni, Sivakanth Gopi, Arturs Backurs, Esha Ghosh, Huseyin Atahan Inan, Sepideh Mahabadi, Divya Gupta, Rahul Sharma, Aseem Rastogi, Kapil Vaswani, Antoine Delignat-Lavaud, Stavros Volos, Cédric Fournet, Xuchao Zhang, Molly Xia, Zinan Lin, Gbola Afonja, Giovanni Cherubin

## Research
### Confidential AI

Our goal is to make Azure the most trustworthy cloud platform for AI. The platform we envisage offers confidentiality and integrity against privileged attackers, including attacks on the code, data, and hardware supply chains; performance close to that offered by GPUs; and programmability with state-of-the-art ML frameworks. The confidential AI platform will enable multiple entities to collaborate and train accurate models using sensitive data, and serve these models with assurance that their data and models…
### Project FTL

A novel framework for training models in a federated learning fashion. The project is one of the first attempts to introduce federated learning to speech recognition tasks. Beyond the novelty of the task, the paper describes an easily generalizable FL platform and some of the design decisions used for this task. Among the novel algorithms introduced are a new hierarchical optimization scheme, a gradient selection algorithm, and self-supervised training algorithms.
### Real World Reinforcement Learning

Real World Reinforcement Learning (Real-World RL) projects enable the next generation of machine learning using interactive reinforcement-based approaches to solve real-world problems.
### EzPC (Easy Secure Multi-party Computation)

Consider the following scenario: two hospitals, each having sensitive patient data, must compute statistical information about their joint data. Privacy regulations forbid them from sharing data in the clear with any entity. So, can they compute this information while keeping their private data encrypted (or "hidden") from each other? Cryptography, and specifically the primitive Secure Multi-Party Computation (MPC), provides an answer to this seemingly impossible task using sophisticated mathematical protocols. However, two big challenges remain:…
### Project SPIRAL

### Project Florida

### Confidential Computing

### Privacy in ML
### Differential Privacy

Differential privacy (DP) is widely recognized as a gold standard of privacy protection due to its mathematical rigor. Through the lens of differential privacy, we can design machine learning algorithms that responsibly train models on private data. However, it is challenging to apply differentially private stochastic gradient descent (DP-SGD) to large deep neural network models because of the dimensional dependence of DP: a larger model usually leads to worse performance in order to guarantee the same level of differential privacy. This is an essential barrier to applying DP in the deep learning era, where state-of-the-art performance demands large models.
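A quick way to see this dimensional dependence: for a fixed clipping norm and noise multiplier, DP-SGD adds independent Gaussian noise to every coordinate of the gradient, so the norm of the injected noise grows roughly as the square root of the number of parameters, while the signal stays bounded by the clipping norm times the batch size. The snippet below is a small illustrative calculation, not a formal argument; the hyperparameter values are arbitrary.

```python
# Illustration: noise-to-signal ratio in DP-SGD grows with model dimension.
import numpy as np

rng = np.random.default_rng(0)
clip_norm, noise_multiplier, batch_size = 1.0, 1.0, 512

for dim in [10_000, 1_000_000, 10_000_000]:
    # The clipped, summed batch gradient has norm at most clip_norm * batch_size (the "signal").
    signal = clip_norm * batch_size
    # DP-SGD adds Gaussian noise with std noise_multiplier * clip_norm to every coordinate.
    noise = rng.normal(0.0, noise_multiplier * clip_norm, size=dim)
    ratio = np.linalg.norm(noise) / signal
    print(f"dim={dim:>10,}  noise norm ~ {np.linalg.norm(noise):9.1f}  noise/signal ~ {ratio:.2f}")
```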