Unadversarial examples: Designing objects for robust vision
Microsoft Research Blog | December 22, 2020
https://www.microsoft.com/en-us/research/blog/unadversarial-examples-designing-objects-for-robust-vision/

Editor's note: This post and its research are the result of the collaborative efforts of our team: MIT PhD students Andrew Ilyas and Logan Engstrom, Senior Researcher Sai Vemprala, MIT professor Aleksander Madry, and Partner Research Manager Ashish Kapoor.

Many of the items and objects we use in our daily lives were designed with people in mind. In October, the Reserve Bank of Australia released its redesigned $100 banknote. Some design elements remained the same (such as color and size, characteristics people use to tell the difference between notes) while others changed. New security features to help protect against fraud were added, as were raised bumps for people who are blind or have low vision. Good design enables intended audiences to easily acquire information and act on it.

Read Paper | Code & Materials

Modern computer vision systems take similar cues: floor markings direct a robot's course, boxes in a warehouse signal a forklift to move them, and stop signs alert a self-driving car to, well, stop. The neural networks underlying these systems might understand the features that we as humans find helpful, but they might also understand different features even better. In scenarios in which system operators and designers have a level of control over the target objects, what if we designed the objects in a way that makes them more detectable, even under conditions that normally break such systems, such as bad weather or variations in lighting?

We introduce a framework that exploits computer vision systems' well-known sensitivity to perturbations of their inputs to create robust, or unadversarial, objects: objects that are optimized specifically for better performance and robustness of vision models. Instead of using perturbations to get neural networks to wrongly classify objects, as is the case with adversarial examples, we use them to encourage the neural network to correctly classify the objects we care about with high confidence.

Figure 1: Optimizing objects for pre-trained neural networks rather than only optimizing the networks themselves can significantly boost performance and robustness on computer vision tasks. Above, a human-designed jet and a jet modified with a texture optimized for easier model detection are correctly classified under normal weather conditions; only the modified jet is correctly classified in the presence of fog or dust.

We show that such optimization of objects for vision systems significantly improves the performance and robustness of these systems, even to unforeseen data shifts and corruptions. An example of this is demonstrated above in Figure 1, where we modify a jet with a pattern optimized to enable image classifiers to more robustly recognize the jet under various weather conditions: while both the original jet and its unadversarial counterpart are correctly classified in normal conditions, only the unadversarial jet is recognized when corruptions like fog or dust are added. We present the details of this research in our paper "Unadversarial Examples: Designing Objects for Robust Vision."

Why design objects for neural networks?

The fragility of computer vision systems makes reliability and safety a real concern when deploying these systems in the real world. For example, a self-driving car's stop-sign detection system might be severely affected in the presence of intense weather conditions such as snow or fog. While techniques such as data augmentation, domain randomization, and robust training might seem to improve the performance of such systems, they don't typically generalize well to corrupted or otherwise unfamiliar data that these systems face when deployed.


We were motivated to find another approach by scenarios in which system designers and operators not only have control of the neural network itself, but also have some degree of control over the objects they want their model to recognize or detect: for example, a company that operates drones for delivery or transportation. These drones fly from place to place, and an important task for the system is landing safely at the target locations. Human operators may manage the landing pads at these locations, as well as the design of the system, presenting an opportunity to improve the system's ability to detect the landing pad by modifying the pad itself.

Designing robust objects for vision

Our starting point in designing robust objects for vision is the observation that modern vision models suffer from a severe input sensitivity that can, in particular, be exploited to generate so-called adversarial examples: imperceptible perturbations of a vision model's input that break it. Adversarial examples can potentially be used to intentionally cause system failures; researchers and practitioners use these examples to train systems that are more robust to such attacks. These perturbations are typically constructed by solving the following optimization problem, which maximizes the loss of a machine learning model with respect to the input:

\(\delta_{adv} = \arg\max_{\delta \in \Delta} L(\theta; x + \delta, y),\)

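As a concrete illustration of this maximization, here is a minimal sketch of a one-step, signed-gradient (FGSM-style) attack on a toy linear-softmax classifier; the model \(W\), input \(x\), and radius \(\epsilon\) below are illustrative stand-ins, not the vision models or settings from the paper.

```python
import numpy as np

rng = np.random.default_rng(0)

# Toy stand-in for a vision model: logits = W @ x, softmax cross-entropy loss.
W = rng.normal(size=(3, 8))  # hypothetical "model parameters" theta
x = rng.normal(size=8)       # a natural input
y = 1                        # its correct label

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def loss_and_grad(x, y):
    """Cross-entropy loss L(theta; x, y) and its gradient with respect to x."""
    p = softmax(W @ x)
    return -np.log(p[y]), W.T @ (p - np.eye(3)[y])

# One ascent step inside an L-infinity ball of radius eps (the class Delta):
# delta_adv = eps * sign(dL/dx) maximizes the loss linearized around x.
eps = 0.5
loss_clean, g = loss_and_grad(x, y)
delta_adv = eps * np.sign(g)
loss_adv, _ = loss_and_grad(x + delta_adv, y)
```

For this linear toy model the loss is convex in the input, so a single signed-gradient step is guaranteed to increase it; attacks on real deep networks typically iterate such steps with a projection back into \(\Delta\).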

where \(\theta\) is the set of model parameters; \(x\) is a natural image; \(y\) is the corresponding correct label; \(L\) is the loss function used to train \(\theta\) (for example, cross-entropy loss in classification contexts); and \(\Delta\) is a class of permissible perturbations. In our work, we aim to convert this unusually large input sensitivity from a weakness into a strength. That is, instead of creating misleading inputs, as shown in the above equation, we demonstrate how to optimize inputs that bolster performance, resulting in these unadversarial examples, or robust objects. This is done by simply solving the following optimization problem:

\(\delta_{unadv} = \arg\min_{\delta \in \Delta} L(\theta; x + \delta, y).\)

In our research, we explore two ways of designing robust objects: via an unadversarial patch applied to the object or by unadversarially altering the texture of the object (Figure 2). Both approaches solve the minimization above iteratively, with \(\Delta\) being the set of perturbations spanning the patch or texture. Note that we start from a randomly initialized patch or texture.
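The patch variant can be sketched as projected gradient descent on the same kind of toy linear-softmax classifier, where a binary mask plays the role of the patch support; the mask, step size, radius, and step count here are illustrative assumptions, not the paper's settings.

```python
import numpy as np

rng = np.random.default_rng(0)

# Fixed (pre-trained) toy "vision model": logits = W @ x.
W = rng.normal(size=(3, 8))
x = rng.normal(size=8)   # the object we are allowed to modify
y = 2                    # the class we want recognized robustly

mask = np.zeros(8)
mask[:4] = 1.0           # "patch": only the first 4 input dims may change

def softmax(z):
    e = np.exp(z - z.max())
    return e / e.sum()

def loss_and_grad(x, y):
    p = softmax(W @ x)
    return -np.log(p[y]), W.T @ (p - np.eye(3)[y])

# Projected gradient *descent* on the patch: minimize the loss (unadversarial),
# projecting delta back onto the patch support and an L-infinity ball each step.
eps, lr, steps = 1.0, 0.05, 200
delta = rng.uniform(-0.01, 0.01, size=8) * mask  # random init, as in the text
loss_before, _ = loss_and_grad(x + delta, y)
for _ in range(steps):
    _, g = loss_and_grad(x + delta, y)
    delta = np.clip((delta - lr * g) * mask, -eps, eps)
loss_after, _ = loss_and_grad(x + delta, y)
```

The only difference from the adversarial construction is the sign of the gradient step: descent instead of ascent, so the classifier's confidence in the correct label goes up rather than down.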

Figure 2: An example of an unadversarial patch (left) placed on a toy jet and an unadversarial texture (right) implicit in the design of a jet.