{"id":657531,"date":"2020-05-12T09:41:54","date_gmt":"2020-05-12T16:41:54","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?p=657531"},"modified":"2021-06-23T08:40:17","modified_gmt":"2021-06-23T15:40:17","slug":"wheres-my-stuff-developing-ai-with-help-from-people-who-are-blind-or-low-vision-to-meet-their-needs","status":"publish","type":"post","link":"https:\/\/www.microsoft.com\/en-us\/research\/blog\/wheres-my-stuff-developing-ai-with-help-from-people-who-are-blind-or-low-vision-to-meet-their-needs\/","title":{"rendered":"Where\u2019s my stuff? Developing AI with help from people who are blind or low vision to meet their needs"},"content":{"rendered":"
Microsoft AI for Accessibility is funding the ORBIT research project, which is enlisting the help of people who are blind or low vision to build a new dataset. Contributors record videos of things found in their daily lives, and the data will be used to train and test artificial intelligence (AI) models that personalize object recognition so they can better identify specific personal items. In contrast to previous research efforts, we will collect videos rather than images, since videos provide richer information.

To inform our research, we have conducted a pilot study investigating how best to collect these videos, and we are currently recruiting users who are blind or low vision in the UK to record videos of things that are important to them. Visit the ORBIT dataset homepage for more information on the study and how to sign up.

To maintain privacy and confidentiality, users' contributions to the pilot study, and to all future phases of this research, are anonymized and checked before being included in the dataset. Any videos containing information that could lead back to the identity of a user are removed.

Smartphones are a powerful way to make visual information accessible to people who are blind or low vision. For instance, the Seeing AI app lets you take a picture of your surroundings in scene mode and then reads aloud what it recognizes in the picture (for example, "a person sitting on a sofa"). AI can recognize common objects in a scene readily enough. For now, however, these apps can't tell you which of the things they recognize is yours, and they don't know about items that are particularly important to users who are blind or low vision. For example, has someone moved your keys again? Did your white cane get mixed up with someone else's? Imagine being able to easily identify the things that matter to you or to quickly locate your personal belongings.

Apps like Seeing AI rely on computer vision techniques to recognize items. While AI is making great strides in computer vision for many applications, such as automated driving, there are still areas where it does not work well, and personalized object recognition is one of them. Previous research has begun to make progress on the problem by studying how people who are blind or low vision take pictures, which algorithms could be used to personalize object recognition, and which kinds of data are best suited to enabling it.

However, research is currently held back by the lack of data available for training and then evaluating AI algorithms for personalized object recognition. Most computer vision datasets comprise hundreds of thousands, or even millions, of images; the datasets available for personal object recognition, by contrast, come from tens of users and contain perhaps hundreds of images. Moreover, there has been no effort to collect images of objects that may be particularly important to users who are blind or low vision.
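To make the personalization setting concrete, here is a minimal sketch of one family of approaches explored in this line of research: embed a handful of user-provided example images of each object with a pretrained backbone, average them into per-object prototypes, and label new images by the nearest prototype. This is illustrative only, not the ORBIT project's method; the backbone choice and all function names are our own assumptions.

```python
from typing import Dict, List

import torch
import torch.nn.functional as F
import torchvision.models as models

# Pretrained backbone used as a generic image embedder; the final
# classification layer is replaced with an identity so the model
# returns 512-d pooled features instead of ImageNet class scores.
backbone = models.resnet18(pretrained=True)
backbone.fc = torch.nn.Identity()
backbone.eval()

@torch.no_grad()
def embed(images: torch.Tensor) -> torch.Tensor:
    """images: (N, 3, 224, 224), ImageNet-normalized. Returns (N, 512) unit vectors."""
    return F.normalize(backbone(images), dim=-1)

@torch.no_grad()
def build_prototypes(support: Dict[str, torch.Tensor]):
    """support maps an object name (e.g., 'my keys') to a small stack of
    example images of that object. Returns names and a (K, 512) prototype matrix."""
    names = list(support)
    protos = torch.stack([embed(imgs).mean(dim=0) for imgs in support.values()])
    return names, F.normalize(protos, dim=-1)

@torch.no_grad()
def recognize(query: torch.Tensor, names: List[str], protos: torch.Tensor) -> str:
    """query: one (3, 224, 224) image. Returns the name of the nearest prototype."""
    sims = embed(query.unsqueeze(0)) @ protos.T  # cosine similarities, shape (1, K)
    return names[sims.argmax().item()]
```

The key point the sketch surfaces is the data regime: a personalized recognizer has only a few examples per object per user, which is exactly why a dataset drawn from many users matters for training and evaluation.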
Providing a larger dataset that researchers and developers can use to build better AI systems could be a game changer in this area, for people who are blind or low vision in particular but also for everyone.

By funding the ORBIT project, Microsoft AI for Accessibility hopes to help researchers construct a large dataset from users who are blind or low vision, which will help to further advance AI as it relates to personalizing object recognition. Researchers from City, University of London, Microsoft Research, and the University of Oxford are collaborating in this effort. Collaborators include the authors of this blog post along with Toby Harris, Mobile App Developer at City, University of London; Katja Hofmann, Principal Researcher at Microsoft Research Cambridge; and Luisa Zintgraf, PhD student at the University of Oxford and Research Intern at Microsoft Research Cambridge.

Unlike previous research efforts, we will collect videos, since they provide richer information than images. Our research also focuses on providing realistic testing data so that any new algorithms can be rigorously evaluated. We anticipate that our dataset might be useful for implementations in existing apps, like Seeing AI, and also in novel wearable systems like Project Tokyo, but our team is keen to future-proof the dataset for new applications that are yet to be imagined. The dataset will be made publicly available for download in two phases: Phase 1 will include about 100 users and thousands of videos, while Phase 2 will gather data from about 1,000 users and contain more than 10,000 videos.
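As an illustration of why video is richer than a single photo for this task, the hedged sketch below subsamples a short clip into several frames, giving a recognizer multiple views of the same object; the frames could then feed the hypothetical build_prototypes function from the earlier sketch. The file name, frame count, and preprocessing pipeline here are assumptions for illustration, not details of the ORBIT collection protocol.

```python
import cv2  # OpenCV, assumed available for video decoding
import torch
import torchvision.transforms as T

# Standard ImageNet preprocessing so frames match the embedder sketched earlier.
preprocess = T.Compose([
    T.ToPILImage(),
    T.Resize(256),
    T.CenterCrop(224),
    T.ToTensor(),
    T.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
])

def sample_frames(video_path: str, num_frames: int = 8) -> torch.Tensor:
    """Evenly sample `num_frames` frames from a clip as a (N, 3, 224, 224) tensor."""
    cap = cv2.VideoCapture(video_path)
    total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
    frames = []
    for idx in torch.linspace(0, max(total - 1, 0), num_frames).long().tolist():
        cap.set(cv2.CAP_PROP_POS_FRAMES, idx)
        ok, frame = cap.read()
        if ok:  # OpenCV decodes BGR; convert to RGB before preprocessing
            frames.append(preprocess(cv2.cvtColor(frame, cv2.COLOR_BGR2RGB)))
    cap.release()
    return torch.stack(frames)

# Hypothetical usage: frames from one user's clip of their keys become that
# object's few-shot support set.
# support = {"my keys": sample_frames("keys.mp4")}
```

A single clip yields many views of an object under varying angle, distance, and lighting, which is the richer signal the post refers to when contrasting videos with still images.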