{"id":488849,"date":"2018-06-01T12:30:00","date_gmt":"2018-06-01T19:30:00","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-event&p=488849"},"modified":"2025-08-06T11:57:09","modified_gmt":"2025-08-06T18:57:09","slug":"microsoft-cvpr-2018","status":"publish","type":"msr-event","link":"https:\/\/www.microsoft.com\/en-us\/research\/event\/microsoft-cvpr-2018\/","title":{"rendered":"Microsoft @ CVPR 2018"},"content":{"rendered":"\n\n
Venue:<\/strong> Calvin L. Rampton Salt Palace Convention Center<\/a><\/p>\n Website:<\/strong> CVPR 2018<\/a><\/p>\n Microsoft is proud to be a diamond sponsor of the Conference on Computer Vision and Pattern Recognition (CVPR<\/a>), June 18 \u2013 22 in Salt Lake City, Utah. Please visit us at booth 537 to chat with our experts, see demos of our latest research, and find out about career opportunities with Microsoft.<\/p>\n Marc Pollefeys \u2013 Robust Vision Challenge Organizers Marc Pollefeys<\/strong>, Pawel Olszta<\/strong><\/p>\n David Doria, Tim Franklin<\/strong>, Matt Turek, Jan Ernst, Wei Xia, Stephen Miller, Ben Kadlec<\/p>\n Why FGVC5 Folks Should be Interested in the Microsoft AI for Earth Program Aijun Bai Tuesday | 8:50-10:10 | Room 255 Wednesday | 8:30-10:10 | Ballroom Wednesday | 8:30-10:10 | Room 255 Wednesday | 8:30-10:10 | Room 255 Wednesday | 2:50-4:30 | Room 155 Wednesday | 2:50-4:30 | Room 155 Thursday | 8:30-10:10 | Ballroom Thursday | 8:30-10:10 | Room 155 Thursday | 8:30-10:10 | Ballroom Thursday | 12:50-2:30 | Room 255 Thursday | 12:50-2:30 | Room 155 Thursday | 2:50-4:30 | Room 255 Thursday | 2:50-4:30 | Ballroom Thursday | 2:50-4:30 | Room 155 Friday | 9:30-9:50 | Ballroom E Friday | 9:50-10:10 | Ballroom E Tuesday | 10:10-12:30 | Halls C-E Tuesday | 10:10-12:30 | Halls C-E Tuesday | 10:10-12:30 | Halls C-E Tuesday | 12:30-2:50 | Halls C-E Tuesday | 12:30-2:50 | Halls C-E Tuesday | 12:30-2:50 | Halls C-E Wednesday | 10:10-12:30 | Halls C-E Wednesday | 10:10-12:30 | Halls C-E Wednesday | 10:10-12:30 | Halls C-E Wednesday | 10:10-12:30 | Halls C-E Wednesday | 12:30-2:50 | Halls C-E Wednesday | 12:30-2:50 | Halls C-E Wednesday | 12:30-2:50 | Halls C-E Wednesday | 12:30-2:50 | Halls C-E Wednesday | 4:30-6:30 | Halls C-E Wednesday | 4:30-6:30 | Halls C-E Wednesday | 4:30-6:30 | Halls C-E Wednesday | 4:30-6:30 | Halls C-E Wednesday | 4:30-6:30 | Halls C-E Wednesday | 4:30-6:30 | Halls C-E Thursday | 10:10-12:30 | Halls D-E Thursday | 10:10-12:30 | Halls D-E Thursday | 10:10-12:30 | Halls D-E Thursday | 10:10-12:30 | Halls D-E Thursday | 4:30-6:30 | Halls D-E Thursday | 4:30-6:30 | Halls D-E Thursday | 4:30-6:30 | Halls D-E Thursday | 4:30-6:30 | Halls D-E\nProgram Committee members<\/h2>\n
\nSing Bing Kang<\/a>, Stephen Lin, Sebastian Nowozin, and Wenjun Zeng \u2013\u00a0NTIRE 2018 Program Committee
\nGang Hua<\/a>\u00a0\u2013 PBVS 2018 Program Committee
\nDaniel McDuff<\/a>\u00a0\u2013 CVPM 2018 Program Co-Chair
\nTimnit Gebru \u2013 CV-COPS 2018 Program Committee
\nZhengyou Zhang \u2013 Sight and Sound Workshop Organizers<\/p>\nTutorials<\/h2>\n
New from HoloLens: Research Mode<\/a>
\nTuesday | 1:30 \u2013 2:50 | Room 151 – ABCG<\/h4>\nSoftware Engineering in Computer Vision Systems<\/a>
\nFriday | 8:30 \u2013 12:30 | Ballroom C<\/h4>\nWorkshops<\/h2>\n
The Fifth Workshop on Fine-Grained Visual Categorization<\/a>
\nFriday | 9:00 \u2013 5:00 | Room 151 A-C<\/h4>\n
\n9:45 \u2013 10:00
\nDan Morris<\/a><\/p>\nMicrosoft attendees<\/h2>\n
\nLuca Ballan
\nMi\u0107o Banovi\u0107
\nFederica Bogo<\/a>
\nBogdan Burlacu
\nNick Burton
\nIshani Chakraborty
\nTemo Chalasani
\nDong Chen<\/a>
\nXi Chen
\nArti Chhajta
\nJohn Corring
\nJifeng Dai<\/a>
\nQi Dai
\nMandar Dixit
\nLiang Du
\nNan Duan<\/a>
\nXin Duan
\nGoran Dubajic
\nAndrew Fitzgibbon<\/a>
\nDinei Florencio<\/a>
\nJianlong Fu<\/a>
\nSean Goldberg
\nYandong Guo
\nHan Hu<\/a>
\nHoudong Hu
\nGang Hua<\/a>
\nQiuyuan Huang<\/a>
\nSing Bing Kang<\/a>
\nNikolaos Karianakis
\nNoboru Kuno<\/a>
\nNabil Lathiff
\nKuang-Huei Lee
\nXing Li
\nOlga Liakhovich
\nTongliang Liao
\nStephen Lin
\nZicheng Liu<\/a>
\nYan Lu<\/a>
\nChong Luo<\/a>
\nDaniel McDuff<\/a>
\nMeenaz Merchant
\nLeonardo Nunes
\nMarc Pollefeys<\/a>
\nTao Qin<\/a>
\nArun Sacheti
\nPablo Sala<\/a>
\nHarpreet Sawhney
\nPramod Sharma
\nYelong Shen<\/a>
\nJamie Shotton<\/a>
\nYale Song<\/a>
\nBaochen Sun<\/a>
\nXiaoyan Sun<\/a>
\nRavi Theja Yada
\nAli Osman Ulusoy
\nHamidreza Vaezi Joze<\/a>
\nAlon Vinnikov
\nBaoyuan Wang<\/a>
\nJianfeng Wang
\nJingdong Wang<\/a>
\nLijuan Wang<\/a>
\nZhe Wang
\nZhirong Wu
\nJiaolong Yang<\/a>
\nTing Yao
\nSang Ho Yoon<\/a>
\nQuanzeng You
\nCha Zhang<\/a>
\nLei Zhang<\/a>
\nMingxue Zhang
\nPengchuan Zhang<\/a>
\nTing Zhang<\/a>
\nYatao Zhong
\nXiaoyong Zhu<\/p>\nHybrid Camera Pose Estimation<\/h4>\n
\nFederico Camposeco, Andrea Cohen, Marc Pollefeys<\/strong>, Torsten Sattler<\/p>\nRelation Networks for Object Detection<\/a><\/h4>\n
\nHan Hu, Jiayuan Gu<\/strong>, Zheng Zhang<\/strong>, Jifeng Dai<\/strong>, Yichen Wei<\/strong><\/a><\/p>\nRayNet: Learning Volumetric 3D Reconstruction With Ray Potentials<\/a><\/h4>\n
\nDespoina Paschalidou, Ali Osman Ulusoy<\/strong>, Carolin Schmitt, Luc Van Gool, Andreas Geiger<\/p>\nAutomatic 3D Indoor Scene Modeling From Single Panorama<\/a><\/h4>\n
\nYang Yang, Shi Jin, Ruiyang Liu, Sing Bing Kang<\/strong><\/a>, Jingyi Yu<\/p>\nBottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering<\/a><\/h4>\n
\nPeter Anderson, Xiaodong He, Chris Buehler<\/strong>, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang<\/strong><\/p>\nVisual Question Generation as Dual Task of Visual Question Answering<\/a><\/h4>\n
\nYikang Li, Nan Duan<\/strong><\/a>, Bolei Zhou, Xiao Chu, Wanli Ouyang, Xiaogang Wang, Ming Zhou<\/strong><\/a><\/p>\nTowards High Performance Video Object Detection<\/a><\/h4>\n
\nXizhou Zhu, Jifeng Dai<\/strong><\/a>, Lu Yuan<\/strong><\/a>, Yichen Wei<\/strong><\/a><\/p>\nConsensus Maximization for Semantic Region Correspondences<\/h4>\n
\nPablo Speciale, Danda P. Paudel, Martin R. Oswald, Hayko Riemenschneider, Luc Van Gool, Marc Pollefeys<\/strong><\/p>\nInLoc: Indoor Visual Localization With Dense Matching and View Synthesis<\/h4>\n
\nHajime Taira, Masatoshi Okutomi, Torsten Sattler, Mircea Cimpoi, Marc Pollefeys<\/strong>, Josef Sivic, Tomas Pajdla, Akihiko Torii<\/p>\nLanguage-Based Image Editing With Recurrent Attentive Models<\/a><\/h4>\n
\nJianbo Chen, Yelong Shen<\/strong>, Jianfeng Gao<\/strong>, Jingjing Liu<\/strong>, Xiaodong Liu<\/strong><\/p>\nBenchmarking 6DOF Outdoor Visual Localization in Changing Conditions<\/h4>\n
\nTorsten Sattler, Will Maddern, Carl Toft, Akihiko Torii, Lars Hammarstrand, Erik Stenborg, Daniel Safari, Masatoshi Okutomi, Marc Pollefeys<\/strong>, Josef Sivic, Fredrik Kahl, Tomas Pajdla<\/p>\nFeature Space Transfer for Data Augmentation<\/h4>\n
\nBo Liu, Xudong Wang, Mandar Dixit<\/strong>, Roland Kwitt, and Nuno Vasconcelos<\/p>\nInterleaved Structured Sparse Convolutional Neural Networks<\/a><\/h4>\n
\nGuotian Xie, Jingdong Wang<\/strong><\/a>, Ting Zhang<\/strong><\/a>, Jianhuang Lai, Richang Hong, Guo-Jun Qi<\/p>\nRevisiting Deep Intrinsic Image Decompositions<\/a><\/h4>\n
\nQingnan Fan, Jiaolong Yang<\/strong><\/a>, Gang Hua<\/strong><\/a>, Baoquan Chen, David Wipf<\/strong><\/a><\/p>\nGood Citizen of CVPR Panel<\/a><\/h3>\n
\nRights and Obligations (Good review and bad review, constructive criticism)
\nKatsu Ikeuchi<\/strong><\/p>\n
\nHow to create an inclusive and welcoming culture at CVPR and not have a “clique” culture
\nTimnit Gebru<\/strong><\/p>\nPosters<\/h2>\n
\nReal-Time Seamless Single Shot 6D Object Pose Prediction<\/a>
\nBugra Tekin, Sudipta Sinha<\/strong><\/a>, Pascal Fua<\/p>\n
\nMiCT: Mixed 3D\/2D Convolutional Tube for Human Action Recognition<\/a>
\nYizhou Zhou, Xiaoyan Sun<\/strong>, Zheng-Jun Zha, Wenjun Zeng<\/strong><\/p>\n
\nHybrid Camera Pose Estimation
\nFederico Camposeco, Andrea Cohen, Marc Pollefeys<\/strong>, Torsten Sattler<\/p>\n
\nGlobal Versus Localized Generative Adversarial Nets<\/a>
\nGuo-Jun Qi, Liheng Zhang, Hao Hu, Marzieh Edraki, Jingdong Wang<\/strong><\/a>, Xian-Sheng Hua<\/p>\n
\nA High-Quality Denoising Dataset for Smartphone Cameras<\/a>
\nAbdelrahman Abdelhamed, Stephen Lin<\/strong>, Michael S. Brown<\/p>\n
\nAugmenting Crowd-Sourced 3D Reconstructions Using Semantic Detections
\nTrue Price, Johannes L. Sch\u00f6nberger<\/strong>, Zhen Wei, Marc Pollefeys<\/strong>, Jan-Michael Frahm<\/p>\n
\nRelation Networks for Object Detection<\/a>
\nHan Hu, Jiayuan Gu<\/strong>, Zheng Zhang<\/strong>, Jifeng Dai<\/strong><\/a>, Yichen Wei<\/strong><\/a><\/p>\n
\nRayNet: Learning Volumetric 3D Reconstruction With Ray Potentials<\/a>
\nDespoina Paschalidou, Ali Osman Ulusoy<\/strong>, Carolin Schmitt, Luc Van Gool, Andreas Geiger<\/p>\n
\nAutomatic 3D Indoor Scene Modeling From Single Panorama<\/a>
\nYang Yang, Shi Jin, Ruiyang Liu, Sing Bing Kang<\/strong><\/a>, Jingyi Yu<\/p>\n
\nPseudo Mask Augmented Object Detection<\/a>
\nXiangyun Zhao, Shuang Liang, Yichen Wei<\/strong><\/a><\/p>\n
\nA Twofold Siamese Network for Real-Time Object Tracking<\/a>
\nAnfeng He, Chong Luo<\/strong><\/a>, Xinmei Tian, Wenjun Zeng<\/strong><\/p>\n
\nCleanNet: Transfer Learning for Scalable Image Classifier Training With Label Noise<\/a>
\nKuang-Huei Lee<\/strong>, Xiaodong He, Lei Zhang<\/strong>, Linjun Yang<\/p>\n
\nEnd-to-End Convolutional Semantic Embeddings<\/a>
\nQuanzeng You<\/strong>, Zhengyou Zhang<\/strong>, Jiebo Luo<\/p>\n
\nGenerative Adversarial Learning Towards Fast Weakly Supervised Detection<\/a>
\nYunhan Shen, Rongrong Ji, Shengchuan Zhang, Wangmeng Zuo, Yan Wang<\/strong><\/p>\n
\nBottom-Up and Top-Down Attention for Image Captioning and Visual Question Answering<\/a>
\nPeter Anderson, Xiaodong He, Chris Buehler<\/strong>, Damien Teney, Mark Johnson, Stephen Gould, Lei Zhang<\/strong><\/p>\n
\nVisual Question Generation as Dual Task of Visual Question Answering<\/a>
\nYikang Li, Nan Duan<\/strong><\/a>, Bolei Zhou, Xiao Chu, Wanli Ouyang, Xiaogang Wang, Ming Zhou<\/strong><\/a><\/p>\n
\nSemantic Visual Localization
\nJohannes L. Sch\u00f6nberger<\/strong>, Marc Pollefeys<\/strong>, Andreas Geiger, Torsten Sattler<\/p>\n
\nStereoscopic Neural Style Transfer<\/a>
\nDongdong Chen, Lu Yuan<\/strong><\/a>, Jing Liao<\/strong>, Nenghai Yu, Gang Hua<\/strong><\/a><\/p>\n
\nTowards Open-Set Identity Preserving Face Synthesis<\/a>
\nJianmin Bao, Dong Chen<\/strong><\/a>, Fang Wen<\/strong>, Houqiang Li, Gang Hua<\/strong><\/a><\/p>\n
\nWeakly-Supervised Semantic Segmentation Network With Deep Seeded Region Growing<\/a>
\nZilong Huang, Xinggang Wang, Jiasi Wang, Wenyu Liu, Jingdong Wang<\/strong><\/a><\/p>\n
\nTowards High Performance Video Object Detection<\/a>
\nXizhou Zhu, Jifeng Dai<\/strong><\/a>, Lu Yuan<\/strong><\/a>, Yichen Wei<\/strong><\/a><\/p>\n
\nInLoc: Indoor Visual Localization With Dense Matching and View Synthesis
\nHajime Taira, Masatoshi Okutomi, Torsten Sattler, Mircea Cimpoi, Marc Pollefeys<\/strong>, Josef Sivic, Tomas Pajdla, Akihiko Torii<\/p>\n
\nConsensus Maximization for Semantic Region Correspondences
\nPablo Speciale, Danda P. Paudel, Martin R. Oswald, Hayko Riemenschneider, Luc Van Gool, Marc Pollefeys<\/strong><\/p>\n
\nArbitrary Style Transfer With Deep Feature Reshuffle<\/a>
\nShuyang Gu, Congliang Chen, Jing Liao<\/strong>, Lu Yuan<\/strong><\/a><\/p>\n
\nLanguage-Based Image Editing With Recurrent Attentive Models<\/a>
\nJianbo Chen, Yelong Shen<\/strong>, Jianfeng Gao<\/strong>, Jingjing Liu<\/strong>, Xiaodong Liu<\/strong><\/p>\n
\nBenchmarking 6DOF Outdoor Visual Localization in Changing Conditions
\nTorsten Sattler, Will Maddern, Carl Toft, Akihiko Torii, Lars Hammarstrand, Erik Stenborg, Daniel Safari, Masatoshi Okutomi, Marc Pollefeys<\/strong>, Josef Sivic, Fredrik Kahl, Tomas Pajdla<\/p>\n
\nInterleaved Structured Sparse Convolutional Neural Networks<\/a>
\nGuotian Xie, Jingdong Wang<\/strong><\/a>, Ting Zhang<\/strong><\/a>, Jianhuang Lai, Richang Hong, Guo-Jun Qi<\/p>\n
\nRevisiting Deep Intrinsic Image Decompositions<\/a>
\nQingnan Fan, Jiaolong Yang<\/strong><\/a>, Gang Hua<\/strong><\/a>, Baoquan Chen, David Wipf<\/strong><\/a><\/p>\n