{"id":725710,"date":"2021-02-11T15:37:17","date_gmt":"2021-02-11T23:37:17","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-event&p=725710"},"modified":"2025-08-06T11:51:54","modified_gmt":"2025-08-06T18:51:54","slug":"iclr-2021","status":"publish","type":"msr-event","link":"https:\/\/www.microsoft.com\/en-us\/research\/event\/iclr-2021\/","title":{"rendered":"Microsoft at ICLR 2021"},"content":{"rendered":"\n\n
Conference URL:<\/strong> ICLR 2021 (opens in new tab)<\/span><\/a>Opens in a new tab<\/span><\/p>\n Microsoft is excited to be a Platinum sponsor of the Ninth International Conference on Learning Representations (ICLR) (opens in new tab)<\/span><\/a>. Stop by our virtual booth to chat with our experts, see demos of our latest research and find out more about career opportunities with Microsoft.<\/p>\n Join us at our virtual booth (opens in new tab)<\/span><\/a> during the following PDT times:<\/p>\n Senior Program Chair:<\/strong> Katja Hofmann<\/a>Opens in a new tab<\/span><\/p>\n *All times are displayed in PDT<\/em><\/p>\n 01:00 \u2013 03:00 AM | Poster<\/p>\n What Makes Instance Discrimination Good for Transfer Learning?<\/strong><\/p>\n Nanxuan Zhao, Zhirong Wu<\/a>, Rynson W. H. Lau, Stephen Lin<\/a><\/p>\n 09:00 \u2013 11:00 AM | Poster<\/p>\n Learning-based Support Estimation in Sublinear Time<\/strong><\/p>\n Talya Eden, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner<\/strong><\/p>\n Spotlight Session: May 4<\/em><\/p>\n Parameter Efficient Multimodal Transformers for Video Representation Learning<\/strong><\/a><\/p>\n Sangho Lee, Youngjae Yu, Gunhee Kim, Thomas Breuel, Jan Kautz, Yale Song<\/a><\/p>\n Shapley Explainability on the Data Manifold<\/strong><\/p>\n Christopher Frye, Damien de Mijolla, Tom Begley, Laurence Cowton, Megan Stanley<\/strong>, Ilya Feige<\/p>\n InfoBERT: Improving Robustness of Language Models from An Information Theoretic Perspective<\/strong><\/a><\/p>\n Boxin Wang, Shuohang Wang<\/a>, Yu Cheng<\/a>, Zhe Gan<\/a>, Ruoxi Jia, Bo Li, Jingjing Liu<\/a><\/p>\n 12:35 \u2013 12:45 PM | Spotlight<\/p>\n Systematic Generalisation with Group Invariant Predictions<\/strong><\/p>\n Faruk Ahmed, Yoshua Bengio, Harm van Seijen<\/a>, Aaron Courville<\/p>\n Poster Session: May 4<\/em><\/p>\n 05:00 \u2013 07:00 PM | Poster<\/p>\n VA-RED$^2$: Video Adaptive Redundancy Reduction<\/strong><\/p>\n Bowen Pan, Rameswar 
Panda, Camilo Luciano Fosco, Chung-Ching Lin (opens in new tab)<\/span><\/a>, Alex J Andonian, Yue Meng, Kate Saenko, Aude Oliva, Rogerio Feris<\/p>\n SCoRe: Pre-Training for Context Representation in Conversational Semantic Parsing<\/strong><\/a><\/p>\n Tao Yu, Rui Zhang, Alex Polozov<\/a>, Christopher Meek<\/a>, Ahmed Hassan Awadallah<\/a><\/p>\n DEBERTA: Decoding-Enhanced BERT with Disentangled Attention<\/strong><\/a><\/p>\n Pengcheng He<\/a>, Xiaodong Liu<\/a>, Jianfeng Gao<\/a>, Weizhu Chen<\/a><\/p>\n Learning a Latent Simplex in Input Sparsity Time<\/strong><\/p>\n Ainesh Bakshi, Chiranjib Bhattacharyya, Ravi Kannan<\/a>, David Woodruff, Samson Zhou<\/p>\n Spotlight Session: May 6<\/em><\/p>\n Taking Notes on the Fly Helps Language Pre-Training<\/strong><\/p>\n Qiyu Wu, Chen Xing, Yatao Li<\/a>, Guolin Ke<\/a>, Di He<\/a>, Tie-Yan Liu<\/a><\/p>\n MixKD: Towards Efficient Distillation of Large-scale Language Models<\/strong><\/p>\n Kevin J Liang, Weituo Hao, Dinghan Shen<\/strong>, Yufan Zhou, Weizhu Chen<\/a>, Changyou Chen, Lawrence Carin<\/p>\n Rethinking Positional Encoding in Language Pre-training<\/strong><\/p>\n Guolin Ke<\/a>, Di He<\/a>, Tie-Yan Liu<\/a><\/p>\n 01:00 \u2013 03:00 AM | Poster<\/p>\n GraphCodeBERT: Pre-training Code Representations with Data Flow<\/strong><\/a><\/p>\n Daya Guo, Shuo Ren, Shuai Lu, Zhangyin Feng, Duyu Tang<\/a>, Shujie Liu<\/a>, Long Zhou<\/a>, Nan Duan<\/a>, Alexey Svyatkovskiy<\/strong>, Shengyu Fu<\/strong>, Michele Tufano<\/strong>, Shao Kun Deng<\/strong>, Colin Clement<\/strong>, Dawn Drain<\/strong>, Neel Sundaresan<\/strong>, Jian Yin, Daxin Jiang<\/a>, Ming Zhou<\/p>\n 09:00 \u2013 11:00 AM | Poster<\/p>\n Self-Supervised Learning of Compressed Video Representations<\/strong><\/a><\/p>\n Youngjae Yu, Sangho Lee, Gunhee Kim, Yale Song<\/a><\/p>\n Data-Efficient Reinforcement Learning with Self-Predictive Representations<\/strong><\/p>\n Max Schwarzer, Ankesh Anand<\/strong>, Rishab Goel, R Devon Hjelm<\/a>, 
Aaron Courville, Philip Bachman<\/a><\/p>\n Spotlight Session: May 6<\/em><\/p>\n Provable Rich Observation Reinforcement Learning with Combinatorial Latent States<\/strong><\/p>\n Dipendra Misra<\/a>, Qinghua Liu, Chi Jin, John Langford<\/a><\/p>\n Approximate Nearest Neighbor Negative Contrastive Learning for Dense Text Retrieval<\/strong><\/a><\/p>\n Lee Xiong<\/strong>, Chenyan Xiong<\/a>, Ye Li<\/strong>, Kwok-Fung Tang<\/strong>, Jialin Liu<\/strong>, Paul N. Bennett<\/a>, Junaid Ahmed<\/strong>, Arnold Overwijk<\/strong><\/p>\n Systematic Generalisation with Group Invariant Predictions<\/strong><\/p>\n Faruk Ahmed, Yoshua Bengio, Harm van Seijen<\/a>, Aaron Courville<\/p>\n Spotlight Session: May 3<\/em><\/p>\n 01:28 \u2013 01:38 PM | Spotlight<\/p>\n Learning-based Support Estimation in Sublinear Time<\/strong><\/p>\n Talya Eden, Piotr Indyk, Shyam Narayanan, Ronitt Rubinfeld, Sandeep Silwal, Tal Wagner<\/strong><\/p>\n Poster Session: May 3<\/em><\/p>\n 05:00 \u2013 07:00 PM | Poster<\/p>\n CoDA: Contrast-enhanced and Diversity-promoting Data Augmentation for Natural Language Understanding<\/strong><\/p>\n Yanru Qu, Dinghan Shen<\/strong>, Yelong Shen<\/strong>, Sandra Sajeev<\/strong>, Weizhu Chen<\/a>, Jiawei Han<\/p>\n Knowledge Distillation as Semiparametric Inference<\/strong><\/p>\n Tri Dao, Govinda M Kamath<\/strong>, Vasilis Syrgkanis<\/a>, Lester Mackey<\/a><\/p>\n Debiasing Concept-based Explanations with Causal Analysis<\/strong><\/p>\n Mohammad Taha Bahadori, David Heckerman<\/a><\/p>\n Aligning AI with Shared Human Values<\/strong><\/p>\n Dan Hendrycks, Collin Burns, Steven Basart, Andrew Critch, Jerry Li<\/a>, Dawn Song, Jacob Steinhardt<\/p>\n 01:00 \u2013 03:00 AM | Poster<\/p>\n Active Contrastive Learning of Audio-Visual Video Representations<\/strong><\/a><\/p>\n Shuang Ma<\/strong>, Zhaoyang Zeng, Daniel McDuff<\/a>, Yale Song<\/a><\/p>\n Dance Revolution: Long-Term Dance Generation with Music via Curriculum Learning<\/strong><\/p>\n 
Ruozi Huang, Huang Hu<\/strong>, Wei Wu, Kei Sawada, Mi Zhang, Daxin Jiang<\/a><\/p>\n Return-Based Contrastive Representation Learning for Reinforcement Learning<\/strong><\/a><\/p>\n Guoqing Liu, Chuheng Zhang, Li Zhao<\/a>, Tao Qin<\/a>, Jinhua Zhu, Li Jian, Nenghai Yu, Tie-Yan Liu<\/a><\/p>\n Byzantine-Resilient Non-Convex Stochastic Gradient Descent<\/strong><\/p>\n Zeyuan Allen-Zhu<\/a>, Faeze Ebrahimianghazani, Jerry Li<\/a>, Dan Alistarh<\/p>\n 09:00 \u2013 11:00 AM | Poster<\/p>\n Learning to Represent Action Values as a Hypergraph on the Action Vertices<\/strong><\/p>\n Arash Tavakoli, Mehdi Fatemi<\/a>, Petar Kormushev<\/p>\n Optimism in Reinforcement Learning with Generalized Linear Function Approximation<\/strong><\/p>\n Yining Wang, Ruosong Wang, Simon Shaolei Du, Akshay Krishnamurthy<\/a><\/p>\n SEED: Self-supervised Distillation For Visual Representation<\/strong><\/p>\n Zhiyuan Fang, Jianfeng Wang<\/strong>, Lijuan Wang<\/a>, Lei Zhang<\/a>, Yezhou Yang, Zicheng Liu<\/a><\/p>\n 05:00 \u2013 07:00 PM | Poster<\/p>\n Filtered Inner Product Projection for Crosslingual Embedding Alignment<\/strong><\/a><\/p>\n Vin Sachidananda, Ziyi Yang, Chenguang Zhu<\/a><\/p>\n Economic Hyperparameter Optimization With Blended Search Strategy<\/strong><\/p>\n Chi Wang<\/a>, Qingyun Wu<\/a>, Silu Huang<\/a>, Amin Saied<\/strong><\/p>\n ALFWorld: Aligning Text and Embodied Environments for Interactive Learning<\/strong><\/a><\/p>\n Mohit Shridhar, Xingdi Yuan<\/a>, Marc-Alexandre Cote<\/a>, Yonatan Bisk, Adam Trischler<\/a>, Matthew Hausknecht<\/a><\/p>\n AdaFuse: Adaptive Temporal Fusion Network for Efficient Action Recognition<\/strong><\/p>\n Yue Meng, Rameswar Panda, Chung-Ching Lin (opens in new tab)<\/span><\/a>, Prasanna Sattigeri, Leonid Karlinsky, Kate Saenko, Aude Oliva, Rogerio Feris<\/p>\n AdaSpeech: Adaptive Text to Speech for Custom Voice<\/strong><\/p>\n Mingjian Chen, Xu Tan<\/a>, Bohan Li, Yanqing Liu, Tao Qin<\/a>, Sheng Zhao, Tie-Yan Liu<\/a><\/p>\n 
Revisiting Dynamic Convolution via Matrix Decomposition<\/strong><\/p>\n Yunsheng Li, Yinpeng Chen<\/a>, Xiyang Dai<\/strong>, Mengchen Liu<\/strong>, Dongdong Chen<\/strong>, Ye Yu<\/strong>, Lu Yuan<\/a>, Zicheng Liu<\/a>, Mei Chen<\/a>, Nuno Vasconcelos<\/p>\n 07:25 \u2013 07:35 PM | Spotlight<\/p>\n Large Scale Image Completion via Co-Modulated Generative Adversarial Networks<\/strong><\/p>\n Shengyu Zhao<\/strong>, Jonathan Cui, Yilun Sheng<\/strong>, Yue Dong<\/a>, Xiao Liang, Eric I-Chao Chang<\/a>, Yan Xu<\/p>\n Poster Session: May 6<\/em><\/p>\n 07:55 \u2013 08:10 PM | Oral<\/p>\n Deformable DETR: Deformable Transformers for End-to-End Object Detection<\/strong><\/p>\n Xizhou Zhu<\/strong>, Weijie Su<\/strong>, Lewei Lu<\/strong>, Bin Li<\/a>, Xiaogang Wang<\/strong>, Jifeng Dai<\/strong><\/p>\n Poster Session: May 6<\/em><\/p>\n 01:00 \u2013 03:00 AM | Poster<\/p>\n Do not Let Privacy Overbill Utility: Gradient Embedding Perturbation for Private Learning<\/strong><\/a><\/p>\n Da Yu<\/strong>, Huishuai Zhang<\/a>, Wei Chen<\/a>, Tie-Yan Liu<\/a><\/p>\n IOT: Instance-wise Layer Reordering for Transformer Structures<\/strong><\/p>\n Jinhua Zhu, Lijun Wu<\/a>, Yingce Xia<\/a>, Shufang Xie<\/a>, Tao Qin<\/a>, Wengang Zhou, Houqiang Li, Tie-Yan Liu<\/a><\/p>\n Deformable DETR: Deformable Transformers for End-to-End Object Detection<\/strong><\/p>\n Xizhou Zhu<\/strong>, Chat with us!<\/h2>\n
\n
Organizing Committee<\/h2>\n
Accepted papers<\/h2>\n
Monday, May 3<\/h3>\n
\nTuesday, May 4<\/h3>\n
\nWednesday, May 5<\/h3>\n
\nThursday, May 6<\/h3>\n