Research
Project page for internal access
Rich media communication: https://microsoftapc.sharepoint.com/teams/rich-media
Computer vision in MCG: https://microsoftapc.sharepoint.com/teams/mcg-vision
Selected papers (* denotes intern at MSRA)
“QDFormer: Towards Robust Audiovisual Segmentation in Complex Environments with Quantization-based Semantic Decomposition”,
Xiang Li*, Jinglu Wang, Xiaohao Xu*, Xiulian Peng, Rita Singh, Yan Lu, Bhiksha Raj; CVPR 2024.
[paper]
“PaintSeg: Training-free Segmentation via Painting”,
Xiang Li*, Chung-Ching Lin, Yinpeng Chen, Jinglu Wang, Zicheng Liu, Bhiksha Raj; NeurIPS 2023.
[paper]
“Towards Noise-Tolerant Speech-Referring Video Object Segmentation: Bridging Speech and Text”,
Xiang Li*, Jinglu Wang, Xiaohao Xu, Muqiao Yang, Rita Singh, Bhiksha Raj; EMNLP 2023.
[paper]
“Rethinking Voice-Face Correlation: A Geometry View”,
Xiang Li*, Yandong Wen, Muqiao Yang, Jinglu Wang, Rita Singh, Bhiksha Raj; ACM-MM 2023.
[paper]
“Towards Robust Referring Video Object Segmentation with Cyclic Relational Consensus”,
Xiang Li*, Jinglu Wang, Xiaohao Xu, Xiao Li, Bhiksha Raj, Yan Lu; ICCV 2023.
[paper]
“Efficient View Synthesis with Neural Radiance Distribution Field”,
Yushuang Wu*, Xiao Li, Jinglu Wang, Xiaoguang Han, Shuguang Cui, Yan Lu; ICCV 2023.
[paper]
“Unsupervised Temporal Correspondence Learning for Unified Video Object Removal”,
Zhongdao Wang*, Jinglu Wang, Xiao Li, Yali Li, Yan Lu, Shengjin Wang; TIP 2023.
[paper]
“Structural Multiplane Image: Bridging Neural View Synthesis and 3D Reconstruction”,
Mingfang Zhang*, Jinglu Wang, Xiao Li, Yifei Huang, Yoichi Sato, Yan Lu; CVPR 2023.
[paper]
“High-Fidelity and Freely Controllable Talking Head Video Generation”,
Yue Gao, Yuan Zhou, Jinglu Wang, Xiao Li, Xiang Ming, Yan Lu; CVPR 2023.
[paper]
“Two-shot Video Object Segmentation”,
Kun Yan*, Xiao Li, Fangyun Wei, Jinglu Wang, Chengbin Zhang, Ping Wang, Yan Lu; CVPR 2023.
[paper]
“Neural Capture of Animatable 3D Human from Monocular Video”,
Gusi Te*, Xiu Li*, Xiao Li, Jinglu Wang, Wei Hu, Yan Lu; ECCV 2022.
[paper]
“Video Instance Segmentation by Instance Flow Assembly”,
Xiang Li*, Jinglu Wang, Xiao Li, Yan Lu; TMM 2022.
[paper]
“Hybrid Instance-Aware Temporal Fusion for Online Video Instance Segmentation”,
Xiang Li*, Jinglu Wang, Xiao Li, Yan Lu; AAAI 2022.
[paper]
“Towards Robust Video Object Segmentation with Adaptive Object Calibration”,
Xiaohao Xu*, Jinglu Wang, Xiang Ming, Yan Lu; ACM-MM 2022.
[paper]
“Reliable Propagation-Correction Modulation for Video Object Segmentation”,
Xiaohao Xu*, Jinglu Wang, Xiao Li, Yan Lu; AAAI 2022.
[paper]
“MonoGRNet: A General Framework for Monocular 3D Object Detection”,
Zengyi Qin*, Jinglu Wang, Yan Lu; T-PAMI 2021.
[paper]
“Weakly-supervised Temporal Action Localization by Uncertainty Modeling”,
Pilhyeon Lee*, Jinglu Wang, Yan Lu, Hyeran Byun; AAAI 2021.
[paper]
“RT-VENet: A Convolutional Network for Real-time Video Enhancement”,
Mohan Zhang*, Qiqi Gao*, Jinglu Wang, Henrik Turbell, David Zhao, Yan Lu; ACM-MM 2020.
[paper]
“Weakly Supervised 3D Object Detection from Point Clouds”,
Zengyi Qin*, Jinglu Wang, Yan Lu; ACM-MM 2020.
[paper]
“Joint Semantic Segmentation and Boundary Detection using Iterative Pyramid Contexts”,
Mingmin Zhen, Jinglu Wang, Lei Zhou, Shiwei Li, Tianwei Shen, Jiaxiang Shang, Tian Fang, Long Quan; CVPR 2020.
[paper]
“Triangulation Learning Network: from Monocular to Stereo 3D Object Detection”,
Zengyi Qin*, Jinglu Wang, Yan Lu; Computer Vision and Pattern Recognition (CVPR) 2019.
[paper][project]
“MVPNet: Multi-View Point Regression Networks for 3D Object Reconstruction from A Single Image”,
Jinglu Wang, Bo Sun*, Yan Lu; The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19). (Oral)
[paper][project]
“MonoGRNet: A Geometric Reasoning Network for Monocular 3D Object Localization”,
Zengyi Qin*, Jinglu Wang, Yan Lu; The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19). (Oral)
[paper][project]
“Learning Fully Dense Neural Networks for Image Semantic Segmentation”,
Mingmin Zhen, Jinglu Wang, Lei Zhou, Tian Fang, Long Quan; The Thirty-Third AAAI Conference on Artificial Intelligence (AAAI-19).
[paper]
“Progressive Large Scale-Invariant Image Matching in Scale Space”,
Lei Zhou, Siyu Zhu, Tianwei Shen, Jinglu Wang, Tian Fang, Long Quan; The IEEE International Conference on Computer Vision (ICCV) 2017.
[paper]
“Color Correction for Image-Based Modelling in the Large”,
Tianwei Shen, Jinglu Wang, Tian Fang, Siyu Zhu, Long Quan; The 13th Asian Conference on Computer Vision (ACCV) 2016.
[paper]
“Image-based Building Regularization Using Structural Linear Features”,
Jinglu Wang, Tian Fang, Qingkun Su, Siyu Zhu, Jingbo Liu, Shengnan Cai, Chiew-Lan Tai, Long Quan; IEEE Transactions on Visualization and Computer Graphics (TVCG) 2015.
[paper]
“Structure-driven Facade Parsing With Irregular Patterns”,
Jinglu Wang, Chun Liu, Tianwei Shen, Long Quan; Asian Conference on Pattern Recognition (ACPR) 2015. (Oral)
[paper]
“Semantic Segmentation of Large-Scale Urban 3D Data with Low Annotation Cost”,
Jinglu Wang, Shiwei Li, Honghui Zhang, Long Quan; Computer Vision and Pattern Recognition (CVPR) Workshop 2015.
[paper]
“Higher-order CRF Structural Segmentation of 3D Reconstructed Surfaces”,
Jingbo Liu, Jinglu Wang, Tian Fang, Chiew-Lan Tai, Long Quan; International Conference on Computer Vision (ICCV) 2015.
[paper]
“Joint Segmentation of Images and Scanned Point Cloud in Large-Scale Street Scenes with Low Annotation Cost”,
Honghui Zhang, Jinglu Wang, Tian Fang, Long Quan; IEEE Transactions on Image Processing (TIP) 2014.
[paper]
“Learning CRFs for Image Parsing with Adaptive Subgradient Descent”,
Honghui Zhang, Jingdong Wang, Ping Tan, Jinglu Wang, Long Quan; IEEE International Conference on Computer Vision (ICCV) 2013.
[paper]