{"id":578437,"date":"2019-04-11T08:55:08","date_gmt":"2019-04-11T15:55:08","guid":{"rendered":"https:\/\/www.microsoft.com\/en-us\/research\/?post_type=msr-event&p=578437"},"modified":"2025-08-06T11:56:30","modified_gmt":"2025-08-06T18:56:30","slug":"icassp-2019","status":"publish","type":"msr-event","link":"https:\/\/www.microsoft.com\/en-us\/research\/event\/icassp-2019\/","title":{"rendered":"Microsoft @ ICASSP 2019"},"content":{"rendered":"\n\n
Venue:<\/strong> Brighton Conference Centre (opens in new tab)<\/span><\/a><\/p>\n Website:<\/strong> ICASSP 2019 (opens in new tab)<\/span><\/a>Opens in a new tab<\/span><\/p>\n Microsoft is excited to be a Silver sponsor of the 44th<\/sup> International Conference on Acoustics, Speech, and Signal Processing (ICASSP) (opens in new tab)<\/span><\/a> May 12 \u2013 17, 2019, in Brighton, UK. Stop by our booth to chat with our experts, see demos of our latest research and find out more about career opportunities with Microsoft.<\/p>\n Frank K. Soong<\/a> Amy Siebenthaler Opens in a new tab<\/span><\/p>\n A Pitch-Aware Approach to Single-Channel Speech Separation<\/strong> Ke Wang, Frank Soong<\/a>, Lei Xie<\/p>\n A Sparsity Measure for Echo Density Growth in General Environments<\/strong> Helena Peic Tukuljac, Ville Pulkki, Hannes Gamper<\/a>, Keith Godin<\/strong>, Ivan Tashev<\/a>, Nikunj Raghuvanshi<\/a><\/p>\n Blind Room Volume Estimation from Single-Channel Noisy Speech<\/strong> Andrea Genovese, Hannes Gamper<\/a>, Ville Pulkki, Nikunj Raghuvanshi<\/a>, Ivan Tashev<\/a><\/p>\n Improving Binaural Ambisonics Decoding by Spherical Harmonics Domain Tapering and Coloration Compensation<\/strong> Christoph Hold, Hannes Gamper<\/a>, Ville Pulkki, Nikunj Raghuvanshi<\/a>, Ivan Tashev<\/a><\/p>\n Static and Dynamic State Predictions for Acoustic Model Combination<\/strong> Kshitiz Kumar<\/strong>, Yifan Gong <\/strong><\/p>\n Gaussian Process LSTM Recurrent Neural Network Language Models for Speech Recognition<\/strong> Max W.Y. Lam, Xie Chen<\/a>, Shoukang Hu, Jianwei Yu, Xunying Liu, Helen Meng<\/p>\n Investigation of Sampling Techniques for Maximum Entropy Language Modeling Training<\/strong> Xie Chen<\/a>, Jun Zhang<\/strong>, Tasos Anastasakos<\/strong>, Fil Alleva<\/p>\n Recurrent Neural Network Language Model Training Using Natural Gradient<\/strong> Jianwei Yu, Max W.Y. Lam, Xie Chen<\/a>, Shoukang Hu, Songxiang Liu, Xixin Wu, Xunying Liu, Helen Meng<\/p>\n Towards Code-Switching ASR for End-to-End CTC Models<\/strong> Ke Li, Jinyu Li<\/a>, Guoli Ye<\/strong>, Rui Zhao<\/strong>, Yifan Gong<\/strong><\/p>\n Adversarial Speaker Verification<\/strong> Zhong Meng<\/strong>, Yong Zhao<\/strong>, Jinyu Li<\/a>, Yifan Gong<\/strong><\/p>\n Attention in Recurrent Neural Networks for Ransomware Detection<\/strong> Rakshit Agrawal, Jack W. Stokes<\/a>, Karthik Selvaraj<\/strong>, Mady Marinescu<\/strong><\/p>\n Encrypted Speech Recognition Using Deep Polynomial Networks<\/strong> Shixiong Zhang, Yifan Gong<\/strong>, Dong Yu<\/p>\n Single-Channel Speech Extraction Using Speaker Inventory and Attention Network<\/strong> Xiong Xiao<\/strong>, Zhuo Chen<\/strong>, Takuya Yoshioka<\/a>, Hakan Erdogan, Changliang Liu<\/strong>, Dimitrios Dimitriadis<\/a>, Jasha Droppo, Yifan Gong<\/strong><\/p>\n Universal Acoustic Modeling Using Neural Mixture Models<\/strong> Amit Das<\/strong>, Jinyu Li<\/a>, Changliang Liu<\/strong>, Yifan Gong<\/strong><\/p>\n Adversarial Speaker Adaptation<\/strong> Zhong Meng<\/strong>, Jinyu Li<\/a>, Yifan Gong<\/strong><\/p>\n Detecting Cyber Attacks Using Anomaly Detection with Explanations and Expert Feedback<\/strong> Md Amran Siddiqui, Jack W. Stokes<\/a>, Christian Seifert<\/strong>, Evan Argyle<\/strong>, Robert McCann<\/strong>, Joshua Neil<\/strong>, Justin Carroll<\/strong><\/p>\n Directional Interference Suppression Using a Spatial Relative Transfer Function Feature<\/strong> Sebastian Braun<\/a>, Ivan Tashev<\/a><\/p>\n NN-Based Ordinal Regression for Assessing Fluency of ESL Speech<\/strong> Shaoguang Mao, Zhiyong Wu, Jingshuai Jiang<\/strong>, Peiyun Liu<\/strong>, Frank Soong<\/a><\/p>\n Non-Intrusive Speech Quality Assessment Using Neural Networks<\/strong><\/a> Anderson R. Avila, Hannes Gamper<\/a>, Chandan Reddy<\/strong>, Ross Cutler<\/strong>, Ivan Tashev<\/a>, Johannes Gehrke<\/a><\/p>\n Conditional Teacher-Student Learning<\/strong> Zhong Meng<\/strong>, Jinyu Li<\/a>, Yong Zhao<\/strong>, Yifan Gong<\/strong><\/p>\n Decoding Homomorphically Encrypted Flac Audio Without Decryption<\/strong> Yuanyuan Tang, Bin Zhu<\/a>, Xiaojing Ma, Mathiopoulos P. Takis, Xia Xie, Hong Huang<\/p>\n Improving Layer Trajectory LSTM with Future Context Frames<\/strong> Jinyu Li<\/a>, Liang Lu<\/strong>, Changliang Liu<\/strong>, Yifan Gong<\/strong><\/p>\n Contextual Out-of-Domain Utterance Handling with Counterfeit Data Augmentation<\/strong><\/a> Sungjin Lee<\/a>, Igor Shalyminov<\/p>\n Dilated Residual Network with Multi-Head Self-Attention for Speech Emotion Recognition<\/strong> Runnan Li, Zhiyong Wu, Jia Jia, Sheng Zhao<\/strong>, Helen Meng<\/p>\n Attentive Adversarial Learning for Domain-Invariant Training<\/strong> Zhong Meng<\/strong>, Jinyu Li<\/a>, Yifan Gong<\/strong><\/p>\n Speech Super Resolution Generative Adversarial Network<\/strong> Sefik Emre Eskimez, Kazuhito Koishida<\/a><\/p>\n Word Characters and Phone Pronunciation Embedding for ASR Confidence Classifier<\/strong> Session Chair: Ivan Tashev<\/a> Acoustic and Lexical Sentiment Analysis for Customer Service Calls<\/strong><\/a> Bryan Li, Dimitrios Dimitriadis<\/a>, Andreas Stolcke<\/a><\/p>\n Domain Adversarial Training for Improving Keyword Spotting Performance of ESL Speech<\/strong> Session Chairs: Yao Qian, Helen Meng, Frank K. Soong<\/a> Learning Latent Representations for Style Control and Transfer in End-to-End Speech Synthesis<\/strong> Ya-Jie Zhang, Shifeng Pan<\/strong>, Lei He<\/strong>, Zhen-Hua Ling<\/p>\n Low-Latency Speaker-Independent Continuous Speech Separation<\/strong> Takuya Yoshioka<\/a>, Zhuo Chen<\/strong>, Changliang Liu<\/strong>, Xiong Xiao<\/strong>, Hakan Erdogan, Dimitrios Dimitriadis<\/a><\/p>\n Cross Modal Audio Search and Retrieval with Joint Embeddings Based on Text and Audio<\/strong> Benjamin Elizalde, Shuayb Zarar<\/a>, Bhiksha Raj<\/p>\n Opens in a new tab<\/span><\/p>\n","protected":false},"excerpt":{"rendered":" Microsoft is excited to be a Silver sponsor of the 44th International Conference on Acoustics, Speech, and Signal Processing (ICASSP) May 12 \u2013 17, 2019, in Brighton, UK.<\/p>\n","protected":false},"featured_media":581446,"template":"","meta":{"msr-url-field":"","msr-podcast-episode":"","msrModifiedDate":"","msrModifiedDateEnabled":false,"ep_exclude_from_search":false,"_classifai_error":"","msr_startdate":"2019-05-12","msr_enddate":"2019-05-17","msr_location":"Brighton, United Kingdom","msr_expirationdate":"","msr_event_recording_link":"","msr_event_link":"","msr_event_link_redirect":false,"msr_event_time":"","msr_hide_region":false,"msr_private_event":false,"msr_hide_image_in_river":0,"footnotes":""},"research-area":[243062,13545],"msr-region":[239178,243014],"msr-event-type":[197941],"msr-video-type":[],"msr-locale":[268875],"msr-program-audience":[],"msr-post-option":[],"msr-impact-theme":[],"class_list":["post-578437","msr-event","type-msr-event","status-publish","has-post-thumbnail","hentry","msr-research-area-audio-acoustics","msr-research-area-human-language-technologies","msr-region-europe","msr-region-middle-east-africa","msr-event-type-conferences","msr-locale-en_us"],"msr_about":"\n\n Venue:<\/strong> Brighton Conference Centre (opens in new tab)<\/span><\/a><\/p>\n Website:<\/strong> ICASSP 2019 (opens in new tab)<\/span><\/a>Opens in a new tab<\/span><\/p>\n Microsoft is excited to be a Silver sponsor of the 44th<\/sup> International Conference on Acoustics, Speech, and Signal Processing (ICASSP)<\/a> May 12 \u2013 17, 2019, in Brighton, UK. Stop by our booth to chat with our experts, see demos of our latest research and find out more about career opportunities with Microsoft.<\/p>\n Frank K. Soong<\/a> Amy Siebenthaler Opens in a new tab<\/span><\/p>\n A Pitch-Aware Approach to Single-Channel Speech Separation<\/strong> Ke Wang, Frank Soong<\/a>, Lei Xie<\/p>\n A Sparsity Measure for Echo Density Growth in General Environments<\/strong> Helena Peic Tukuljac, Ville Pulkki, Hannes Gamper<\/a>, Keith Godin<\/strong>, Ivan Tashev<\/a>, Nikunj Raghuvanshi<\/a><\/p>\n Blind Room Volume Estimation from Single-Channel Noisy Speech<\/strong> Andrea Genovese, Hannes Gamper<\/a>, Ville Pulkki, Nikunj Raghuvanshi<\/a>, Ivan Tashev<\/a><\/p>\n Improving Binaural Ambisonics Decoding by Spherical Harmonics Domain Tapering and Coloration Compensation<\/strong> Christoph Hold, Hannes Gamper<\/a>, Ville Pulkki, Nikunj Raghuvanshi<\/a>, Ivan Tashev<\/a><\/p>\n Static and Dynamic State Predictions for Acoustic Model Combination<\/strong> Kshitiz Kumar<\/strong>, Yifan Gong <\/strong><\/p>\n Gaussian Process LSTM Recurrent Neural Network Language Models for Speech Recognition<\/strong>Session chairs<\/h3>\n
\nIvan Tashev<\/a>
\nJinyu Li<\/a>
\nDavid Wipf<\/a><\/p>\nMicrosoft attendees<\/h3>\n
\nAndreas Stolcke<\/a>
\nAnthony Stark
\nDimitra Emmanouilidou<\/a>
\nDimitrios Dimitriadis<\/a>
\nEric Sun
\nFei Zuo
\nFrank K. Soong<\/a>
\nHamid Palangi<\/a>
\nHannes Gamper<\/a>
\nIvan Tashev<\/a>
\nJack Stokes
\nJian Wu
\nJianfeng Gao<\/a>
\nJinyu Li<\/a>
\nKazuhito Koishida<\/a>
\nKshitiz Kumar
\nLei He
\nMichael Levit<\/a>
\nMortaza Doulaty
\nNanshan Zeng
\nNikunj Raghuvanshi<\/a>
\nOren Barkan
\nSarangarajan Parthasarathy<\/a>
\nSebastian Braun<\/a>
\nShifeng Pan
\nShuayb Zarar<\/a>
\nSungjin Lee<\/a>
\nTasos Anastasakos
\nXiaoyang Chen
\nXuedong Huang<\/a>
\nYan Huang<\/a>
\nYao Tian
\nYashesh Gaur
\nYifan Gong
\nYong Zhao<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Music Source Separation and Spatial Audio | Poster Area E<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Acoustic Environments and Music Analysis | Poster Area D<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Acoustic Environments and Music Analysis | Poster Area D<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Music Source Separation and Spatial Audio | Poster Area E<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Deep Learning Applications I | Auditorium 2<\/p>\n
\n
\nTuesday, May 14, 2019 | 5:30 PM\u20137:30 PM | Language Modeling, ASR and Punctuation Prediction | Poster Area C<\/p>\n
\nTuesday, May 14, 2019 | 5:30 PM\u20137:30 PM | Language Modeling, ASR and Punctuation Prediction | Poster Area C<\/p>\n
\nTuesday, May 14, 2019 | 5:30 PM\u20137:30 PM | Language Modeling, ASR and Punctuation Prediction | Poster Area C<\/p>\n
\nTuesday, May 14, 2019 | 5:30 PM\u20137:30 PM | Multi-lingual Speech Recognition | Poster Area A<\/p>\n
\n
\nWednesday, May 15, 2019 | 8:30 AM\u201310:30 AM | Features and Robustness for Speaker Identification | Poster Area B<\/p>\n
\nWednesday, May 15, 2019 | 8:30 AM\u201310:30 AM | Deep Learning III | Poster Area G<\/p>\n
\nWednesday, May 15, 2019 | 8:30 AM\u201310:30 AM | Novel Architectures and Training Strategies for ASR | Auditorium 1<\/p>\n
\nWednesday, May 15, 2019 | 8:30 AM\u201310:30 AM | Source Separation and Speech Enhancement I | Meeting Room 1<\/p>\n
\nWednesday, May 15, 2019 | 8:30 AM\u201310:30 AM | Novel Architectures and Training Strategies for ASR | Auditorium 1<\/p>\n
\n
\nWednesday, May 15, 2019 | 1:30 PM\u20133:30 PM | Feature Learning and Adaptation for ASR | Auditorium 1<\/p>\n
\nWednesday, May 15, 2019 | 1:30 PM\u20133:30 PM | Learning Theory and Methods I | Auditorium 2<\/p>\n
\n
\nWednesday, May 15, 2019 | 4:00 PM\u20136:00 PM | Quality Measures and Sensor Array Processing | Poster Area D<\/p>\n
\nWednesday, May 15, 2019 | 4:00 PM\u20136:00 PM | Training Regimes for Emotion and Sentiment Analysis | Poster Area C<\/p>\n
\nWednesday, May 15, 2019 | 4:00 PM\u20136:00 PM | Quality Measures and Sensor Array Processing | Poster Area D<\/p>\n
\n
\nThursday, May 16, 2019 | 8:00 AM\u201310:00 AM | ASR Training Strategies and Toolkits | Poster Area A<\/p>\n
\nThursday, May 16, 2019 | 8:00 AM\u201310:00 AM | Audio Security and Source Separation | Poster Area D<\/p>\n
\n
\nThursday, May 16, 2019 | 1:00 PM\u20133:00 PM | New Features, Models and Representations\/Audio Visual ASR | Poster Area A<\/p>\n
\n
\nThursday, May 16, 2019 | 3:30 PM\u20135:30 PM | Dialogue | Syndicate 1<\/p>\n
\nThursday, May 16, 2019 | 3:30 PM\u20135:30 PM | Architectures for Emotion and Sentiment Analysis | Poster Area B<\/p>\n
\n
\nThursday, May 16, 2019 | 6:00 PM\u20138:00 PM | Robust Speech Recognition | Poster Area A<\/p>\n
\nThursday, May 16, 2019 | 6:00 PM\u20138:00 PM | Audio and Speech Applications | Poster Area G<\/p>\n
\nThursday, May 16, 2019 | 6:00 PM\u20138:00 PM | Signal Processing for Emerging and Practical Applications | Poster Area E<\/p>\n
\nKshitiz Kumar<\/strong>, Tasos Anastasakos<\/strong>, Yifan Gong<\/strong><\/p>\n
\n
\nFriday, May 17, 2019 | 8:30 AM\u201310:30 AM | Using Multiple Perspectives in Emotion and Sentiment Analysis | Syndicate 3<\/p>\n
\nFriday, May 17, 2019 | 8:30 AM\u201310:30 AM | Artificial Intelligence Based Human-Machine Conversation Technology for Interactive Education | Syndicate 1<\/p>\n
\nJingyong Hou, Pengcheng Guo, Sining Sun, Frank K. Soong<\/a>, Wenping Hu<\/a>, Lei Xie<\/p>\n
\nFriday, May 17, 2019 | 8:30 AM\u201310:30 AM | Speech Synthesis II | Poster Area B<\/p>\n
\n
\nFriday, May 17, 2019 | 1:30 PM\u20133:30 PM | Speech Separation, Enhancement and Denoising | Poster Area A<\/p>\n
\n
\nFriday, May 17, 2019 | 4:00 PM\u20136:00 PM | Multimedia Analysis | Poster Area C<\/p>\nSession chairs<\/h3>\n
\nIvan Tashev<\/a>
\nJinyu Li<\/a>
\nDavid Wipf<\/a><\/p>\nMicrosoft attendees<\/h3>\n
\nAndreas Stolcke<\/a>
\nAnthony Stark
\nDimitra Emmanouilidou<\/a>
\nDimitrios Dimitriadis<\/a>
\nEric Sun
\nFei Zuo
\nFrank K. Soong<\/a>
\nHamid Palangi<\/a>
\nHannes Gamper<\/a>
\nIvan Tashev<\/a>
\nJack Stokes
\nJian Wu
\nJianfeng Gao<\/a>
\nJinyu Li<\/a>
\nKazuhito Koishida<\/a>
\nKshitiz Kumar
\nLei He
\nMichael Levit<\/a>
\nMortaza Doulaty
\nNanshan Zeng
\nNikunj Raghuvanshi<\/a>
\nOren Barkan
\nSarangarajan Parthasarathy<\/a>
\nSebastian Braun<\/a>
\nShifeng Pan
\nShuayb Zarar<\/a>
\nSungjin Lee<\/a>
\nTasos Anastasakos
\nXiaoyang Chen
\nXuedong Huang<\/a>
\nYan Huang<\/a>
\nYao Tian
\nYashesh Gaur
\nYifan Gong
\nYong Zhao<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Music Source Separation and Spatial Audio | Poster Area E<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Acoustic Environments and Music Analysis | Poster Area D<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Acoustic Environments and Music Analysis | Poster Area D<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Music Source Separation and Spatial Audio | Poster Area E<\/p>\n
\nTuesday, May 14, 2019 | 1:30 PM\u20133:30 PM | Deep Learning Applications I | Auditorium 2<\/p>\n
\n
\nTuesday, May 14, 2019 | 5:30 PM\u20137:30 PM | Language Modeling, ASR and Punctuation Prediction | Poster Area C<\/p>\n