2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

SPE-44: Speech Recognition 16: Robust Speech Recognition 2

Session Type: Poster
Time: Thursday, 10 June, 16:30 - 17:15
Location: Gather.Town
Session Chair: Abdelrahman Mohamed, Facebook AI Research (FAIR)
 
SPE-44.1: AN INVESTIGATION OF END-TO-END MODELS FOR ROBUST SPEECH RECOGNITION
         Archiki Prasad; Indian Institute of Technology, Bombay
         Preethi Jyothi; Indian Institute of Technology, Bombay
         Rajbabu Velmurugan; Indian Institute of Technology, Bombay
 
SPE-44.2: END-TO-END DEREVERBERATION, BEAMFORMING, AND SPEECH RECOGNITION WITH IMPROVED NUMERICAL STABILITY AND ADVANCED FRONTEND
         Wangyou Zhang; Shanghai Jiao Tong University
         Christoph Boeddeker; Paderborn University
         Shinji Watanabe; Johns Hopkins University
         Tomohiro Nakatani; NTT Corporation
         Marc Delcroix; NTT Corporation
         Keisuke Kinoshita; NTT Corporation
         Tsubasa Ochiai; NTT Corporation
         Naoyuki Kamo; NTT Corporation
         Reinhold Haeb-Umbach; Paderborn University
         Yanmin Qian; Shanghai Jiao Tong University
 
SPE-44.3: STREAMING MULTI-SPEAKER ASR WITH RNN-T
         Ilya Sklyar; Amazon
         Anna Piunova; Amazon
         Yulan Liu; Amazon
 
SPE-44.4: IMPROVING RNN TRANSDUCER WITH TARGET SPEAKER EXTRACTION AND NEURAL UNCERTAINTY ESTIMATION
         Jiatong Shi; The Johns Hopkins University
         Chunlei Zhang; Tencent AI Lab
         Chao Weng; Tencent AI Lab
         Shinji Watanabe; The Johns Hopkins University
         Meng Yu; Tencent AI Lab
         Dong Yu; Tencent AI Lab
 
SPE-44.5: A PROGRESSIVE LEARNING APPROACH TO ADAPTIVE NOISE AND SPEECH ESTIMATION FOR SPEECH ENHANCEMENT AND NOISY SPEECH RECOGNITION
         Zhaoxu Nian; University of Science and Technology of China
         Yan-Hui Tu; University of Science and Technology of China
         Jun Du; University of Science and Technology of China
         Chin-Hui Lee; Georgia Institute of Technology
 
SPE-44.6: THE ACCENTED ENGLISH SPEECH RECOGNITION CHALLENGE 2020: OPEN DATASETS, TRACKS, BASELINES, RESULTS AND METHODS
         Xian Shi; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University
         Fan Yu; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University
         Yizhou Lu; SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University
         Yuhao Liang; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University
         Qiangze Feng; Datatang (Beijing) Technology Co., LTD
         Daliang Wang; Datatang (Beijing) Technology Co., LTD
         Yanmin Qian; SpeechLab, Department of Computer Science and Engineering, Shanghai Jiao Tong University
         Lei Xie; Audio, Speech and Language Processing Group (ASLP@NPU), School of Computer Science, Northwestern Polytechnical University