2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

MLSP-24: Applications in Audio and Speech Processing

Session Type: Poster
Time: Wednesday, 9 June, 16:30 - 17:15
Location: Gather.Town
Session Chair: Sven Shepstone, Bang & Olufsen
 
MLSP-24.1: WASSERSTEIN BARYCENTER TRANSPORT FOR ACOUSTIC ADAPTATION
         Eduardo Fernandes Montesuma; Universidade Federal do Ceará
         Fred-Maurice Ngolè Mboula; Université Paris-Saclay
 
MLSP-24.2: EFFICIENT ADVERSARIAL AUDIO SYNTHESIS VIA PROGRESSIVE UPSAMPLING
         Youngwoo Cho; Korea Advanced Institute of Science and Technology (KAIST)
         Minwook Chang; NCSOFT
         Sanghyeon Lee; Korea Advanced Institute of Science and Technology (KAIST)
         Hyoungwoo Lee; Korea University
         Gerard Jounghyun Kim; Korea University
         Jaegul Choo; Korea Advanced Institute of Science and Technology (KAIST)
 
MLSP-24.3: MULTI-CHANNEL SPEECH ENHANCEMENT USING GRAPH NEURAL NETWORKS
         Panagiotis Tzirakis; Facebook
         Anurag Kumar; Facebook
         Jacob Donley; Facebook
 
MLSP-24.4: MULTI-DECODER DPRNN: SOURCE SEPARATION FOR VARIABLE NUMBER OF SPEAKERS
         Junzhe Zhu; University of Illinois at Urbana-Champaign
         Raymond Yeh; University of Illinois at Urbana-Champaign
         Mark Hasegawa-Johnson; University of Illinois at Urbana-Champaign
 
MLSP-24.5: DATA-EFFICIENT FRAMEWORK FOR REAL-WORLD MULTIPLE SOUND SOURCE 2D LOCALIZATION
         Guillaume Le Moing; Inria, Ecole normale superieure, CNRS, PSL Research University
         Phongtharin Vinayavekhin; IBM Research
         Don Joven Agravante; IBM Research
         Tadanobu Inoue; IBM Research
         Jayakorn Vongkulbhisal; IBM Research
         Asim Munawar; IBM Research
         Ryuki Tachibana; IBM Research
 
MLSP-24.6: FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION
         Wentao Yu; Ruhr University Bochum
         Steffen Zeiler; Ruhr University Bochum
         Dorothea Kolossa; Ruhr University Bochum