MLSP-24: Applications in Audio and Speech Processing |
Session Type: Poster |
Time: Wednesday, 9 June, 16:30 - 17:15 |
Location: Gather.Town |
Session Chair: Sven Shepstone, Bang & Olufsen |
MLSP-24.1: WASSERSTEIN BARYCENTER TRANSPORT FOR ACOUSTIC ADAPTATION |
Eduardo Fernandes Montesuma; Universidade Federal do Ceará |
Fred-Maurice Ngolè Mboula; Université Paris-Saclay |
MLSP-24.2: EFFICIENT ADVERSARIAL AUDIO SYNTHESIS VIA PROGRESSIVE UPSAMPLING |
Youngwoo Cho; Korea Advanced Institute of Science and Technology (KAIST) |
Minwook Chang; NCSOFT |
Sanghyeon Lee; Korea Advanced Institute of Science and Technology (KAIST) |
Hyoungwoo Lee; Korea University |
Gerard Jounghyun Kim; Korea University |
Jaegul Choo; Korea Advanced Institute of Science and Technology (KAIST) |
MLSP-24.3: MULTI-CHANNEL SPEECH ENHANCEMENT USING GRAPH NEURAL NETWORKS |
Panagiotis Tzirakis; Facebook |
Anurag Kumar; Facebook |
Jacob Donley; Facebook |
MLSP-24.4: MULTI-DECODER DPRNN: SOURCE SEPARATION FOR VARIABLE NUMBER OF SPEAKERS |
Junzhe Zhu; University of Illinois at Urbana-Champaign |
Raymond Yeh; University of Illinois at Urbana-Champaign |
Mark Hasegawa-Johnson; University of Illinois at Urbana-Champaign |
MLSP-24.5: DATA-EFFICIENT FRAMEWORK FOR REAL-WORLD MULTIPLE SOUND SOURCE 2D LOCALIZATION |
Guillaume Le Moing; Inria, École normale supérieure, CNRS, PSL Research University |
Phongtharin Vinayavekhin; IBM Research |
Don Joven Agravante; IBM Research |
Tadanobu Inoue; IBM Research |
Jayakorn Vongkulbhisal; IBM Research |
Asim Munawar; IBM Research |
Ryuki Tachibana; IBM Research |
MLSP-24.6: FUSING INFORMATION STREAMS IN END-TO-END AUDIO-VISUAL SPEECH RECOGNITION |
Wentao Yu; Ruhr University Bochum |
Steffen Zeiler; Ruhr University Bochum |
Dorothea Kolossa; Ruhr University Bochum |