MLSP-11: Self-supervised Learning for Speech Processing |
Session Type: Poster |
Time: Tuesday, 8 June, 16:30 - 17:15 |
Location: Gather.Town |
Session Chair: Isabel Trancoso, INESC-ID / IST, University of Lisbon |
MLSP-11.1: NEURAL AUDIO FINGERPRINT FOR HIGH-SPECIFIC AUDIO RETRIEVAL BASED ON CONTRASTIVE LEARNING |
Sungkyun Chang; Cochlear.ai |
Donmoon Lee; Cochlear.ai, Seoul National University |
Jeongsoo Park; Cochlear.ai |
Hyungui Lim; Cochlear.ai |
Kyogu Lee; Seoul National University |
Karam Ko; SK Telecom |
Yoonchang Han; Cochlear.ai |
MLSP-11.2: SELF-TRAINING AND PRE-TRAINING ARE COMPLEMENTARY FOR SPEECH RECOGNITION |
Qiantong Xu; Facebook AI Research |
Alexei Baevski; Facebook AI Research |
Tatiana Likhomanenko; Facebook AI Research |
Paden Tomasello; Facebook AI Research |
Alexis Conneau; Facebook AI Research |
Ronan Collobert; Facebook AI Research |
Gabriel Synnaeve; Facebook AI Research |
Michael Auli; Facebook AI Research |
MLSP-11.3: UNSUPERVISED DISCRIMINATIVE LEARNING OF SOUNDS FOR AUDIO EVENT CLASSIFICATION |
Sascha Hornauer; University of California, Berkeley |
Ke Li; University of California, Berkeley |
Stella Yu; University of California, Berkeley |
Shabnam Ghaffarzadegan; Robert Bosch LLC |
Liu Ren; Robert Bosch LLC |
MLSP-11.4: SIMILARITY ANALYSIS OF SELF-SUPERVISED SPEECH REPRESENTATIONS |
Yu-An Chung; Massachusetts Institute of Technology |
Yonatan Belinkov; Technion Henry and Marilyn Taub Faculty of Computer Science |
James Glass; Massachusetts Institute of Technology |
MLSP-11.5: JOINT MASKED CPC AND CTC TRAINING FOR ASR |
Chaitanya Talnikar; Facebook |
Tatiana Likhomanenko; Facebook |
Ronan Collobert; Facebook |
Gabriel Synnaeve; Facebook |
MLSP-11.6: A COMPARISON OF DISCRETE LATENT VARIABLE MODELS FOR SPEECH REPRESENTATION LEARNING |
Henry Zhou; University of Toronto |
Alexei Baevski; Facebook AI Research |
Michael Auli; Facebook AI Research |