SPE-9: Speech Recognition 3: Transformer Models 1 |
Session Type: Poster |
Time: Tuesday, 8 June, 16:30 - 17:15 |
Location: Gather.Town |
Virtual Session: View on Virtual Platform |
Session Chair: Yangyang Shi, Facebook AI
|
|
SPE-9.1: TRANSFORMER-TRANSDUCERS FOR CODE-SWITCHED SPEECH RECOGNITION |
Siddharth Dalmia; Carnegie Mellon University |
Yuzong Liu; Amazon |
Srikanth Ronanki; Amazon |
Katrin Kirchhoff; Amazon |
|
SPE-9.2: WAKE WORD DETECTION WITH STREAMING TRANSFORMERS |
Yiming Wang; Johns Hopkins University |
Hang Lv; Northwestern Polytechnical University |
Daniel Povey; Xiaomi Corporation |
Lei Xie; Northwestern Polytechnical University |
Sanjeev Khudanpur; Johns Hopkins University |
|
SPE-9.3: CAPTURING MULTI-RESOLUTION CONTEXT BY DILATED SELF-ATTENTION |
Niko Moritz; Mitsubishi Electric Research Laboratories (MERL) |
Takaaki Hori; Mitsubishi Electric Research Laboratories (MERL) |
Jonathan Le Roux; Mitsubishi Electric Research Laboratories (MERL) |
|
SPE-9.4: RECENT DEVELOPMENTS ON ESPNET TOOLKIT BOOSTED BY CONFORMER |
Pengcheng Guo; Northwestern Polytechnical University; Johns Hopkins University |
Florian Boyer; LaBRI, University of Bordeaux; Airudit |
Xuankai Chang; Johns Hopkins University |
Tomoki Hayashi; Nagoya University; Human Dataware Lab. Co., Ltd. |
Yosuke Higuchi; Waseda University |
Hirofumi Inaguma; Kyoto University |
Naoyuki Kamo; NTT Corporation |
Chenda Li; Shanghai Jiao Tong University |
Daniel Garcia-Romero; Johns Hopkins University |
Jiatong Shi; Johns Hopkins University |
Jing Shi; Institute of Automation, Chinese Academy of Sciences, China and Johns Hopkins University |
Shinji Watanabe; Johns Hopkins University, |
Kun Wei; Northwestern Polytechnical University |
Wangyou Zhang; Shanghai Jiao Tong University |
Yuekai Zhang; Johns Hopkins University |
|
SPE-9.5: HIERARCHICAL TRANSFORMER-BASED LARGE-CONTEXT END-TO-END ASR WITH LARGE-CONTEXT KNOWLEDGE DISTILLATION |
Ryo Masumura; NTT Corporation |
Naoki Makishima; NTT Corporation |
Mana Ihori; NTT Corporation |
Akihiko Takashima; NTT Corporation |
Tomohiro Tanaka; NTT Corporation |
Shota Orihashi; NTT Corporation |
|
SPE-9.6: END-TO-END MULTI-CHANNEL TRANSFORMER FOR SPEECH RECOGNITION |
Feng-Ju Chang; Amazon |
Martin Radfar; Amazon |
Athanasios Mouchtaris; Amazon |
Brian King; Amazon |
Siegfried Kunzmann; Amazon |
|