MLSP-44: Multimodal Data and Applications |
| Session Type: Poster |
| Time: Friday, 11 June, 13:00 - 13:45 |
| Location: Gather.Town |
| Virtual Session: View on Virtual Platform |
| Session Chair: Shi-Xiong Zhang, Tencent AI Lab |
| MLSP-44.1: MULTIMODAL PUNCTUATION PREDICTION WITH CONTEXTUAL DROPOUT |
| Andrew Silva; Georgia Institute of Technology |
| Barry-John Theobald; Apple |
| Nicholas Apostoloff; Apple |
| MLSP-44.2: MULTI-MODAL LABEL DEQUANTIZED GAUSSIAN PROCESS LATENT VARIABLE MODEL FOR ORDINAL LABEL ESTIMATION |
| Masanao Matsumoto; Hokkaido University |
| Keisuke Maeda; Hokkaido University |
| Naoki Saito; National Institute of Technology, Kushiro College |
| Takahiro Ogawa; Hokkaido University |
| Miki Haseyama; Hokkaido University |
| MLSP-44.3: GENERATIVE INFORMATION FUSION |
| Kenneth Tran; North Carolina State University |
| Wesam Sakla; Lawrence Livermore National Laboratory |
| Hamid Krim; North Carolina State University |
| MLSP-44.4: SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING |
| Shinnosuke Matsuo; Kyushu University |
| Seiichi Uchida; Kyushu University |
| Brian Kenji Iwana; Kyushu University |
| MLSP-44.5: OPTIMIZE WHAT MATTERS: TRAINING DNN-HMM KEYWORD SPOTTING MODEL USING END METRIC |
| Ashish Shrivastava; Apple |
| Arnav Kundu; Apple |
| Chandra Dhir; Apple |
| Devang Naik; Apple |
| Oncel Tuzel; Apple |
| MLSP-44.6: CO-ATTENTIONAL TRANSFORMERS FOR STORY-BASED VIDEO UNDERSTANDING |
| Björn Bebensee; Seoul National University |
| Byoung-Tak Zhang; Seoul National University |