AUD-33: Topics in Deep Learning for Speech and Audio |
Session Type: Poster |
Time: Friday, 11 June, 14:00 - 14:45 |
Location: Gather.Town |
Virtual Session: View on Virtual Platform |
Session Chair: Hirokazu Kameoka, Nippon Telegraph and Telephone Corporation |
AUD-33.1: UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION |
Jian Luo; Ping An Technology (Shenzhen) Co., Ltd. |
Jianzong Wang; Ping An Technology (Shenzhen) Co., Ltd. |
Ning Cheng; Ping An Technology (Shenzhen) Co., Ltd. |
Jing Xiao; Ping An Technology (Shenzhen) Co., Ltd. |
AUD-33.2: ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION |
Kazuki Shimada; Sony Corporation |
Yuichiro Koyama; Sony Corporation |
Naoya Takahashi; Sony Corporation |
Shusuke Takahashi; Sony Corporation |
Yuki Mitsufuji; Sony Corporation |
AUD-33.3: SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET |
Kun Zhou; National University of Singapore |
Berrak Sisman; Singapore University of Technology and Design |
Rui Liu; Singapore University of Technology and Design |
Haizhou Li; National University of Singapore |
AUD-33.4: U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS |
Eesung Kim; Kakao Enterprise |
Jae-Jin Jeon; Kakao Enterprise |
Hyeji Seo; Kakao Enterprise |
AUD-33.5: A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION |
You Wang; Georgia Institute of Technology |
Chuyao Feng; Georgia Institute of Technology |
David Anderson; Georgia Institute of Technology |
AUD-33.6: A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK |
Thi Ngoc Tho Nguyen; Nanyang Technological University |
Ngoc Khanh Nguyen; Motional |
Huy Phan; Queen Mary University of London |
Lam Pham; Austrian Institute of Technology |
Kenneth Ooi; Nanyang Technological University |
Douglas L. Jones; University of Illinois at Urbana-Champaign |
Woon-Seng Gan; Nanyang Technological University |