2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

AUD-33: Topics in Deep Learning for Speech and Audio

Session Type: Poster
Time: Friday, 11 June, 14:00 - 14:45
Location: Gather.Town
Session Chair: Hirokazu Kameoka, Nippon Telegraph and Telephone Corporation
 
 AUD-33.1: UNIDIRECTIONAL MEMORY-SELF-ATTENTION TRANSDUCER FOR ONLINE SPEECH RECOGNITION
         Jian Luo; Ping An Technology (Shenzhen) Co., Ltd.
         Jianzong Wang; Ping An Technology (Shenzhen) Co., Ltd.
         Ning Cheng; Ping An Technology (Shenzhen) Co., Ltd.
         Jing Xiao; Ping An Technology (Shenzhen) Co., Ltd.
 
 AUD-33.2: ACCDOA: ACTIVITY-COUPLED CARTESIAN DIRECTION OF ARRIVAL REPRESENTATION FOR SOUND EVENT LOCALIZATION AND DETECTION
         Kazuki Shimada; Sony Corporation
         Yuichiro Koyama; Sony Corporation
         Naoya Takahashi; Sony Corporation
         Shusuke Takahashi; Sony Corporation
         Yuki Mitsufuji; Sony Corporation
 
 AUD-33.3: SEEN AND UNSEEN EMOTIONAL STYLE TRANSFER FOR VOICE CONVERSION WITH A NEW EMOTIONAL SPEECH DATASET
         Kun Zhou; National University of Singapore
         Berrak Sisman; Singapore University of Technology and Design
         Rui Liu; Singapore University of Technology and Design
         Haizhou Li; National University of Singapore
 
 AUD-33.4: U-CONVOLUTION BASED RESIDUAL ECHO SUPPRESSION WITH MULTIPLE ENCODERS
         Eesung Kim; Kakao Enterprise
         Jae-Jin Jeon; Kakao Enterprise
         Hyeji Seo; Kakao Enterprise
 
 AUD-33.5: A MULTI-CHANNEL TEMPORAL ATTENTION CONVOLUTIONAL NEURAL NETWORK MODEL FOR ENVIRONMENTAL SOUND CLASSIFICATION
         You Wang; Georgia Institute of Technology
         Chuyao Feng; Georgia Institute of Technology
         David Anderson; Georgia Institute of Technology
 
 AUD-33.6: A GENERAL NETWORK ARCHITECTURE FOR SOUND EVENT LOCALIZATION AND DETECTION USING TRANSFER LEARNING AND RECURRENT NEURAL NETWORK
         Thi Ngoc Tho Nguyen; Nanyang Technological University
         Ngoc Khanh Nguyen; Motional
         Huy Phan; Queen Mary University of London
         Lam Pham; Austrian Institute of Technology
         Kenneth Ooi; Nanyang Technological University
         Douglas L. Jones; University of Illinois at Urbana-Champaign
         Woon-Seng Gan; Nanyang Technological University