2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2021 Open Preview.

AUD-30: Detection and Classification of Acoustic Scenes and Events 5: Scenes

Session Type: Poster
Time: Friday, 11 June, 13:00 - 13:45
Location: Gather.Town
Virtual Session: View on Virtual Platform
Session Chair: Mark Cartwright, New York University
 
 AUD-30.1: CROSS-MODAL SPECTRUM TRANSFORMATION NETWORK FOR ACOUSTIC SCENE CLASSIFICATION
         Yang Liu; University of Surrey
         Alexandros Neophytou; Microsoft
         Sunando Sengupta; Microsoft
         Eric Sommerlade; Microsoft
 
 AUD-30.2: DOMESTIC ACTIVITIES CLUSTERING FROM AUDIO RECORDINGS USING CONVOLUTIONAL CAPSULE AUTOENCODER NETWORK
         Ziheng Lin; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
         Yanxiong Li; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
         Zhangjin Huang; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
         Wenhao Zhang; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
         Yufeng Tan; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
         Yichun Chen; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
         Qianhua He; School of Electronic and Information Engineering, South China University of Technology, Guangzhou
 
 AUD-30.3: SOUND EVENT DETECTION AND SEPARATION: A BENCHMARK ON DESED SYNTHETIC SOUNDSCAPES
         Nicolas Turpault; Université de Lorraine, CNRS, Inria, Loria
         Romain Serizel; Université de Lorraine, CNRS, Inria, Loria
         Scott Wisdom; Google Research
         Hakan Erdogan; Google Research
         John R. Hershey; Google Research
         Eduardo Fonseca; Universitat Pompeu Fabra
         Prem Seetharaman; Descript, Inc.
         Justin Salamon; Adobe Research
 
 AUD-30.4: A TWO-STAGE APPROACH TO DEVICE-ROBUST ACOUSTIC SCENE CLASSIFICATION
         Hu Hu; Georgia Institute of Technology
         Chao-Han Yang; Georgia Institute of Technology
         Xianjun Xia; Tencent Media Lab
         Xue Bai; University of Science and Technology of China
         Xin Tang; University of Science and Technology of China
         Yajian Wang; University of Science and Technology of China
         Shutong Niu; University of Science and Technology of China
         Li Chai; University of Science and Technology of China
         Juanjuan Li; Tencent Media Lab
         Hongning Zhu; Tencent Media Lab
         Feng Bao; Tencent Media Lab
         Yuanjun Zhao; Tencent Media Lab
         Sabato Marco Siniscalchi; University of Enna Kore
         Yannan Wang; Tencent Media Lab
         Jun Du; University of Science and Technology of China
         Chin-Hui Lee; Georgia Institute of Technology
 
 AUD-30.5: SUBSPECTRAL NORMALIZATION FOR NEURAL AUDIO DATA PROCESSING
         Simyung Chang; Qualcomm AI Research
         Hyoungwoo Park; Qualcomm AI Research
         Janghoon Cho; Qualcomm AI Research
         Hyunsin Park; Qualcomm AI Research
         Sungrack Yun; Qualcomm AI Research
         Kyuwoong Hwang; Qualcomm AI Research
 
 AUD-30.6: SLOW-FAST AUDITORY STREAMS FOR AUDIO RECOGNITION
         Evangelos Kazakos; University of Bristol
         Arsha Nagrani; University of Oxford
         Andrew Zisserman; University of Oxford
         Dima Damen; University of Bristol