2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2021 Open Preview.

AUD-23: Detection and Classification of Acoustic Scenes and Events 4: Datasets and metrics

Session Type: Poster
Time: Thursday, 10 June, 15:30 - 16:15
Location: Gather.Town
Virtual Session: View on Virtual Platform
Session Chair: Scott Wisdom, Google
 
 AUD-23.1: A CURATED DATASET OF URBAN SCENES FOR AUDIO-VISUAL SCENE ANALYSIS
         Shanshan Wang; Tampere University
         Annamaria Mesaros; Tampere University
         Toni Heittola; Tampere University
         Tuomas Virtanen; Tampere University
 
 AUD-23.2: IMPROVING SOUND EVENT DETECTION METRICS: INSIGHTS FROM DCASE 2020
         Giacomo Ferroni; Audio Analytic
         Nicolas Turpault; INRIA
         Juan Azcarreta; Audio Analytic
         Francesco Tuveri; Audio Analytic
         Romain Serizel; LORIA
         Cagdas Bilen; Audio Analytic
         Sacha Krstulovic; Audio Analytic
 
 AUD-23.3: ARTIFICIALLY SYNTHESISING DATA FOR AUDIO CLASSIFICATION AND SEGMENTATION TO IMPROVE SPEECH AND MUSIC DETECTION IN RADIO BROADCAST
         Satvik Venkatesh; University of Plymouth
         David Moffat; University of Plymouth
         Alexis Kirke; University of Plymouth
         Gözel Shakeri; University of Glasgow
         Stephen Brewster; University of Glasgow
         Jörg Fachner; Anglia Ruskin University
         Helen Odell-Miller; Anglia Ruskin University
         Alex Street; Anglia Ruskin University
         Nicolas Farina; Brighton and Sussex Medical School
         Sube Banerjee; University of Plymouth
         Eduardo Reck Miranda; University of Plymouth
 
 AUD-23.4: LSSED: A LARGE-SCALE DATASET AND BENCHMARK FOR SPEECH EMOTION RECOGNITION
         Weiquan Fan; South China University of Technology
         Xiangmin Xu; South China University of Technology
         Xiaofen Xing; South China University of Technology
         Weidong Chen; South China University of Technology
         Dongyan Huang; UBTECH Robotics Corp
 
 AUD-23.5: ENHANCING AUDIO AUGMENTATION METHODS WITH CONSISTENCY LEARNING
         Turab Iqbal; University of Surrey
         Karim Helwani; Amazon Web Services
         Arvindh Krishnaswamy; Amazon Web Services
         Wenwu Wang; University of Surrey
 
 AUD-23.6: FAST THRESHOLD OPTIMIZATION FOR MULTI-LABEL AUDIO TAGGING USING SURROGATE GRADIENT LEARNING
         Thomas Pellegrini; Université de Toulouse III ; IRIT
         Timothée Masquelier; CERCO UMR 5549, CNRS ; Université de Toulouse III