2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information
Login Paper Search My Schedule Paper Index Help

My ICASSP 2021 Schedule

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.
  1. Create a login based on your email (takes less than one minute)
  2. Perform 'Paper Search'
  3. Select papers that you desire to save in your personalized schedule
  4. Click on 'My Schedule' to see the current list of selected papers
  5. Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)

Clicking on the Add button next to a session name will add all papers in that session to your custom schedule. Clicking on the session title will show all papers contained within the session and will allow adding of individual papers to your personal schedule.

Note: Times and locations are subject to change.

 
Monday, 7 June
09:30 - 12:00
Challenge Session CHLG-1: COVID-19 Diagnosis
13:00 - 14:45
Challenge Session CHLG-2: ZYELL - NCTUNetwork Anomaly Detection Challenge
15:30 - 17:45
Challenge Session CHLG-3: Multi-Speaker Multi-Style Voice Cloning Challenge (M2VoC)
 
Tuesday, 8 June
13:00 - 13:45
SPE-1: Speech Recognition 1: Neural Transducer Models 1
SPE-2: Speech Recognition 2: Neural transducer Models 2
SPE-3: Speech Synthesis 1: Architecture
SPE-4: Speech Synthesis 2: Controllability
HLT-1: Language Modeling 1: Fusion and Training for End-to-End ASR
HLT-2: Language Modeling 2: Neural Language Models
AUD-1: Audio and Speech Source Separation 1: Speech Separation
AUD-2: Audio and Speech Source Separation 2: Music and Singing Voice Separation
Special Session SS-1: Beamforming for Intelligent Surfaces
IVMSP-1: Object Detection 1
IVMSP-2: Object Detection 2
IFS-1: Multimedia Forensics 1
IFS-2: Multimedia Forensics 2
MLSP-1: Deep Learning Training Methods 1
MLSP-2: Deep Learning Training Methods 2
MLSP-3: Deep Learning Training Methods 3
BIO-1: Brain-Computer Interfaces
BIO-2: Biomedical Signal Processing: Detection and Estimation
BIO-3: Machine Learning for COVID-19 diagnosis
SPTM-1: Detection Theory and Methods 1
SPTM-2: Detection Theory and Methods 2
14:00 - 14:45
SPE-5: Speech Enhancement 1: Speech Separation
SPE-6: Speech Enhancement 2: Speech Separation and Dereverberation
SPE-7: Speaker Recognition 1: Benchmark Evaluation
SPE-8: Speaker Recognition 2: Channel and Domain Robustness
HLT-3: Dialogue Systems 1: General Topics
HLT-4: Dialogue Systems 2: Response Generation
AUD-3: Music Signal Analysis, Processing, and Synthesis 1: Deep Learning
AUD-4: Music Signal Analysis, Processing, and Synthesis 2: Analysis and Processing
Special Session SS-2: Deep Learning Methods for Solving Linear Inverse Problems
Special Session SS-3: Machine Learning in Wireless Networks
IVMSP-3: Image & Video Coding 1
IVMSP-4: Image & Video Coding 2
MMSP-1: Multimedia Signal Processing
MMSP-2: Deep Learning for Multimedia Analysis and Processing
MLSP-4: Machine Learning for Classification Applications 1
MLSP-5: Machine Learning for Classification Applications 2
MLSP-6: Compressed Sensing and Learning
MLSP-7: Tensor Signal Processing
BIO-4: Machine Learning and Signal Processing for Neural Signals
BIO-5: Neuroimaging and Neural Signal Processing
SPTM-3: Estimation, Detection and Learning over Networks 1
SPTM-4: Estimation, Detection and Learning over Networks 2
16:30 - 17:15
SPE-9: Speech Recognition 3: Transformer Models 1
SPE-10: Speech Recognition 4: Transformer Models 2
SPE-11: Voice Conversion 1: Non-parallel Conversion
SPE-12: Voice Conversion 2: Low-Resource & Cross-Lingual Conversion
SPCOM-1: Signal Processing for Networks
SPCOM-2: Information Theory, Coding and Security
AUD-5: Active Noise Control, Echo Reduction, and Feedback Reduction 1: Echo Cancellation
AUD-6: Active Noise Control, Echo Reduction, and Feedback Reduction 2: Active Noise Control and Echo Cancellation
Special Session SS-4: Data Science Methods for COVID-19
IVMSP-5: Super-resolution 1
IVMSP-6: Super-resolution 2 & Multi-scale Processing
ASPS-1: Architectures
ASPS-2: Algorithm/Architecture Co-design
MLSP-8: Learning
MLSP-9: Learning Theory for Neural Networks
MLSP-10: Deep Learning for Speech and Audio
MLSP-11: Self-supervised Learning for Speech Processing
SAM-1: Direction of Arrival Estimation 1
SAM-2: Direction of Arrival Estimation 2
SPTM-5: Sampling, Multirate Signal Processing and Digital Signal Processing 1
SPTM-6: Sampling, Multirate Signal Processing and Digital Signal Processing 2
 
Wednesday, 9 June
08:00 - 09:45
DEMO-1: Show and Tell Demonstrations 1
13:00 - 13:45
SPE-13: Speech Recognition 5: New Algorithms
SPE-14: Speech Recognition 6: New Algorithms for Sparsity/Efficiency
SPE-15: Speech Synthesis 3: Vocoder
SPE-16: Speech Synthesis 4: Front-end
HLT-5: Language Understanding 1: End-to-end Speech Understanding 1
HLT-6: Language Understanding 2: End-to-end Speech Understanding 2
AUD-7: Audio and Speech Source Separation 3: Deep Learning
AUD-8: Audio and Speech Source Separation 4: Multi-Channel Source Separation
Special Session SS-5: Domain Adaptation for Multimedia Signal Processing
IVMSP-7: Machine Learning for Image Processing I
IVMSP-8: Machine Learning for Image Processing II
IVMSP-9: Zero and Few Short Learning
IVMSP-10: Metric Learning and Interpretability
MLSP-12: Federated Learning 1
MLSP-13: Federated Learning 2
MLSP-14: Learning Algorithms 1
MLSP-15: Learning Algorithms 2
BIO-6: Medical Image Segmentation
BIO-7: Medical Image Formation and Reconstruction
SPTM-7: Estimation Theory and Methods 1
SPTM-8: Estimation Theory and Methods 2
14:00 - 14:45
SPE-17: Speech Enhancement 3: Target Speech Extraction
SPE-18: Speech Enhancement 4: Multi-channel Processing
SPE-19: Speaker Recognition 3: Attention and Adversarial
SPE-20: Speaker Recognition 4: Applications
HLT-7: Speech Translation 1: Models
HLT-8: Speech Translation 2: Aspects
AUD-9: Music Information Retrieval and Music Language Processing 1: Beat and Melody
AUD-10: Music Information Retrieval and Music Language Processing 2: Singing Voice
AUD-11: Auditory Modeling and Hearing Instruments
Special Session SS-6: Intelligent Sensing and Communications for Emerging Applications
IVMSP-11: Image & Video Segmentation
IVMSP-12: Image & Video Interpretation and Understanding
MMSP-3: Multimedia Synthesis and Enhancement
MMSP-4: Image, Video and Point Cloud Coding
MLSP-16: ML and Graphs
MLSP-17: Graph Neural Networks
MLSP-18: Matrix Factorization and Applications
MLSP-19: Non-Negative Matrix Factorization
BIO-8: Biological Image Analysis
BIO-9: Medical Image Analysis
SPTM-9: Estimation, Detection and Learning over Networks 3
SPTM-10: Distributed Learning over Graphs
15:30 - 16:15
SPE-21: Speech Recognition 7: Training Methods for End-to-End Modeling
SPE-22: Speech Recognition 8: Multilingual Speech Recognition
SPE-23: Speech Emotion 1: Speech Emotion Recognition
SPE-24: Speech Emotion 2: Neural Networks for Speech Emotion Recognition
SPE-25: Speech Emotion 3: Emotion Recognition - Representations, Data Augmentation
SPE-26: Speaker Verification Spoofing and Countermeasures
AUD-12: Detection and Classification of Acoustic Scenes and Events 1: Few-shot learning
AUD-13: Detection and Classification of Acoustic Scenes and Events 2: Weak supervision
AUD-14: Quality and Intelligibility Measures
Special Session SS-7: Multi-function Radio Frequency System: Radar, Communication, Positioning and Beyond
IVMSP-13: Image Enhancement and Restoration
IVMSP-14: Hyperspectral Imaging
IVMSP-15: Local Descriptors and Texture
IVMSP-16: Point Clouds and Depth
MLSP-20: Attention and Autoencoder Networks
MLSP-21: Generative Neural Networks
MLSP-22: Sequential Learning
CI-1: Theory for Computational Imaging
CI-2: Computational Imaging for Inverse Problems
16:30 - 17:15
SPE-27: Speech Recognition 9: Confidence Measures
SPE-28: Speech Recognition 10: Robustness to Human Speech Variability
SPE-29: Speech Processing 1: Production
SPE-30: Speech Processing 2: General Topics
HLT-9: Style and Text Normalization
HLT-10: Multi-modality in Language
AUD-15: Modeling, Analysis and Synthesis of Acoustic Environments 1: Soundfield Acquisition and Reproduction
AUD-16: Modeling, Analysis and Synthesis of Acoustic Environments 2: Spatial Audio
AUD-17: Modeling, Analysis and Synthesis of Acoustic Environments 3: Acoustic Analysis
Special Session SS-8: Near-ML Decoding of Error-correcting Codes: Algorithms and Implementation
IVMSP-17: Looking at People
IVMSP-18: Faces in Images & Videos
IFS-3: Forensics and Biometrics
IFS-4: Surveillance, Biometrics and Security
MLSP-23: Applications in Music and Audio Processing
MLSP-24: Applications in Audio and Speech Processing
SAM-3: MIMO Radar Array Processing
SAM-4: MIMO and Massive MIMO Array Processing
SPTM-11: Graphs Neural Networks
SPTM-12: Sampling, Filtering and Denoising over Graphs
 
Thursday, 10 June
13:00 - 13:45
SPE-31: Speech Recognition 11: Novel Approaches
SPE-32: Speech Recognition 12: Self-supervised, Semi-supervised, Unsupervised Training
SPE-33: Speech Synthesis 5: Prosody & Style
SPE-34: Speech Synthesis 6: Data Augmentation & Adaptation
HLT-11: Language Understanding 3: Speech Understanding - General Topics
HLT-12: Language Understanding 4: Semantic Understanding
AUD-18: Audio and Speech Source Separation 5: Source Separation
AUD-19: Audio and Speech Source Separation 6: Topics in Source Separation
Special Session SS-9: Contactless and Wireless Sensing for Smart Environments
Special Session SS-10: Computer Audition for Healthcare (CA4H)
IVMSP-19: Deraining and Dehazing
IVMSP-20: Denoising and Deblurring
ASPS-3: IoT
ASPS-4: Autonomous Systems
MLSP-25: Reinforcement Learning 1
MLSP-26: Reinforcement Learning 2
MLSP-27: Reinforcement Learning 3
BIO-10: Deep Learning for EEG Analysis
BIO-11: Deep Learning for Physiological Signals
SPTM-13: Models, Methods and Algorithms 1
SPTM-14: Models, Methods and Algorithms 2
14:00 - 14:45
SPE-35: Speech Enhancement 5: DNS Challenge Task
SPE-36: Speech Enhancement 6: Multi-modal Processing
SPE-37: Speaker Recognition 5: Neural Embedding
SPE-38: Speaker Recognition 6: Self-supervised and Unsupervised Learning
HLT-13: Information Extraction
HLT-14: Language Representations
AUD-20: Music Information Retrieval and Music Language Processing 3: Topics in Music Information Retrieval
AUD-21: Music Information Retrieval and Music Language Processing 4: Structure and Alignment
Special Session SS-11: On-device AI for Audio and Speech Applications
IVMSP-21: Image & Video Quality
IVMSP-22: Image & Video Sensing, Modeling and Representation
MMSP-5: Human Centric Multimedia 1
MMSP-6: Human Centric Multimedia 2
MLSP-28: ML and Time Series
MLSP-29: Deep Learning for Time Series
MLSP-30: Graph Signal Processing
MLSP-31: Recommendation Systems
SAM-5: Microphone Array Signal Processing
SAM-6: Beamforming 1
SPTM-15: Graph Topology Inference and Clustering
SPTM-16: Graph Topology Inference
15:30 - 16:15
SPE-39: Speech Recognition 13: Acoustic Modeling 1
SPE-40: Speech Recognition 14: Acoustic Modeling 2
SPE-41: Voice Activity and Disfluency Detection
SPE-42: Keyword Spotting
SPCOM-3: Beamforming 2
SPCOM-4: Channel Estimation for MIMO and Multiuser Systems
AUD-22: Detection and Classification of Acoustic Scenes and Events 3: Multimodal Scenes and Events
AUD-23: Detection and Classification of Acoustic Scenes and Events 4: Datasets and metrics
Special Session SS-12: Recent Advances in mmWave Radar Sensing for Autonomous Vehicles
IVMSP-23: Applications 1
IVMSP-24: Applications 2
IFS-5: Privacy and Information Security
IFS-6: Anonymization, Security and Privacy
MLSP-32: Optimization Algorithms for Machine Learning
MLSP-33: Optimization Methods
MLSP-34: Subspace Learning and Applications
MLSP-35: Independent Component Analysis
CI-3: Computational Photography
CI-4: Remote Sensing and Coded Aperture Imaging
SPTM-17: Sampling, Multirate Signal Processing and Digital Signal Processing 3
SPTM-18: Sampling Theory, Analysis and Methods
16:30 - 17:15
SPE-43: Speech Recognition 15: Robust Speech Recognition 1
SPE-44: Speech Recognition 16: Robust Speech Recognition 2
SPE-45: Speech Analysis
SPE-46: Corpora and Other Resources
HLT-15: Language Assessment
HLT-16: Applications in Natural Language
AUD-24: Signal Enhancement and Restoration 1: Deep Learning
AUD-25: Signal Enhancement and Restoration 2: Audio Coding and Restoration
AUD-26: Signal Enhancement and Restoration 3: Signal Enhancement
Special Session SS-13: Recent Advances in Multichannel and Multimodal Machine Learning for Speech Applications
IVMSP-25: Tracking
IVMSP-26: Attention for Vision
ASPS-5: Audio & Images
ASPS-6: Sensing & Sensor Processing
ASPS-7: Data Science & Machine Learning
MLSP-36: Pattern Recognition and Classification 1
MLSP-37: Pattern Recognition and Classification 2
MLSP-38: Neural Networks for Clustering and Classification
SAM-7: Detection and Estimation 1
SAM-8: Detection and Estimation 2
SAM-9: Detection and Classification
 
Friday, 11 June
08:00 - 09:45
DEMO-2: Show and Tell Demonstrations 2
11:30 - 12:15
SPE-47: Speech Recognition 17: Speech Adaptation and Normalization
SPE-48: Speech Recognition 18: Low Resource ASR
SPE-49: Speech Synthesis 7: General Topics
SPE-50: Voice Conversion & Speech Synthesis: Singing Voice & Other Topics
SPCOM-5: Detection and Decoding
SPCOM-6: System Design and Optimization
AUD-27: Acoustic Sensor Array Processing 1: Array Design and Calibration
AUD-28: Acoustic Sensor Array Processing 2: Beamforming
AUD-29: Acoustic Sensor Array Processing 3: Acoustic Sensor Arrays
Special Session SS-14: Robust Sensing and Detection in Congested Spectrum
IVMSP-27: Multi-modal Signal Processing
IVMSP-28: Image Synthesis
IFS-7: Information Hiding, Cryptography and Cybersecurity
IFS-8: Watermarking and Data Hiding
MLSP-39: Adversarial Machine Learning
MLSP-40: Contrastive Learning
MLSP-41: Deep Learning Optimization
MLSP-42: Neural Network Pruning
BIO-12: Feature Extraction and Fusion for Biomedical Applications
BIO-13: Deep Learning for Biomedical Applications
SPTM-19: Inference over Graphs
SPTM-20: Signal Processing over Graphs and Sparsity-Aware Signal Processing
13:00 - 13:45
SPE-51: Speech Enhancement 7: Single-channel Processing
SPE-52: Speech Enhancement 8: Echo Cancellation and Other Tasks
SPE-53: Speaker Diarization
SPE-54: End-to-End Speaker Diarization and Recognition
HLT-17: Language Understanding 5: Question Answering and Reading Comprehension
HLT-18: Language Understanding 6: Summarization and Comprehension
AUD-30: Detection and Classification of Acoustic Scenes and Events 5: Scenes
AUD-31: Detection and Classification of Acoustic Scenes and Events 6: Events
SPCOM-7: Communication-enabled Applications
Special Session SS-15: Signal Processing for Collaborative Intelligence
IVMSP-29: Semantic Segmentation
IVMSP-30: Inverse Problems in Image & Video Processing
MMSP-7: Multimodal Perception, Integration and Multisensory Fusion
MMSP-8: Multimedia Retrieval and Signal Detection
MLSP-43: Biomedical Applications
MLSP-44: Multimodal Data and Applications
MLSP-45: Performance Bounds
MLSP-46: Theory and Applications
SAM-10: Sparse Array Design and Processing
SAM-11: Array Calibration and Performance Analysis
SPTM-21: Optimization Methods for Signal Processing
SPTM-22: Signal Processing Theory and Methods
14:00 - 14:45
SPE-55: Language Identification and Low Resource Speech Recognition
SPE-56: Paralinguistics in Speech
SPE-57: Speech, Depression and Sleepiness
SPE-58: Dysarthric Speech Processing
SPCOM-8: Deep learning for communications
SPCOM-9: Online and Active Learning for Communications
AUD-32: Audio for Multimedia and Audio Processing Systems
AUD-33: Topics in Deep Learning for Speech and Audio
AUD-34: Acoustic System Identification and Modeling
Special Session SS-16: Theoretical Foundations of Graph Neural Networks
IVMSP-31: Applications 3
IVMSP-32: Applications 4
IVMSP-33: Action Recognition
IVMSP-34: Inpaiting and Occlusions Handling
MLSP-47: Applications of Machine Learning
MLSP-48: Neural Network Applications
SAM-12: Tracking and Localization
SAM-13: Multi-Channel Data Fusion and Processing
SPTM-23: Bayesian Signal Processing
SPTM-24: Sparsity-aware Processing