IEEE ICASSP 2021 || Toronto, Ontario, Canada || 6-11 June 2021

Note: Your custom schedule will not be saved unless you create a new account or login to an existing account.

Create a login based on your email (takes less than one minute)
Perform 'Paper Search'
Select papers that you desire to save in your personalized schedule
Click on 'My Schedule' to see the current list of selected papers
Click on 'Printable Version' to create a separate window suitable for printing (the header and menu will appear, but will not actually print)

Clicking on the Add button next to a session name will add all papers in that session to your custom schedule. Clicking on the session title will show all papers contained within the session and will allow adding of individual papers to your personal schedule.

Note: Times and locations are subject to change.


Monday, 7 June
09:30 - 12:00
Challenge Session CHLG-1: COVID-19 Diagnosis
13:00 - 14:45
Challenge Session CHLG-2: ZYELL - NCTUNetwork Anomaly Detection Challenge
15:30 - 17:45
Challenge Session CHLG-3: Multi-Speaker Multi-Style Voice Cloning Challenge (M2VoC)

Tuesday, 8 June
13:00 - 13:45
SPE-1: Speech Recognition 1: Neural Transducer Models 1

SPE-2: Speech Recognition 2: Neural transducer Models 2

SPE-3: Speech Synthesis 1: Architecture

SPE-4: Speech Synthesis 2: Controllability

HLT-1: Language Modeling 1: Fusion and Training for End-to-End ASR

HLT-2: Language Modeling 2: Neural Language Models

AUD-1: Audio and Speech Source Separation 1: Speech Separation

AUD-2: Audio and Speech Source Separation 2: Music and Singing Voice Separation

Special Session SS-1: Beamforming for Intelligent Surfaces

IVMSP-1: Object Detection 1

IVMSP-2: Object Detection 2

IFS-1: Multimedia Forensics 1

IFS-2: Multimedia Forensics 2

MLSP-1: Deep Learning Training Methods 1

MLSP-2: Deep Learning Training Methods 2

MLSP-3: Deep Learning Training Methods 3

BIO-1: Brain-Computer Interfaces

BIO-2: Biomedical Signal Processing: Detection and Estimation

BIO-3: Machine Learning for COVID-19 diagnosis

SPTM-1: Detection Theory and Methods 1

SPTM-2: Detection Theory and Methods 2
14:00 - 14:45
SPE-5: Speech Enhancement 1: Speech Separation

SPE-6: Speech Enhancement 2: Speech Separation and Dereverberation

SPE-7: Speaker Recognition 1: Benchmark Evaluation

SPE-8: Speaker Recognition 2: Channel and Domain Robustness

HLT-3: Dialogue Systems 1: General Topics

HLT-4: Dialogue Systems 2: Response Generation

AUD-3: Music Signal Analysis, Processing, and Synthesis 1: Deep Learning

AUD-4: Music Signal Analysis, Processing, and Synthesis 2: Analysis and Processing

Special Session SS-2: Deep Learning Methods for Solving Linear Inverse Problems

Special Session SS-3: Machine Learning in Wireless Networks

IVMSP-3: Image & Video Coding 1

IVMSP-4: Image & Video Coding 2

MMSP-1: Multimedia Signal Processing

MMSP-2: Deep Learning for Multimedia Analysis and Processing

MLSP-4: Machine Learning for Classification Applications 1

MLSP-5: Machine Learning for Classification Applications 2

MLSP-6: Compressed Sensing and Learning

MLSP-7: Tensor Signal Processing

BIO-4: Machine Learning and Signal Processing for Neural Signals

BIO-5: Neuroimaging and Neural Signal Processing

SPTM-3: Estimation, Detection and Learning over Networks 1

SPTM-4: Estimation, Detection and Learning over Networks 2
16:30 - 17:15
SPE-9: Speech Recognition 3: Transformer Models 1

SPE-10: Speech Recognition 4: Transformer Models 2

SPE-11: Voice Conversion 1: Non-parallel Conversion

SPE-12: Voice Conversion 2: Low-Resource & Cross-Lingual Conversion

SPCOM-1: Signal Processing for Networks

SPCOM-2: Information Theory, Coding and Security

AUD-5: Active Noise Control, Echo Reduction, and Feedback Reduction 1: Echo Cancellation

AUD-6: Active Noise Control, Echo Reduction, and Feedback Reduction 2: Active Noise Control and Echo Cancellation

Special Session SS-4: Data Science Methods for COVID-19

IVMSP-5: Super-resolution 1

IVMSP-6: Super-resolution 2 & Multi-scale Processing

ASPS-1: Architectures

ASPS-2: Algorithm/Architecture Co-design

MLSP-8: Learning

MLSP-9: Learning Theory for Neural Networks

MLSP-10: Deep Learning for Speech and Audio

MLSP-11: Self-supervised Learning for Speech Processing

SAM-1: Direction of Arrival Estimation 1

SAM-2: Direction of Arrival Estimation 2

SPTM-5: Sampling, Multirate Signal Processing and Digital Signal Processing 1

SPTM-6: Sampling, Multirate Signal Processing and Digital Signal Processing 2

Wednesday, 9 June
08:00 - 09:45
DEMO-1: Show and Tell Demonstrations 1
13:00 - 13:45
SPE-13: Speech Recognition 5: New Algorithms

SPE-14: Speech Recognition 6: New Algorithms for Sparsity/Efficiency

SPE-15: Speech Synthesis 3: Vocoder

SPE-16: Speech Synthesis 4: Front-end

HLT-5: Language Understanding 1: End-to-end Speech Understanding 1

HLT-6: Language Understanding 2: End-to-end Speech Understanding 2

AUD-7: Audio and Speech Source Separation 3: Deep Learning

AUD-8: Audio and Speech Source Separation 4: Multi-Channel Source Separation

Special Session SS-5: Domain Adaptation for Multimedia Signal Processing

IVMSP-7: Machine Learning for Image Processing I

IVMSP-8: Machine Learning for Image Processing II

IVMSP-9: Zero and Few Short Learning

IVMSP-10: Metric Learning and Interpretability

MLSP-12: Federated Learning 1

MLSP-13: Federated Learning 2

MLSP-14: Learning Algorithms 1

MLSP-15: Learning Algorithms 2

BIO-6: Medical Image Segmentation

BIO-7: Medical Image Formation and Reconstruction

SPTM-7: Estimation Theory and Methods 1

SPTM-8: Estimation Theory and Methods 2
14:00 - 14:45
SPE-17: Speech Enhancement 3: Target Speech Extraction

SPE-18: Speech Enhancement 4: Multi-channel Processing

SPE-19: Speaker Recognition 3: Attention and Adversarial

SPE-20: Speaker Recognition 4: Applications

HLT-7: Speech Translation 1: Models

HLT-8: Speech Translation 2: Aspects

AUD-9: Music Information Retrieval and Music Language Processing 1: Beat and Melody

AUD-10: Music Information Retrieval and Music Language Processing 2: Singing Voice

AUD-11: Auditory Modeling and Hearing Instruments

Special Session SS-6: Intelligent Sensing and Communications for Emerging Applications

IVMSP-11: Image & Video Segmentation

IVMSP-12: Image & Video Interpretation and Understanding

MMSP-3: Multimedia Synthesis and Enhancement

MMSP-4: Image, Video and Point Cloud Coding

MLSP-16: ML and Graphs

MLSP-17: Graph Neural Networks

MLSP-18: Matrix Factorization and Applications

MLSP-19: Non-Negative Matrix Factorization

BIO-8: Biological Image Analysis

BIO-9: Medical Image Analysis

SPTM-9: Estimation, Detection and Learning over Networks 3

SPTM-10: Distributed Learning over Graphs
15:30 - 16:15
SPE-21: Speech Recognition 7: Training Methods for End-to-End Modeling

SPE-22: Speech Recognition 8: Multilingual Speech Recognition

SPE-23: Speech Emotion 1: Speech Emotion Recognition

SPE-24: Speech Emotion 2: Neural Networks for Speech Emotion Recognition

SPE-25: Speech Emotion 3: Emotion Recognition - Representations, Data Augmentation

SPE-26: Speaker Verification Spoofing and Countermeasures

AUD-12: Detection and Classification of Acoustic Scenes and Events 1: Few-shot learning

AUD-13: Detection and Classification of Acoustic Scenes and Events 2: Weak supervision

AUD-14: Quality and Intelligibility Measures

Special Session SS-7: Multi-function Radio Frequency System: Radar, Communication, Positioning and Beyond

IVMSP-13: Image Enhancement and Restoration

IVMSP-14: Hyperspectral Imaging

IVMSP-15: Local Descriptors and Texture

IVMSP-16: Point Clouds and Depth

MLSP-20: Attention and Autoencoder Networks

MLSP-21: Generative Neural Networks

MLSP-22: Sequential Learning

CI-1: Theory for Computational Imaging

CI-2: Computational Imaging for Inverse Problems
16:30 - 17:15
SPE-27: Speech Recognition 9: Confidence Measures

SPE-28: Speech Recognition 10: Robustness to Human Speech Variability

SPE-29: Speech Processing 1: Production

SPE-30: Speech Processing 2: General Topics

HLT-9: Style and Text Normalization

HLT-10: Multi-modality in Language

AUD-15: Modeling, Analysis and Synthesis of Acoustic Environments 1: Soundfield Acquisition and Reproduction

AUD-16: Modeling, Analysis and Synthesis of Acoustic Environments 2: Spatial Audio

AUD-17: Modeling, Analysis and Synthesis of Acoustic Environments 3: Acoustic Analysis

Special Session SS-8: Near-ML Decoding of Error-correcting Codes: Algorithms and Implementation

IVMSP-17: Looking at People

IVMSP-18: Faces in Images & Videos

IFS-3: Forensics and Biometrics

IFS-4: Surveillance, Biometrics and Security

MLSP-23: Applications in Music and Audio Processing

MLSP-24: Applications in Audio and Speech Processing

SAM-3: MIMO Radar Array Processing

SAM-4: MIMO and Massive MIMO Array Processing

SPTM-11: Graphs Neural Networks

SPTM-12: Sampling, Filtering and Denoising over Graphs

Thursday, 10 June
13:00 - 13:45
SPE-31: Speech Recognition 11: Novel Approaches

SPE-32: Speech Recognition 12: Self-supervised, Semi-supervised, Unsupervised Training

SPE-33: Speech Synthesis 5: Prosody & Style

SPE-34: Speech Synthesis 6: Data Augmentation & Adaptation

HLT-11: Language Understanding 3: Speech Understanding - General Topics

HLT-12: Language Understanding 4: Semantic Understanding

AUD-18: Audio and Speech Source Separation 5: Source Separation

AUD-19: Audio and Speech Source Separation 6: Topics in Source Separation

Special Session SS-9: Contactless and Wireless Sensing for Smart Environments

Special Session SS-10: Computer Audition for Healthcare (CA4H)

IVMSP-19: Deraining and Dehazing

IVMSP-20: Denoising and Deblurring

ASPS-3: IoT

ASPS-4: Autonomous Systems

MLSP-25: Reinforcement Learning 1

MLSP-26: Reinforcement Learning 2

MLSP-27: Reinforcement Learning 3

BIO-10: Deep Learning for EEG Analysis

BIO-11: Deep Learning for Physiological Signals

SPTM-13: Models, Methods and Algorithms 1

SPTM-14: Models, Methods and Algorithms 2
14:00 - 14:45
SPE-35: Speech Enhancement 5: DNS Challenge Task

SPE-36: Speech Enhancement 6: Multi-modal Processing

SPE-37: Speaker Recognition 5: Neural Embedding

SPE-38: Speaker Recognition 6: Self-supervised and Unsupervised Learning

HLT-13: Information Extraction

HLT-14: Language Representations

AUD-20: Music Information Retrieval and Music Language Processing 3: Topics in Music Information Retrieval

AUD-21: Music Information Retrieval and Music Language Processing 4: Structure and Alignment

Special Session SS-11: On-device AI for Audio and Speech Applications

IVMSP-21: Image & Video Quality

IVMSP-22: Image & Video Sensing, Modeling and Representation

MMSP-5: Human Centric Multimedia 1

MMSP-6: Human Centric Multimedia 2

MLSP-28: ML and Time Series

MLSP-29: Deep Learning for Time Series

MLSP-30: Graph Signal Processing

MLSP-31: Recommendation Systems

SAM-5: Microphone Array Signal Processing

SAM-6: Beamforming 1

SPTM-15: Graph Topology Inference and Clustering

SPTM-16: Graph Topology Inference
15:30 - 16:15
SPE-39: Speech Recognition 13: Acoustic Modeling 1

SPE-40: Speech Recognition 14: Acoustic Modeling 2

SPE-41: Voice Activity and Disfluency Detection

SPE-42: Keyword Spotting

SPCOM-3: Beamforming 2

SPCOM-4: Channel Estimation for MIMO and Multiuser Systems

AUD-22: Detection and Classification of Acoustic Scenes and Events 3: Multimodal Scenes and Events

AUD-23: Detection and Classification of Acoustic Scenes and Events 4: Datasets and metrics

Special Session SS-12: Recent Advances in mmWave Radar Sensing for Autonomous Vehicles

IVMSP-23: Applications 1

IVMSP-24: Applications 2

IFS-5: Privacy and Information Security

IFS-6: Anonymization, Security and Privacy

MLSP-32: Optimization Algorithms for Machine Learning

MLSP-33: Optimization Methods

MLSP-34: Subspace Learning and Applications

MLSP-35: Independent Component Analysis

CI-3: Computational Photography

CI-4: Remote Sensing and Coded Aperture Imaging

SPTM-17: Sampling, Multirate Signal Processing and Digital Signal Processing 3

SPTM-18: Sampling Theory, Analysis and Methods
16:30 - 17:15
SPE-43: Speech Recognition 15: Robust Speech Recognition 1

SPE-44: Speech Recognition 16: Robust Speech Recognition 2

SPE-45: Speech Analysis

SPE-46: Corpora and Other Resources

HLT-15: Language Assessment

HLT-16: Applications in Natural Language

AUD-24: Signal Enhancement and Restoration 1: Deep Learning

AUD-25: Signal Enhancement and Restoration 2: Audio Coding and Restoration

AUD-26: Signal Enhancement and Restoration 3: Signal Enhancement

Special Session SS-13: Recent Advances in Multichannel and Multimodal Machine Learning for Speech Applications

IVMSP-25: Tracking

IVMSP-26: Attention for Vision

ASPS-5: Audio & Images

ASPS-6: Sensing & Sensor Processing

ASPS-7: Data Science & Machine Learning

MLSP-36: Pattern Recognition and Classification 1

MLSP-37: Pattern Recognition and Classification 2

MLSP-38: Neural Networks for Clustering and Classification

SAM-7: Detection and Estimation 1

SAM-8: Detection and Estimation 2

SAM-9: Detection and Classification

Friday, 11 June
08:00 - 09:45
DEMO-2: Show and Tell Demonstrations 2
11:30 - 12:15
SPE-47: Speech Recognition 17: Speech Adaptation and Normalization

SPE-48: Speech Recognition 18: Low Resource ASR

SPE-49: Speech Synthesis 7: General Topics

SPE-50: Voice Conversion & Speech Synthesis: Singing Voice & Other Topics

SPCOM-5: Detection and Decoding

SPCOM-6: System Design and Optimization

AUD-27: Acoustic Sensor Array Processing 1: Array Design and Calibration

AUD-28: Acoustic Sensor Array Processing 2: Beamforming

AUD-29: Acoustic Sensor Array Processing 3: Acoustic Sensor Arrays

Special Session SS-14: Robust Sensing and Detection in Congested Spectrum

IVMSP-27: Multi-modal Signal Processing

IVMSP-28: Image Synthesis

IFS-7: Information Hiding, Cryptography and Cybersecurity

IFS-8: Watermarking and Data Hiding

MLSP-39: Adversarial Machine Learning

MLSP-40: Contrastive Learning

MLSP-41: Deep Learning Optimization

MLSP-42: Neural Network Pruning

BIO-12: Feature Extraction and Fusion for Biomedical Applications

BIO-13: Deep Learning for Biomedical Applications

SPTM-19: Inference over Graphs

SPTM-20: Signal Processing over Graphs and Sparsity-Aware Signal Processing
13:00 - 13:45
SPE-51: Speech Enhancement 7: Single-channel Processing

SPE-52: Speech Enhancement 8: Echo Cancellation and Other Tasks

SPE-53: Speaker Diarization

SPE-54: End-to-End Speaker Diarization and Recognition

HLT-17: Language Understanding 5: Question Answering and Reading Comprehension

HLT-18: Language Understanding 6: Summarization and Comprehension

AUD-30: Detection and Classification of Acoustic Scenes and Events 5: Scenes

AUD-31: Detection and Classification of Acoustic Scenes and Events 6: Events

SPCOM-7: Communication-enabled Applications

Special Session SS-15: Signal Processing for Collaborative Intelligence

IVMSP-29: Semantic Segmentation

IVMSP-30: Inverse Problems in Image & Video Processing

MMSP-7: Multimodal Perception, Integration and Multisensory Fusion

MMSP-8: Multimedia Retrieval and Signal Detection

MLSP-43: Biomedical Applications

MLSP-44: Multimodal Data and Applications

MLSP-45: Performance Bounds

MLSP-46: Theory and Applications

SAM-10: Sparse Array Design and Processing

SAM-11: Array Calibration and Performance Analysis

SPTM-21: Optimization Methods for Signal Processing

SPTM-22: Signal Processing Theory and Methods
14:00 - 14:45
SPE-55: Language Identification and Low Resource Speech Recognition

SPE-56: Paralinguistics in Speech

SPE-57: Speech, Depression and Sleepiness

SPE-58: Dysarthric Speech Processing

SPCOM-8: Deep learning for communications

SPCOM-9: Online and Active Learning for Communications

AUD-32: Audio for Multimedia and Audio Processing Systems

AUD-33: Topics in Deep Learning for Speech and Audio

AUD-34: Acoustic System Identification and Modeling

Special Session SS-16: Theoretical Foundations of Graph Neural Networks

IVMSP-31: Applications 3

IVMSP-32: Applications 4

IVMSP-33: Action Recognition

IVMSP-34: Inpaiting and Occlusions Handling

MLSP-47: Applications of Machine Learning

MLSP-48: Neural Network Applications

SAM-12: Tracking and Localization

SAM-13: Multi-Channel Data Fusion and Processing

SPTM-23: Bayesian Signal Processing

SPTM-24: Sparsity-aware Processing

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

My ICASSP 2021 Schedule