Paper ID | AUD-18.5 |
Paper Title |
Multi-Delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation |
Authors |
Satoru Emura, Hiroshi Sawada, Shoko Araki, Noboru Harada, NTT Corporation, Japan |
Session | AUD-18: Audio and Speech Source Separation 5: Source Separation |
Location | Gather.Town |
Session Time: | Thursday, 10 June, 13:00 - 13:45 |
Presentation Time: | Thursday, 10 June, 13:00 - 13:45 |
Presentation |
Poster
|
Topic |
Audio and Acoustic Signal Processing: [AUD-SEP] Audio and Speech Source Separation |
Virtual Presentation |
Click here to watch in the Virtual Conference |
Abstract |
For reducing residual crosstalk in the output of blind source separation, we propose a frequency-domain post-filtering method that uses a multi-delay model of complex-valued residual crosstalk and sparsifies the estimates of the source signals. We formulate the reduction of residual crosstalk as an optimization problem using ℓ 1 norm and solve it using the alternating direction method of multiplier. The proposed method improved the source-to-interference ratio from 17.8 to 20.5 dB and the source-to-distortion ratio from 10.2 to 11.3 dB when it was combined with a brute-force solver of FastICA and reverberation time T 60 was 300 ms. |