Paper ID | AUD-18.5 | ||
Paper Title | Multi-Delay Sparse Approach to Residual Crosstalk Reduction for Blind Source Separation | ||
Authors | Satoru Emura, Hiroshi Sawada, Shoko Araki, Noboru Harada, NTT Corporation, Japan | ||
Session | AUD-18: Audio and Speech Source Separation 5: Source Separation | ||
Location | Gather.Town | ||
Session Time | Thursday, 10 June, 13:00 - 13:45 | ||
Presentation Time | Thursday, 10 June, 13:00 - 13:45 | ||
Presentation | Poster | ||
Topic | Audio and Acoustic Signal Processing: [AUD-SEP] Audio and Speech Source Separation | ||
Abstract | To reduce residual crosstalk in the output of blind source separation, we propose a frequency-domain post-filtering method that uses a multi-delay model of complex-valued residual crosstalk and sparsifies the estimates of the source signals. We formulate the reduction of residual crosstalk as an optimization problem using the ℓ1 norm and solve it using the alternating direction method of multipliers (ADMM). The proposed method improved the source-to-interference ratio from 17.8 to 20.5 dB and the source-to-distortion ratio from 10.2 to 11.3 dB when it was combined with a brute-force solver of FastICA and the reverberation time T60 was 300 ms. |
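
As a rough illustration of the kind of solver the abstract refers to, the sketch below applies ADMM to a generic complex-valued ℓ1-regularized least-squares problem. It is not the authors' formulation: the objective 0.5*||y - A x||^2 + lam*||x||_1, the function names `soft_threshold` and `admm_l1`, and the parameters `lam`, `rho`, and `n_iter` are all assumptions made for the example, and the multi-delay crosstalk model itself is not reproduced here.

```python
import numpy as np


def soft_threshold(z, tau):
    """Complex soft-thresholding: shrink each magnitude by tau, keep the phase.

    This is the proximal operator of tau * ||.||_1 for complex vectors.
    """
    mag = np.abs(z)
    # Guard against division by zero for entries that are exactly 0.
    scale = np.maximum(1.0 - tau / np.maximum(mag, 1e-12), 0.0)
    return scale * z


def admm_l1(A, y, lam=0.1, rho=1.0, n_iter=200):
    """Minimize 0.5*||y - A x||_2^2 + lam*||x||_1 over complex x with ADMM.

    Hypothetical generic objective, used only to illustrate the
    x-update / z-update / dual-update structure of ADMM.
    """
    n = A.shape[1]
    AH = A.conj().T
    M = AH @ A + rho * np.eye(n)      # system matrix of the x-update
    AHy = AH @ y
    z = np.zeros(n, dtype=complex)    # sparse copy of x
    u = np.zeros(n, dtype=complex)    # scaled dual variable
    for _ in range(n_iter):
        x = np.linalg.solve(M, AHy + rho * (z - u))  # quadratic step
        z = soft_threshold(x + u, lam / rho)         # sparsifying step
        u = u + x - z                                # dual ascent step
    return z
```

When `A` is the identity, the minimizer reduces to `soft_threshold(y, lam)`, which gives a simple sanity check for the iteration.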