AUD-32: Audio for Multimedia and Audio Processing Systems |
Session Type: Poster |
Time: Friday, 11 June, 14:00 - 14:45 |
Location: Gather.Town |
Virtual Session: View on Virtual Platform |
Session Chair: Daniele Giacobello, Sonos Inc. |
AUD-32.1: LIGHTWEIGHT AND INTERPRETABLE NEURAL MODELING OF AN AUDIO DISTORTION EFFECT USING HYPERCONDITIONED DIFFERENTIABLE BIQUADS |
Shahan Nercessian; iZotope, Inc. |
Andy Sarroff; iZotope, Inc. |
Kurt James Werner; iZotope, Inc. |
AUD-32.3: ATTACKING AND DEFENDING BEHIND A PSYCHOACOUSTICS-BASED CAPTCHA |
Chih-Hsiang Huang; National Tsing Hua University |
Po-Hao Wu; National Tsing Hua University |
Yi-Wen Liu; National Tsing Hua University |
Shan-Hung Wu; National Tsing Hua University |
AUD-32.4: DOUBLE-DCCCAE: ESTIMATION OF BODY GESTURES FROM SPEECH WAVEFORM |
JinHong Lu; University of Edinburgh |
TianHang Liu; University of Edinburgh |
Shuzhuang Xu; University of Edinburgh |
Hiroshi Shimodaira; University of Edinburgh |
AUD-32.5: AUDIO REPLAY SPOOF ATTACK DETECTION BY JOINT SEGMENT-BASED LINEAR FILTER BANK FEATURE EXTRACTION AND ATTENTION-ENHANCED DENSENET-BILSTM NETWORK |
Lian Huang; University of Macau |
Chi-Man Pun; University of Macau |
AUD-32.6: INVESTIGATING LOCAL AND GLOBAL INFORMATION FOR AUTOMATED AUDIO CAPTIONING WITH TRANSFER LEARNING |
Xuenan Xu; Shanghai Jiao Tong University |
Heinrich Dinkel; Shanghai Jiao Tong University |
Mengyue Wu; Shanghai Jiao Tong University |
Zeyu Xie; Shanghai Jiao Tong University |
Kai Yu; Shanghai Jiao Tong University |