2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

SPE-12: Voice Conversion 2: Low-Resource & Cross-Lingual Conversion

Session Type: Poster
Time: Tuesday, 8 June, 16:30 - 17:15
Location: Gather.Town
Session Chair: Tomoki Toda, Nagoya University
 
SPE-12.1: TOWARDS LOW-RESOURCE STARGAN VOICE CONVERSION USING WEIGHT ADAPTIVE INSTANCE NORMALIZATION
         Mingjie Chen; University of Sheffield
         Yanpei Shi; University of Sheffield
         Thomas Hain; University of Sheffield
 
SPE-12.2: AGAIN-VC: A ONE-SHOT VOICE CONVERSION USING ACTIVATION GUIDANCE AND ADAPTIVE INSTANCE NORMALIZATION
         Yen-Hao Chen; National Taiwan University
         Da-Yi Wu; National Taiwan University
         Tsung-Han Wu; National Taiwan University
         Hung-yi Lee; National Taiwan University
 
SPE-12.3: ONE-SHOT VOICE CONVERSION BASED ON SPEAKER AWARE MODULE
         Ying Zhang; Kwai
         Hao Che; Kwai
         Chenxing Li; Kwai
         Xiaorui Wang; Kwai
         Zhongyuan Wang; Kwai
 
SPE-12.4: ZERO-SHOT VOICE CONVERSION WITH ADJUSTED SPEAKER EMBEDDINGS AND SIMPLE ACOUSTIC FEATURES
         Zhiyuan Tan; College of Intelligence and Computing, Tianjin University
         Jianguo Wei; College of Intelligence and Computing, Tianjin University
         Junhai Xu; College of Intelligence and Computing, Tianjin University
         Yuqing He; College of Intelligence and Computing, Tianjin University
         Wenhuan Lu; College of Intelligence and Computing, Tianjin University
 
SPE-12.5: TOWARDS NATURAL AND CONTROLLABLE CROSS-LINGUAL VOICE CONVERSION BASED ON NEURAL TTS MODEL AND PHONETIC POSTERIORGRAM
         Shengkui Zhao; Speech Lab, Alibaba Group
         Hao Wang; Speech Lab, Alibaba Group
         Trung Hieu Nguyen; Speech Lab, Alibaba Group
         Bin Ma; Speech Lab, Alibaba Group
 
SPE-12.6: MULTI-TASK WAVERNN WITH AN INTEGRATED ARCHITECTURE FOR CROSS-LINGUAL VOICE CONVERSION
         Yi Zhou; National University of Singapore
         Xiaohai Tian; National University of Singapore
         Haizhou Li; National University of Singapore