SPE-12: Voice Conversion 2: Low-Resource & Cross-Lingual Conversion |
Session Type: Poster |
Time: Tuesday, 8 June, 16:30 - 17:15 |
Location: Gather.Town |
Virtual Session: View on Virtual Platform |
Session Chair: Tomoki Toda, Nagoya University
|
|
SPE-12.1: TOWARDS LOW-RESOURCE STARGAN VOICE CONVERSION USING WEIGHT ADAPTIVE INSTANCE NORMALIZATION |
Mingjie Chen; University of Sheffield |
Yanpei Shi; University of Sheffield |
Thomas Hain; University of Sheffield |
|
SPE-12.2: AGAIN-VC: A ONE-SHOT VOICE CONVERSION USING ACTIVATION GUIDANCE AND ADAPTIVE INSTANCE NORMALIZATION |
Yen-Hao Chen; National Taiwan University |
Da-Yi Wu; National Taiwan University |
Tsung-Han Wu; National Taiwan University |
Hung-yi Lee; National Taiwan University |
|
SPE-12.3: ONE-SHOT VOICE CONVERSION BASED ON SPEAKER AWARE MODULE |
Ying Zhang; Kwai |
Hao Che; Kwai |
Chenxing Li; Kwai |
Xiaorui Wang; Kwai |
Zhongyuan Wang; Kwai |
|
SPE-12.4: ZERO-SHOT VOICE CONVERSION WITH ADJUSTED SPEAKER EMBEDDINGS AND SIMPLE ACOUSTIC FEATURES |
Zhiyuan Tan; College of Intelligence and Computing, Tianjin University |
Jianguo Wei; College of Intelligence and Computing, Tianjin University |
Junhai Xu; College of Intelligence and Computing, Tianjin University |
Yuqing He; College of Intelligence and Computing, Tianjin University |
Wenhuan Lu; College of Intelligence and Computing, Tianjin University |
|
SPE-12.5: TOWARDS NATURAL AND CONTROLLABLE CROSS-LINGUAL VOICE CONVERSION BASED ON NEURAL TTS MODEL AND PHONETIC POSTERIORGRAM |
Shengkui Zhao; Speech Lab, Alibaba Group |
Hao Wang; Speech Lab, Alibaba Group |
Trung Hieu Nguyen; Speech Lab, Alibaba Group |
Bin Ma; Speech Lab, Alibaba Group |
|
SPE-12.6: MULTI-TASK WAVERNN WITH AN INTEGRATED ARCHITECTURE FOR CROSS-LINGUAL VOICE CONVERSION |
Yi Zhou; National University of Singapore |
Xiaohai Tian; National University of Singapore |
Haizhou Li; National University of Singapore |
|