| SPE-12: Voice Conversion 2: Low-Resource & Cross-Lingual Conversion |
| Session Type: Poster |
| Time: Tuesday, 8 June, 16:30 - 17:15 |
| Location: Gather.Town |
| Session Chair: Tomoki Toda, Nagoya University |
| SPE-12.1: TOWARDS LOW-RESOURCE STARGAN VOICE CONVERSION USING WEIGHT ADAPTIVE INSTANCE NORMALIZATION |
| Mingjie Chen; University of Sheffield |
| Yanpei Shi; University of Sheffield |
| Thomas Hain; University of Sheffield |
| SPE-12.2: AGAIN-VC: A ONE-SHOT VOICE CONVERSION USING ACTIVATION GUIDANCE AND ADAPTIVE INSTANCE NORMALIZATION |
| Yen-Hao Chen; National Taiwan University |
| Da-Yi Wu; National Taiwan University |
| Tsung-Han Wu; National Taiwan University |
| Hung-yi Lee; National Taiwan University |
| SPE-12.3: ONE-SHOT VOICE CONVERSION BASED ON SPEAKER AWARE MODULE |
| Ying Zhang; Kwai |
| Hao Che; Kwai |
| Chenxing Li; Kwai |
| Xiaorui Wang; Kwai |
| Zhongyuan Wang; Kwai |
| SPE-12.4: ZERO-SHOT VOICE CONVERSION WITH ADJUSTED SPEAKER EMBEDDINGS AND SIMPLE ACOUSTIC FEATURES |
| Zhiyuan Tan; College of Intelligence and Computing, Tianjin University |
| Jianguo Wei; College of Intelligence and Computing, Tianjin University |
| Junhai Xu; College of Intelligence and Computing, Tianjin University |
| Yuqing He; College of Intelligence and Computing, Tianjin University |
| Wenhuan Lu; College of Intelligence and Computing, Tianjin University |
| SPE-12.5: TOWARDS NATURAL AND CONTROLLABLE CROSS-LINGUAL VOICE CONVERSION BASED ON NEURAL TTS MODEL AND PHONETIC POSTERIORGRAM |
| Shengkui Zhao; Speech Lab, Alibaba Group |
| Hao Wang; Speech Lab, Alibaba Group |
| Trung Hieu Nguyen; Speech Lab, Alibaba Group |
| Bin Ma; Speech Lab, Alibaba Group |
| SPE-12.6: MULTI-TASK WAVERNN WITH AN INTEGRATED ARCHITECTURE FOR CROSS-LINGUAL VOICE CONVERSION |
| Yi Zhou; National University of Singapore |
| Xiaohai Tian; National University of Singapore |
| Haizhou Li; National University of Singapore |