SPE-11: Voice Conversion 1: Non-parallel Conversion |
Session Type: Poster |
Time: Tuesday, 8 June, 16:30 - 17:15 |
Location: Gather.Town |
Session Chair: Tomoki Toda, Nagoya University |
SPE-11.1: MASKCYCLEGAN-VC: LEARNING NON-PARALLEL VOICE CONVERSION WITH FILLING IN FRAMES |
Takuhiro Kaneko; NTT Corporation |
Hirokazu Kameoka; NTT Corporation |
Kou Tanaka; NTT Corporation |
Nobukatsu Hojo; NTT Corporation |
SPE-11.2: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRANSFER FROM A TEXT-TO-SPEECH MODEL |
Xinyuan Yu; The Hong Kong University of Science and Technology |
Brian Mak; The Hong Kong University of Science and Technology |
SPE-11.3: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING LOCAL LINGUISTIC TOKENS |
Chao Wang; Soochow University |
Yibiao Yu; Soochow University |
SPE-11.4: CRANK: AN OPEN-SOURCE SOFTWARE FOR NONPARALLEL VOICE CONVERSION BASED ON VECTOR-QUANTIZED VARIATIONAL AUTOENCODER |
Kazuhiro Kobayashi; Nagoya University |
Wen-Chin Huang; Nagoya University |
Yi-Chiao Wu; Nagoya University |
Patrick Lumban Tobing; Nagoya University |
Tomoki Hayashi; Nagoya University |
Tomoki Toda; Nagoya University |
SPE-11.5: FRAGMENTVC: ANY-TO-ANY VOICE CONVERSION BY END-TO-END EXTRACTING AND FUSING FINE-GRAINED VOICE FRAGMENTS WITH ATTENTION |
Yist Y. Lin; National Taiwan University |
Chung-Ming Chien; National Taiwan University |
Jheng-Hao Lin; National Taiwan University |
Hung-yi Lee; National Taiwan University |
Lin-shan Lee; National Taiwan University |
SPE-11.6: ANY-TO-ONE SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING SELF-SUPERVISED DISCRETE SPEECH REPRESENTATIONS |
Wen-Chin Huang; Nagoya University |
Yi-Chiao Wu; Nagoya University |
Tomoki Hayashi; Nagoya University |
Tomoki Toda; Nagoya University |