2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Click on the icon to view the manuscript on IEEE XPlore in the IEEE ICASSP 2021 Open Preview.

SPE-11: Voice Conversion 1: Non-parallel Conversion

Session Type: Poster
Time: Tuesday, 8 June, 16:30 - 17:15
Location: Gather.Town
Virtual Session: View on Virtual Platform
Session Chair: Tomoki Toda, Nagoya University
 
 SPE-11.1: MASKCYCLEGAN-VC: LEARNING NON-PARALLEL VOICE CONVERSION WITH FILLING IN FRAMES
         Takuhiro Kaneko; NTT Corporation
         Hirokazu Kameoka; NTT Corporation
         Kou Tanaka; NTT Corporation
         Nobukatsu Hojo; NTT Corporation
 
 SPE-11.2: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION BY KNOWLEDGE TRANSFER FROM A TEXT-TO-SPEECH MODEL
         Xinyuan Yu; The Hong Kong University of Science and Technology
         Brian Mak; The Hong Kong University of Science and Technology
 
 SPE-11.3: NON-PARALLEL MANY-TO-MANY VOICE CONVERSION USING LOCAL LINGUISTIC TOKENS
         Chao Wang; Soochow University
         Yibiao Yu; Soochow University
 
 SPE-11.4: CRANK: AN OPEN-SOURCE SOFTWARE FOR NONPARALLEL VOICE CONVERSION BASED ON VECTOR-QUANTIZED VARIATIONAL AUTOENCODER
         Kazuhiro Kobayashi; Nagoya University
         Wen-Chin Huang; Nagoya University
         Yi-Chiao Wu; Nagoya University
         Patrick Lumban Tobing; Nagoya University
         Tomoki Hayashi; Nagoya University
         Tomoki Toda; Nagoya University
 
 SPE-11.5: FRAGMENTVC: ANY-TO-ANY VOICE CONVERSION BY END-TO-END EXTRACTING AND FUSING FINE-GRAINED VOICE FRAGMENTS WITH ATTENTION
         Yist Y. Lin; National Taiwan University
         Chung-Ming Chien; National Taiwan University
         Jheng-Hao Lin; National Taiwan University
         Hung-yi Lee; National Taiwan University
         Lin-shan Lee; National Taiwan University
 
 SPE-11.6: ANY-TO-ONE SEQUENCE-TO-SEQUENCE VOICE CONVERSION USING SELF-SUPERVISED DISCRETE SPEECH REPRESENTATIONS
         Wen-Chin Huang; Nagoya University
         Yi-Chiao Wu; Nagoya University
         Tomoki Hayashi; Nagoya University
         Tomoki Toda; Nagoya University