2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

SPE-49: Speech Synthesis 7: General Topics

Session Type: Poster
Time: Friday, 11 June, 11:30 - 12:15
Location: Gather.Town
 
SPE-49.1: CONTEXT-AWARE PROSODY CORRECTION FOR TEXT-BASED SPEECH EDITING
         Max Morrison; Northwestern University
         Lucas Rencker; University of Surrey
         Zeyu Jin; Adobe Research
         Nicholas J. Bryan; Adobe Research
         Juan-Pablo Caceres; Adobe Research
         Bryan Pardo; Adobe Research
 
SPE-49.2: FAST DCTTS: EFFICIENT DEEP CONVOLUTIONAL TEXT-TO-SPEECH
         Minsu Kang; Handong Global University
         Jihyun Lee; Handong Global University
         Simin Kim; Handong Global University
         Injung Kim; Handong Global University
 
SPE-49.3: SPEECH PREDICTION IN SILENT VIDEOS USING VARIATIONAL AUTOENCODERS
         Ravindra Yadav; Indian Institute of Technology, Kanpur
         Ashish Sardana; NVIDIA
         Vinay P Namboodiri; University of Bath
         Rajesh M Hegde; Indian Institute of Technology, Kanpur
 
SPE-49.4: LEARNING DISENTANGLED PHONE AND SPEAKER REPRESENTATIONS IN A SEMI-SUPERVISED VQ-VAE PARADIGM
         Jennifer Williams; University of Edinburgh
         Zhao Yi; National Institute for Informatics
         Erica Cooper; National Institute for Informatics
         Junichi Yamagishi; National Institute for Informatics
 
SPE-49.5: HIGH-INTELLIGIBILITY SPEECH SYNTHESIS FOR DYSARTHRIC SPEAKERS WITH LPCNET-BASED TTS AND CYCLEVAE-BASED VC
         Keisuke Matsubara; Kobe University
         Takuma Okamoto; National Institute of Information and Communications Technology
         Ryoichi Takashima; Kobe University
         Tetsuya Takiguchi; Kobe University
         Tomoki Toda; Nagoya University
         Yoshinori Shiga; National Institute of Information and Communications Technology
         Hisashi Kawai; National Institute of Information and Communications Technology
 
SPE-49.6: DENOISPEECH: DENOISING TEXT TO SPEECH WITH FRAME-LEVEL NOISE MODELING
         Chen Zhang; Zhejiang University
         Yi Ren; Zhejiang University
         Xu Tan; Microsoft Research Asia
         Jinglin Liu; Zhejiang University
         Kejun Zhang; Zhejiang University
         Tao Qin; Microsoft Research Asia
         Sheng Zhao; Microsoft Azure Speech
         Tie-Yan Liu; Microsoft Research Asia