SPE-16: Speech Synthesis 4: Front-end |
Session Type: Poster |
Time: Wednesday, 9 June, 13:00 - 13:45 |
Location: Gather.Town |
Virtual Session: View on Virtual Platform |
Session Chair: Jiangyan Yi, Institute of Automation Chinese Academy of Sciences |
SPE-16.1: GRAPHSPEECH: SYNTAX-AWARE GRAPH ATTENTION NETWORK FOR NEURAL SPEECH SYNTHESIS |
Rui Liu; National University of Singapore |
Berrak Sisman; Singapore University of Technology and Design |
Haizhou Li; National University of Singapore |
SPE-16.2: SYNTACTIC REPRESENTATION LEARNING FOR NEURAL NETWORK BASED TTS WITH SYNTACTIC PARSE TREE TRAVERSAL |
Changhe Song; Tsinghua University |
Jingbei Li; Tsinghua University |
Yixuan Zhou; Tsinghua University |
Zhiyong Wu; Tsinghua University |
Helen Meng; The Chinese University of Hong Kong |
SPE-16.3: A CHAPTER-WISE UNDERSTANDING SYSTEM FOR TEXT-TO-SPEECH IN CHINESE NOVELS |
Junjie Pan; Bytedance |
Lin Wu; Bytedance |
Xiang Yin; Bytedance |
Pengfei Wu; Bytedance |
Chenchang Xu; Bytedance |
Zejun Ma; Bytedance |
SPE-16.4: A UNIVERSAL BERT-BASED FRONT-END MODEL FOR MANDARIN TEXT-TO-SPEECH SYNTHESIS |
Zilong Bai; Ajmide Media Co., Ltd. |
Beibei Hu; Ajmide Media Co., Ltd. |
SPE-16.5: IMPROVING PROSODY MODELLING WITH CROSS-UTTERANCE BERT EMBEDDINGS FOR END-TO-END SPEECH SYNTHESIS |
Guanghui Xu; jd.com |
Wei Song; jd.com |
Zhengchen Zhang; jd.com |
Chao Zhang; jd.com |
Xiaodong He; jd.com |
Bowen Zhou; jd.com |