2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Paper Detail

Paper IDIVMSP-9.1
Paper Title MULTIPLE-INPUT MULTIPLE-OUTPUT FUSION NETWORK FOR GENERALIZED ZERO-SHOT LEARNING
Authors Fangming Zhong, Guangze Wang, Zhikui Chen, Xu Yuan, Feng Xia, Dalian University of Technology, China
SessionIVMSP-9: Zero and Few Short Learning
LocationGather.Town
Session Time:Wednesday, 09 June, 13:00 - 13:45
Presentation Time:Wednesday, 09 June, 13:00 - 13:45
Presentation Poster
Topic Image, Video, and Multidimensional Signal Processing: [IVTEC] Image & Video Processing Techniques
IEEE Xplore Open Preview  Click here to view in IEEE Xplore
Virtual Presentation  Click here to watch in the Virtual Conference
Abstract Generalized zero-shot learning (GZSL) has attracted considerable attention recently, which trains models with data from seen classes and tests on data from both seen and unseen classes. Most of the existing methods attempt to find a mapping from visual space to semantic space, such mapping can easily result in the domain shift problem. To address this issue, we propose a Multiple-Input Multiple-Output Fusion Network to GZSL. It can generate similar common semantic representation to paired inputs even with only the class semantic embeddings. This makes it possible to synthesize pseudo samples from attributes of unseen classes. Extensive experiments carried out on three benchmark datasets show the effectiveness of the proposed model.