2021 IEEE International Conference on Acoustics, Speech and Signal Processing

Technical Program

Paper ID	IVMSP-9.1
Paper Title	MULTIPLE-INPUT MULTIPLE-OUTPUT FUSION NETWORK FOR GENERALIZED ZERO-SHOT LEARNING
Authors	Fangming Zhong, Guangze Wang, Zhikui Chen, Xu Yuan, Feng Xia, Dalian University of Technology, China
Session	IVMSP-9: Zero and Few Short Learning
Location	Gather.Town
Session Time:	Wednesday, 09 June, 13:00 - 13:45
Presentation Time:	Wednesday, 09 June, 13:00 - 13:45
Presentation	Poster
Topic	Image, Video, and Multidimensional Signal Processing: [IVTEC] Image & Video Processing Techniques
IEEE Xplore Open Preview	Click here to view in IEEE Xplore
Virtual Presentation	Click here to watch in the Virtual Conference
Abstract	Generalized zero-shot learning (GZSL) has attracted considerable attention recently, which trains models with data from seen classes and tests on data from both seen and unseen classes. Most of the existing methods attempt to find a mapping from visual space to semantic space, such mapping can easily result in the domain shift problem. To address this issue, we propose a Multiple-Input Multiple-Output Fusion Network to GZSL. It can generate similar common semantic representation to paired inputs even with only the class semantic embeddings. This makes it possible to synthesize pseudo samples from attributes of unseen classes. Extensive experiments carried out on three benchmark datasets show the effectiveness of the proposed model.