Paper ID | IVMSP-9.1 | ||
Paper Title | MULTIPLE-INPUT MULTIPLE-OUTPUT FUSION NETWORK FOR GENERALIZED ZERO-SHOT LEARNING | ||
Authors | Fangming Zhong, Guangze Wang, Zhikui Chen, Xu Yuan, Feng Xia, Dalian University of Technology, China | ||
Session | IVMSP-9: Zero and Few Shot Learning | ||
Location | Gather.Town | ||
Session Time | Wednesday, 09 June, 13:00 - 13:45 | ||
Presentation Time | Wednesday, 09 June, 13:00 - 13:45 | ||
Presentation | Poster | ||
Topic | Image, Video, and Multidimensional Signal Processing: [IVTEC] Image & Video Processing Techniques | ||
Abstract | Generalized zero-shot learning (GZSL), which trains models on data from seen classes and tests them on data from both seen and unseen classes, has attracted considerable attention recently. Most existing methods attempt to learn a mapping from the visual space to the semantic space, which can easily lead to the domain shift problem. To address this issue, we propose a Multiple-Input Multiple-Output Fusion Network for GZSL. Even when given only the class semantic embeddings, it can generate a common semantic representation similar to that produced from paired inputs. This makes it possible to synthesize pseudo samples from the attributes of unseen classes. Extensive experiments on three benchmark datasets demonstrate the effectiveness of the proposed model. |