2021 IEEE International Conference on Acoustics, Speech and Signal Processing

Technical Program

Paper ID	MLSP-44.4
Paper Title	SELF-AUGMENTED MULTI-MODAL FEATURE EMBEDDING
Authors	Shinnosuke Matsuo, Seiichi Uchida, Brian Kenji Iwana, Kyushu University, Japan
Session	MLSP-44: Multimodal Data and Applications
Location	Gather.Town
Session Time:	Friday, 11 June, 13:00 - 13:45
Presentation Time:	Friday, 11 June, 13:00 - 13:45
Presentation	Poster
Topic	Machine Learning for Signal Processing: [MLR-LMM] Learning from multimodal data
IEEE Xplore Open Preview	Click here to view in IEEE Xplore
Virtual Presentation	Click here to watch in the Virtual Conference
Abstract	Oftentimes, patterns can be represented through different modalities. For example, leaf data can be in the form of images or contours. Handwritten characters can also be either online or offline. To exploit this fact, we propose the use of self-augmentation and combine it with multi-modal feature embedding. In order to take advantage of the complementary information from the different modalities, the self-augmented multi-modal feature embedding employs a shared feature space. Through experimental results on classification with online handwriting and leaf images, we demonstrate that the proposed method can create effective embeddings.