Paper ID | BIO-8.3 | ||
Paper Title | CMIM: CROSS-MODAL INFORMATIONMAXIMIZATION FORMEDICAL IMAGING | ||
Authors | Tristan Sylvain, Universite de Montreal, Canada; Francis Dutil, Tess Berthier, Lisa Di Jorio, Imagia Cybernetics, Canada; Margaux Luck, Mila, Canada; Devon Hjelm, Microsoft Research, United States; Yoshua Bengio, Universite de Montreal, Canada | ||
Session | BIO-8: Biological Image Analysis | ||
Location | Gather.Town | ||
Session Time: | Wednesday, 09 June, 14:00 - 14:45 | ||
Presentation Time: | Wednesday, 09 June, 14:00 - 14:45 | ||
Presentation | Poster | ||
Topic | Biomedical Imaging and Signal Processing: [BIO-BIA] Biological image analysis | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | In hospitals, data are siloed to specific information systems that make the same information available under different modalities such as the different medical imaging exams the patient undergoes (CT scans, MRI, PET, Ultrasound, etc.) and their associated radiology reports. This offers unique opportunities to obtain and use at train-time those multiple views of the same information that might not always be available at test-time. In this paper, we propose an innovative framework that makes the most of available data by learning good representations of a multi-modal input that are resilient to modality dropping at test-time, using recent advances in mutual information maximization. By maximizing cross-modal information at train time, we are able to outperform several state-of-the-art baselines in two different settings, medical image classification, and segmentation. In particular, our method is shown to have a strong impact on the inference-time performance of weaker modalities. |