2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Paper Detail

Paper IDCI-3.6
Paper Title CATILOC: CAMERA IMAGE TRANSFORMER FOR INDOOR LOCALIZATION
Authors Ali Ghofrani, Rahil Mahdian Toroghi, Iran Broadcasting University (IRIBU), Iran; Seyed Mojtaba Tabatabaie, Alpha Reality, Iran
SessionCI-3: Computational Photography
LocationGather.Town
Session Time:Thursday, 10 June, 15:30 - 16:15
Presentation Time:Thursday, 10 June, 15:30 - 16:15
Presentation Poster
Topic Computational Imaging: [CIS] Computational Imaging Systems
IEEE Xplore Open Preview  Click here to view in IEEE Xplore
Virtual Presentation  Click here to watch in the Virtual Conference
Abstract In this paper the problem of single image indoor camera localization has been addressed. This is a difficult task, since no GPS is available and the training data being gathered for the indoor positioning system could be subject to many modifications such as occlusion, variation of illumination, or repetitive textures and patterns during the test, and these effects can easily fool any positioning system. In this paper, following the idea of self attention and the transformer networks, we customized the feature extraction system and the output extraction block of a recently used transformer in the image recognition task, so that to achieve the camera 3D position and 4D quaternion information. Moreover, an engineering implementation trick was employed, and the results were evaluated on the 7scenes dataset, and compared to the other state-of-the-art methods. The output results show a consistent outperformance with rather a simpler, and faster configuration.