2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

2021 IEEE International Conference on Acoustics, Speech and Signal Processing

6-11 June 2021 • Toronto, Ontario, Canada

Extracting Knowledge from Information

Technical Program

Paper Detail

Paper IDIVMSP-8.3
Paper Title A PLUG AND PLAY FAST INTERSECTION OVER UNION LOSS FOR BOUNDARY BOX REGRESSION
Authors Zengsheng Kuang, Xian Fang, Nankai University, China; Ruixun Zhang, Massachusetts Institute of Technology, China; Xiuli Shao, Hongpeng Wang, Nankai University, China
SessionIVMSP-8: Machine Learning for Image Processing II
LocationGather.Town
Session Time:Wednesday, 09 June, 13:00 - 13:45
Presentation Time:Wednesday, 09 June, 13:00 - 13:45
Presentation Poster
Topic Image, Video, and Multidimensional Signal Processing: [IVTEC] Image & Video Processing Techniques
IEEE Xplore Open Preview  Click here to view in IEEE Xplore
Virtual Presentation  Click here to watch in the Virtual Conference
Abstract Bounding box regression is a very effective method to improve the localization accuracy of object detection. Recently, the IoU-based regression losses have been widely used in object detection algorithms. However, we observe that they degenerate seriously in the late training period, leading to slow convergence and inaccurate localization. In this paper, we design a Fast Intersection over Union (FIoU) loss, which can not only keep the advantages but also solve the weakness of IoU-based losses. Furthermore, FIoU can be directly applied to Non-Maximum Suppression (NMS) as a criterion to improve the localization performance. Numerous experiments on two popular benchmark datasets show that our method is superior to other the-state-of-art methods.