Paper ID | IVMSP-17.5 | ||
Paper Title | Lightweight Dual-task Networks for Crowd Counting in Aerial Images | ||
Authors | Ye Tian, Chengzhen Duan, Ruilin Zhang, Zhiwei Wei, Harbin Institute of Technology, Shenzhen, China; Hongpeng Wang, Harbin Institute of Technology, Shenzhen; Peng Cheng Laboratory, China | ||
Session | IVMSP-17: Looking at People | ||
Location | Gather.Town | ||
Session Time: | Wednesday, 09 June, 16:30 - 17:15 | ||
Presentation Time: | Wednesday, 09 June, 16:30 - 17:15 | ||
Presentation | Poster | ||
Topic | Image, Video, and Multidimensional Signal Processing: [IVSMR] Image & Video Sensing, Modeling, and Representation | ||
IEEE Xplore Open Preview | Click here to view in IEEE Xplore | ||
Abstract | As a research hotspot of computer vision, crowd counting methods have achieved success in natural images. But crowd counting in aerial images are rarely explored, and existing methods do not perform well because of the higher resolution, smaller object scale and more complex scene. Therefore, this paper proposes a lightweight dual-task network (LDNet) for crowd counting, which only uses bifurcated structure to overcome these new challenges in aerial images without complicated pipelines. To realize this, a complete but efficient Guidance Branch is proposed to assist Counting Branch in fitting crowd distribution. Furthermore, a scene attention mechanism is used to consider the complex scene information, which are never considered by existing methods. Our LDNet outperforms existing methods on aerial crowd counting dataset (Visdrone), and gets better or comparable results on natural crowd counting datasets (UCF CC 50, UCF QNRF, ShanghaiTech Part A). |