Dual-NMS: A Method for Autonomously Removing False Detection Boxes from Aerial Image Object Detection Results.

Lin, Zhiyuan; Wu, Qingxiao; Fu, Shuangfei; Wang, Sikui; Zhang, Zhongyu; Kong, Yanzi

Lin, Zhiyuan; Wu, Qingxiao; Fu, Shuangfei; Wang, Sikui; Zhang, Zhongyu; Kong, Yanzi.

Afiliação

Lin Z; Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, 110016, China. linzhiyuan@sia.cn.
Wu Q; Institutes for Robotics and Intelligent Manufacturing, Chinese Academy of Sciences, Shenyang, 110169, China. linzhiyuan@sia.cn.
Fu S; University of Chinese Academy of Sciences, Beijing, 100049, China. linzhiyuan@sia.cn.
Wang S; Key Laboratory of Opto-Electronic Information Processing, Chinese Academy of Sciences, Shenyang, 110016, China. linzhiyuan@sia.cn.
Zhang Z; The Key Lab of Image Understanding and Computer Vision, Shenyang 110016, China. linzhiyuan@sia.cn.
Kong Y; Shenyang Institute of Automation, Chinese Academy of Sciences, Shenyang, 110016, China. wuqingxiao@sia.cn.

Sensors (Basel) ; 19(21)2019 Oct 28.

Article em En | MEDLINE | ID: mdl-31661940

ABSTRACT

ABSTRACT

In the field of aerial image object detection based on deep learning, it's difficult to extract features because the images are obtained from a top-down perspective. Therefore, there are numerous false detection boxes. The existing post-processing methods mainly remove overlapped detection boxes, but it's hard to eliminate false detection boxes. The proposed dual non-maximum suppression (dual-NMS) combines the density of detection boxes that are generated for each detected object with the corresponding classification confidence to autonomously remove the false detection boxes. With the dual-NMS as a post-processing method, the precision is greatly improved under the premise of keeping recall unchanged. In vehicle detection in aerial imagery (VEDAI) and dataset for object detection in aerial images (DOTA) datasets, the removal rate of false detection boxes is over 50%. Additionally, according to the characteristics of aerial images, the correlation calculation layer for feature channel separation and the dilated convolution guidance structure are proposed to enhance the feature extraction ability of the network, and these structures constitute the correlation network (CorrNet). Compared with you only look once (YOLOv3), the mean average precision (mAP) of the CorrNet for DOTA increased by 9.78%. Commingled with dual-NMS, the detection effect in aerial images is significantly improved.

Palavras-chave

aerial image; deep learning; density of detection boxes; dual-NMS; false detection boxes; object detection

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Idioma: En Revista: Sensors (Basel) Ano de publicação: 2019 Tipo de documento: Article País de afiliação: China

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google