LEOD-Net: Learning Line-Encoded Bounding Boxes for Real-Time Object Detection.

Ibrahem, Hatem; Salem, Ahmed; Kang, Hyun-Soo

Ibrahem, Hatem; Salem, Ahmed; Kang, Hyun-Soo.

Afiliação

Ibrahem H; Department of Information and Communication Engineering, School of Electrical and Computer Engineering, Chungbuk National University, Cheongju-si 28644, Korea.
Salem A; Department of Information and Communication Engineering, School of Electrical and Computer Engineering, Chungbuk National University, Cheongju-si 28644, Korea.
Kang HS; Electrical Engineering Department, Faculty of Engineering, Assiut University, Assiut 71515, Egypt.

Sensors (Basel) ; 22(10)2022 May 12.

Article em En | MEDLINE | ID: mdl-35632108

ABSTRACT

ABSTRACT

This paper proposes a learnable line encoding technique for bounding boxes commonly used in the object detection task. A bounding box is simply encoded using two main points the top-left corner and the bottom-right corner of the bounding box; then, a lightweight convolutional neural network (CNN) is employed to learn the lines and propose high-resolution line masks for each category of classes using a pixel-shuffle operation. Post-processing is applied to the predicted line masks to filtrate them and estimate clear lines based on a progressive probabilistic Hough transform. The proposed method was trained and evaluated on two common object detection benchmarks Pascal VOC2007 and MS-COCO2017. The proposed model attains high mean average precision (mAP) values (78.8% for VOC2007 and 48.1% for COCO2017) while processing each frame in a few milliseconds (37 ms for PASCAL VOC and 47 ms for COCO). The strength of the proposed method lies in its simplicity and ease of implementation unlike the recent state-of-the-art methods in object detection, which include complex processing pipelines.

Assuntos

Máscaras; Redes Neurais de Computação

Palavras-chave

convolutional neural networks; line detection; object detection; real-time processing

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Redes Neurais de Computação / Máscaras Tipo de estudo: Diagnostic_studies / Prognostic_studies Idioma: En Revista: Sensors (Basel) Ano de publicação: 2022 Tipo de documento: Article

Texto completo

Adicionar na Minha BVS

Imprimir

XML

PubMed Links

Buscar no Google