Multi-Channel Convolutional Neural Network Based 3D Object Detection for Indoor Robot Environmental Perception.

Wang, Li; Li, Ruifeng; Shi, Hezi; Sun, Jingwen; Zhao, Lijun; Seah, Hock Soon; Quah, Chee Kwang; Tandianus, Budianto

Wang, Li; Li, Ruifeng; Shi, Hezi; Sun, Jingwen; Zhao, Lijun; Seah, Hock Soon; Quah, Chee Kwang; Tandianus, Budianto.

Afiliação

Wang L; State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin 150001, China. 15b908017@hit.edu.cn.
Li R; State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin 150001, China. lrf100@hit.edu.cn.
Shi H; EON Reality Pte Ltd, Singapore 138567, Singapore. hezi.shi@eonreality.com.
Sun J; State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin 150001, China. 18S008061@stu.hit.edu.cn.
Zhao L; State Key Laboratory of Robotics and System, Harbin Institute of Technology, Harbin 150001, China. zhaolj@hit.edu.cn.
Seah HS; School of Computer Science and Engineering, Nanyang Technological University, Singapore 639798, Singapore. ashsseah@ntu.edu.sg.
Quah CK; ST Electronics (Training & Simulation Systems) Pte Ltd, Singapore 567714, Singapore. quah.cheekwang@stee.stengg.com.
Tandianus B; ST Engineering-NTU Corporate Laboratory, School of Electrical and Electronic Engineering, Nanyang Technological University, Singapore 637335, Singapore. btandianus@ntu.edu.sg.

Sensors (Basel) ; 19(4)2019 Feb 21.

Article em En | MEDLINE | ID: mdl-30795507

ABSTRACT

ABSTRACT

Environmental perception is a vital feature for service robots when working in an indoor environment for a long time. The general 3D reconstruction is a low-level geometric information description that cannot convey semantics. In contrast, higher level perception similar to humans requires more abstract concepts, such as objects and scenes. Moreover, the 2D object detection based on images always fails to provide the actual position and size of an object, which is quite important for a robot's operation. In this paper, we focus on the 3D object detection to regress the object's category, 3D size, and spatial position through a convolutional neural network (CNN). We propose a multi-channel CNN for 3D object detection, which fuses three input channels including RGB, depth, and bird's eye view (BEV) images. We also propose a method to generate 3D proposals based on 2D ones in the RGB image and semantic prior. Training and test are conducted on the modified NYU V2 dataset and SUN RGB-D dataset in order to verify the effectiveness of the algorithm. We also carry out the actual experiments in a service robot to utilize the proposed 3D object detection method to enhance the environmental perception of the robot.

Palavras-chave

3D object detection; environmental perception; indoor robot; multi-channel CNN

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Base de dados: MEDLINE Tipo de estudo: Diagnostic_studies Idioma: En Ano de publicação: 2019 Tipo de documento: Article