Enhanced Spatial and Extended Temporal Graph Convolutional Network for Skeleton-Based Action Recognition.

Li, Fanjia; Li, Juanjuan; Zhu, Aichun; Xu, Yonggang; Yin, Hongsheng; Hua, Gang

Affiliation

Li F; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China.
Li J; Jiangsu Province Xuzhou Technician Institute, Xuzhou 221151, China.
Zhu A; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China.
Xu Y; School of Computer Science and Technology, Nanjing Tech University, Nanjing 211800, China.
Yin H; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China.
Hua G; School of Information and Control Engineering, China University of Mining and Technology, Xuzhou 221008, China.

Sensors (Basel); 20(18), 2020 Sep 15.

Article in English | MEDLINE | ID: mdl-32942579

ABSTRACT

In skeleton-based human action recognition, spatial-temporal graph convolutional networks (ST-GCNs) have recently made great progress. However, they use only one fixed temporal convolution kernel, which is not enough to extract temporal cues comprehensively. Moreover, simply connecting the spatial graph convolution layer (GCL) and the temporal GCL in series is not the optimal solution. To address these issues, we propose a novel enhanced spatial and extended temporal graph convolutional network (EE-GCN). Three convolution kernels of different sizes are chosen to extract discriminative temporal features from shorter to longer terms. The corresponding GCLs are then concatenated by a powerful yet efficient one-shot aggregation (OSA) + effective squeeze-excitation (eSE) structure. The OSA module aggregates the features from each layer once at the output, and the eSE module explores the interdependency between the channels of that output. In addition, we propose a new connection paradigm to enhance the spatial features, which expands the serial connection into a combination of serial and parallel connections by adding a spatial GCL in parallel with the temporal GCLs. The proposed method is evaluated on three large-scale datasets, and the experimental results show that its performance exceeds that of previous state-of-the-art methods.
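
The block structure described in the abstract can be illustrated with a minimal NumPy sketch. It is not the authors' implementation. The kernel sizes (3, 5, 7), the plain sigmoid gate, the ReLU activations, the fusion of the parallel spatial branch by addition, and all function and parameter names (`temporal_conv`, `spatial_gcl`, `ese`, `ee_block`, `make_params`) are assumptions made for illustration only; the abstract does not specify them.

```python
import numpy as np

def temporal_conv(x, w):
    """Convolve along time with zero 'same' padding.
    x: (C_in, T, V) features, w: (C_out, C_in, K) with odd K."""
    c_out, _, k = w.shape
    pad, t = k // 2, x.shape[1]
    xp = np.pad(x, ((0, 0), (pad, pad), (0, 0)))
    out = np.zeros((c_out, t, x.shape[2]))
    for i in range(k):  # accumulate the contribution of each temporal offset
        out += np.einsum("oc,ctv->otv", w[:, :, i], xp[:, i:i + t, :])
    return out

def spatial_gcl(x, a, w):
    """Spatial graph convolution: mix joints with adjacency a (V, V),
    then mix channels with w (C_out, C_in)."""
    return np.einsum("oc,ctv,vu->otu", w, x, a)

def ese(x, w_fc, b_fc):
    """eSE-style gate: global average pool, one fully connected layer,
    sigmoid (assumed), then per-channel rescaling."""
    s = x.mean(axis=(1, 2))
    gate = 1.0 / (1.0 + np.exp(-(w_fc @ s + b_fc)))
    return x * gate[:, None, None]

def ee_block(x, a, p):
    """One hypothetical EE-GCN-like block."""
    relu = lambda z: np.maximum(z, 0.0)
    h = relu(spatial_gcl(x, a, p["w_s"]))        # serial spatial GCL
    feats, cur = [], h
    for w_t in p["w_t"]:                         # shorter to longer kernels
        cur = relu(temporal_conv(cur, w_t))
        feats.append(cur)
    cat = np.concatenate(feats, axis=0)          # OSA: aggregate once
    fused = np.einsum("oc,ctv->otv", p["w_osa"], cat)  # 1x1 channel reduction
    fused = ese(fused, p["w_fc"], p["b_fc"])     # channel interdependency
    par = spatial_gcl(h, a, p["w_p"])            # parallel spatial GCL
    return relu(fused + par)                     # fusion by addition (assumed)

def make_params(rng, c, kernel_sizes=(3, 5, 7)):
    """Random weights for the sketch; kernel sizes are assumed values."""
    s = 0.1
    return {
        "w_s": s * rng.standard_normal((c, c)),
        "w_t": [s * rng.standard_normal((c, c, k)) for k in kernel_sizes],
        "w_osa": s * rng.standard_normal((c, c * len(kernel_sizes))),
        "w_fc": s * rng.standard_normal((c, c)),
        "b_fc": np.zeros(c),
        "w_p": s * rng.standard_normal((c, c)),
    }

# Usage: 4 channels, 10 frames, 5 joints, identity adjacency as a placeholder.
rng = np.random.default_rng(0)
x = rng.standard_normal((4, 10, 5))
y = ee_block(x, np.eye(5), make_params(rng, 4))
print(y.shape)  # (4, 10, 5)
```

In this sketch the temporal layers are chained, so each later layer sees a longer effective temporal range, and their outputs are concatenated only once before the channel gate. A real skeleton graph would replace the identity adjacency with a normalized joint-connectivity matrix.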

Subjects

Algorithms; Movement; Neural Networks, Computer; Skeleton/physiology; Humans

Keywords

enhanced spatial; extended temporal; graph convolution network; skeleton-based action recognition

Full text: 1 | Database: MEDLINE | Main subject: Skeleton / Algorithms / Neural Networks, Computer / Movement | Limits: Humans | Language: En | Journal: Sensors (Basel) | Publication year: 2020 | Document type: Article | Country of affiliation: China
