An initial prediction and fine-tuning model based on improving GCN for 3D human motion prediction.

He, Zhiquan; Zhang, Lujun; Wang, Hengyou

Affiliation

He Z; Guangdong Key Laboratory of Intelligent Information Processing, Shenzhen, China.
Zhang L; Guangdong Multimedia Information Service Engineering Technology Research Center, Shenzhen University, Shenzhen, China.
Wang H; Guangdong Multimedia Information Service Engineering Technology Research Center, Shenzhen University, Shenzhen, China.

Front Comput Neurosci ; 17: 1145209, 2023.

Article in En | MEDLINE | ID: mdl-37089134

ABSTRACT

Human motion prediction is a fundamental problem in computer vision. Deep learning methods have achieved impressive performance on it in recent years. However, long-term prediction and human skeletal deformation remain challenging. For accurate prediction, this paper proposes a GCN-based two-stage prediction method. In the first stage, we train a prediction model. Using multiple cascaded spatial attention graph convolution layers (SAGCL) to extract features, the prediction model generates an initial motion sequence of future actions from the observed poses. The initial poses generated in the first stage often deviate from natural human body motion, for example in a motion sequence where the length of a bone changes. The task of the second stage is therefore to fine-tune the predicted poses and bring them closer to natural motion. We present a fine-tuning model consisting of multiple cascaded causally temporal-graph convolution layers (CT-GCL). We use the spatial coordinate error of the joints and the bone length error as loss functions to train the fine-tuning model. We validate our model on the Human3.6m and CMU-MoCap datasets. Extensive experiments show that the two-stage prediction method outperforms state-of-the-art methods. The limitations of the proposed method are also discussed, with a view to guiding future work.
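The two loss terms named in the abstract (joint coordinate error and bone length error) can be illustrated with a minimal sketch. This is not the authors' implementation: the 4-joint skeleton `PARENTS`, the function names, the use of mean Euclidean and mean absolute differences, and the `bone_weight` parameter are all assumptions made for illustration, since the abstract does not specify the exact formulas or how the terms are combined.

```python
import math

# Hypothetical 4-joint chain (assumption): PARENTS[i] is the parent of joint i, -1 marks the root.
PARENTS = [-1, 0, 1, 2]

def joint_error(pred, target):
    """Mean Euclidean distance between predicted and ground-truth 3D joints."""
    return sum(math.dist(p, t) for p, t in zip(pred, target)) / len(pred)

def bone_lengths(pose, parents=PARENTS):
    """Length of each bone, i.e. the distance from a joint to its parent."""
    return [math.dist(pose[j], pose[p]) for j, p in enumerate(parents) if p >= 0]

def bone_length_error(pred, target, parents=PARENTS):
    """Mean absolute difference between predicted and ground-truth bone lengths."""
    pred_len = bone_lengths(pred, parents)
    true_len = bone_lengths(target, parents)
    return sum(abs(a - b) for a, b in zip(pred_len, true_len)) / len(pred_len)

def fine_tune_loss(pred, target, bone_weight=1.0):
    """Joint error plus weighted bone length error (the weighting is an assumption)."""
    return joint_error(pred, target) + bone_weight * bone_length_error(pred, target)

# One frame: the predicted pose stretches the last bone from length 1 to length 2.
target_pose = [(0, 0, 0), (0, 1, 0), (0, 2, 0), (0, 3, 0)]
pred_pose = [(0, 0, 0), (0, 1, 0), (0, 2, 0), (0, 4, 0)]
loss = fine_tune_loss(pred_pose, target_pose)
```

In this toy frame only the last joint is displaced, so the joint term penalizes the position error while the bone term separately penalizes the stretched bone, which is the kind of skeletal deformation the second stage is described as correcting.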

Keywords

GCN-based; causally temporal; motion prediction; spatial attention; two-stage prediction method

Full text: 1 | Database: MEDLINE | Language: En | Publication year: 2023 | Document type: Article
