Your browser doesn't support javascript.
loading
Adaptive detrending to accelerate convolutional gated recurrent unit training for contextual video recognition.
Jung, Minju; Lee, Haanvid; Tani, Jun.
Afiliação
  • Jung M; School of Electrical Engineering, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea; Cognitive Neurorobotics Research Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan.
  • Lee H; School of Computing, Korea Advanced Institute of Science and Technology, Daejeon, Republic of Korea.
  • Tani J; Cognitive Neurorobotics Research Unit, Okinawa Institute of Science and Technology Graduate University, Okinawa, Japan. Electronic address: jun.tani@oist.jp.
Neural Netw ; 105: 356-370, 2018 Sep.
Article em En | MEDLINE | ID: mdl-29936360
ABSTRACT
Video image recognition has been extensively studied with rapid progress recently. However, most methods focus on short-term rather than long-term (contextual) video recognition. Convolutional recurrent neural networks (ConvRNNs) provide robust spatio-temporal information processing capabilities for contextual video recognition, but require extensive computation that slows down training. Inspired by normalization and detrending methods, in this paper we propose "adaptive detrending" (AD) for temporal normalization in order to accelerate the training of ConvRNNs, especially of convolutional gated recurrent unit (ConvGRU). For each neuron in a recurrent neural network (RNN), AD identifies the trending change within a sequence and subtracts it, removing the internal covariate shift. In experiments testing for contextual video recognition with ConvGRU, results show that (1) ConvGRU clearly outperforms feed-forward neural networks, (2) AD consistently and significantly accelerates training and improves generalization, (3) performance is further improved when AD is coupled with other normalization methods, and most importantly, (4) the more long-term contextual information is required, the more AD outperforms existing methods.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Reconhecimento Automatizado de Padrão / Aprendizado de Máquina Idioma: En Ano de publicação: 2018 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Reconhecimento Automatizado de Padrão / Aprendizado de Máquina Idioma: En Ano de publicação: 2018 Tipo de documento: Article