L1-Norm Batch Normalization for Efficient Training of Deep Neural Networks.
IEEE Trans Neural Netw Learn Syst; 30(7): 2043-2051, 2019 Jul.
Article in English | MEDLINE | ID: mdl-30418924
Batch normalization (BN) has recently become a standard component for accelerating and improving the training of deep neural networks (DNNs). However, BN brings in additional calculations, consumes more memory, and significantly slows down the training iteration. Furthermore, the nonlinear square and sqrt operations in the normalization process impede low bit-width quantization techniques, which draw much attention from the deep learning hardware community. In this paper, we propose an L1-norm BN (L1BN) with only linear operations in both forward and backward propagations during training. L1BN is approximately equivalent to the conventional L2-norm BN (L2BN) by multiplying a scaling factor equal to √(π/2). Experiments on various convolutional neural networks and generative adversarial networks reveal that L1BN can maintain the same performance and convergence rate as L2BN but with higher computational efficiency. In real application-specific integrated circuit (ASIC) synthesis with reduced resources, L1BN achieves a 25% speedup and 37% energy saving compared with the original L2BN. Our hardware-friendly normalization method not only surpasses L2BN in speed but also simplifies the design of deep learning accelerators. Last but not least, L1BN promises a fully quantized training of DNNs, which empowers future artificial intelligence applications on mobile devices with transfer and continual learning capability.
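The substitution described in the abstract can be illustrated with a brief sketch. This is a hedged NumPy illustration, not the authors' released implementation; the function name, array shapes, and epsilon handling are assumptions. It replaces the per-feature L2 variance and square-root of standard BN with the mean absolute deviation, rescaled by √(π/2) so that it approximates the conventional standard deviation for roughly Gaussian activations.

import numpy as np

SQRT_PI_OVER_2 = np.sqrt(np.pi / 2.0)  # the (pi/2)^(1/2) scaling factor named in the abstract

def l1_batch_norm(x, gamma, beta, eps=1e-5):
    """Hypothetical forward pass of L1-norm batch normalization over the batch axis.

    x: array of shape (batch, features); gamma, beta: learned per-feature scale and shift.
    """
    mu = x.mean(axis=0)                # per-feature batch mean
    mad = np.abs(x - mu).mean(axis=0)  # L1 statistic: mean absolute deviation
    sigma = SQRT_PI_OVER_2 * mad       # approximates the L2 standard deviation
    x_hat = (x - mu) / (sigma + eps)   # normalize without data-dependent square/sqrt
    return gamma * x_hat + beta        # learned scale and shift, as in standard BN

# Example usage with assumed shapes:
# x = np.random.randn(32, 64)
# y = l1_batch_norm(x, gamma=np.ones(64), beta=np.zeros(64))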
Full text:
1
Databases:
MEDLINE
Main subject:
Pattern Recognition, Automated
/
Neural Networks, Computer
/
Deep Learning
Study type:
Prognostic studies
Limits:
Humans
Language:
En
Journal:
IEEE Trans Neural Netw Learn Syst
Publication year:
2019
Document type:
Article