Enhancing deep neural network training efficiency and performance through linear prediction.

Ying, Hejie; Song, Mengmeng; Tang, Yaohong; Xiao, Shungen; Xiao, Zimin

Ying, Hejie; Song, Mengmeng; Tang, Yaohong; Xiao, Shungen; Xiao, Zimin.

Afiliação

Ying H; Ningde Normal University, No. 1 College Road, Ningde, 352101, FuJian, China.
Song M; New Energy Vehicle Motor Industry Technology Development Base, Ningde Normal University, No. 1 College Road, Ningde, 352101, FuJian, China.
Tang Y; Ningde Normal University, No. 1 College Road, Ningde, 352101, FuJian, China. t2135@ndnu.edu.cn.
Xiao S; New Energy Vehicle Motor Industry Technology Development Base, Ningde Normal University, No. 1 College Road, Ningde, 352101, FuJian, China. t2135@ndnu.edu.cn.
Xiao Z; Ningde Normal University, No. 1 College Road, Ningde, 352101, FuJian, China.

Sci Rep ; 14(1): 15197, 2024 Jul 02.

Article em En | MEDLINE | ID: mdl-38956088

ABSTRACT

ABSTRACT

Deep neural networks have achieved remarkable success in various fields. However, training an effective deep neural network still poses challenges. This paper aims to propose a method to optimize the training effectiveness of deep neural networks, with the goal of improving their performance. Firstly, based on the observation that parameters (weights and bias) of deep neural network change in certain rules during training process, the potential of parameters prediction for improving training efficiency is discovered. Secondly, the potential of parameters prediction to improve the performance of deep neural network by noise injection introduced by prediction errors is revealed. And then, considering the limitations comprehensively, a deep neural network Parameters Linear Prediction method is exploit. Finally, performance and hyperparameter sensitivity validations are carried out on some representative backbones. Experimental results show that by employing proposed Parameters Linear Prediction method, as opposed to SGD, has led to an approximate 1% increase in accuracy for optimal model, along with a reduction of about 0.01 in top-1/top-5 error. Moreover, it also exhibits stable performance under various hyperparameter settings, shown the effectiveness of the proposed method and validated its capacity in enhancing network's training efficiency and performance.

Palavras-chave

DNN training; Linear prediction; Parameters changing law; Parameters prediction

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo

Imprimir

XML

PubMed Links

Buscar no Google

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Ano de publicação: 2024 Tipo de documento: Article