Your browser doesn't support javascript.
loading
Network properties determine neural network performance.
Jiang, Chunheng; Huang, Zhenhan; Pedapati, Tejaswini; Chen, Pin-Yu; Sun, Yizhou; Gao, Jianxi.
Afiliação
  • Jiang C; Network Science and Technology Center, Rensselaer Polytechnic Institute, Troy, NY, USA.
  • Huang Z; Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, USA.
  • Pedapati T; Network Science and Technology Center, Rensselaer Polytechnic Institute, Troy, NY, USA.
  • Chen PY; Department of Computer Science, Rensselaer Polytechnic Institute, Troy, NY, USA.
  • Sun Y; IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA.
  • Gao J; IBM Thomas J. Watson Research Center, Yorktown Heights, NY, USA.
Nat Commun ; 15(1): 5718, 2024 Jul 08.
Article em En | MEDLINE | ID: mdl-38977665
ABSTRACT
Machine learning influences numerous aspects of modern society, empowers new technologies, from Alphago to ChatGPT, and increasingly materializes in consumer products such as smartphones and self-driving cars. Despite the vital role and broad applications of artificial neural networks, we lack systematic approaches, such as network science, to understand their underlying mechanism. The difficulty is rooted in many possible model configurations, each with different hyper-parameters and weighted architectures determined by noisy data. We bridge the gap by developing a mathematical framework that maps the neural network's performance to the network characters of the line graph governed by the edge dynamics of stochastic gradient descent differential equations. This framework enables us to derive a neural capacitance metric to universally capture a model's generalization capability on a downstream task and predict model performance using only early training results. The numerical results on 17 pre-trained ImageNet models across five benchmark datasets and one NAS benchmark indicate that our neural capacitance metric is a powerful indicator for model selection based only on early training results and is more efficient than state-of-the-art methods.

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Nat Commun Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Idioma: En Revista: Nat Commun Ano de publicação: 2024 Tipo de documento: Article