Your browser doesn't support javascript.
loading
Optimized model architectures for deep learning on genomic data.
Gündüz, Hüseyin Anil; Mreches, René; Moosbauer, Julia; Robertson, Gary; To, Xiao-Yin; Franzosa, Eric A; Huttenhower, Curtis; Rezaei, Mina; McHardy, Alice C; Bischl, Bernd; Münch, Philipp C; Binder, Martin.
Afiliação
  • Gündüz HA; Department of Statistics, LMU Munich, Munich, Germany.
  • Mreches R; Munich Center for Machine Learning, Munich, Germany.
  • Moosbauer J; Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, 38124, Braunschweig, Germany.
  • Robertson G; Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany.
  • To XY; Department of Statistics, LMU Munich, Munich, Germany.
  • Franzosa EA; Munich Center for Machine Learning, Munich, Germany.
  • Huttenhower C; Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, 38124, Braunschweig, Germany.
  • Rezaei M; Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany.
  • McHardy AC; Department of Statistics, LMU Munich, Munich, Germany.
  • Bischl B; Munich Center for Machine Learning, Munich, Germany.
  • Münch PC; Department for Computational Biology of Infection Research, Helmholtz Center for Infection Research, 38124, Braunschweig, Germany.
  • Binder M; Braunschweig Integrated Centre of Systems Biology (BRICS), Technische Universität Braunschweig, Braunschweig, Germany.
Commun Biol ; 7(1): 516, 2024 Apr 30.
Article em En | MEDLINE | ID: mdl-38693292
ABSTRACT
The success of deep learning in various applications depends on task-specific architecture design choices, including the types, hyperparameters, and number of layers. In computational biology, there is no consensus on the optimal architecture design, and decisions are often made using insights from more well-established fields such as computer vision. These may not consider the domain-specific characteristics of genome sequences, potentially limiting performance. Here, we present GenomeNet-Architect, a neural architecture design framework that automatically optimizes deep learning models for genome sequence data. It optimizes the overall layout of the architecture, with a search space specifically designed for genomics. Additionally, it optimizes hyperparameters of individual layers and the model training procedure. On a viral classification task, GenomeNet-Architect reduced the read-level misclassification rate by 19%, with 67% faster inference and 83% fewer parameters, and achieved similar contig-level accuracy with ~100 times fewer parameters compared to the best-performing deep learning baselines.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genômica / Aprendizado Profundo Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Genômica / Aprendizado Profundo Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article