Your browser doesn't support javascript.
loading
Pangenome graph layout by Path-Guided Stochastic Gradient Descent.
Heumos, Simon; Guarracino, Andrea; Schmelzle, Jan-Niklas M; Li, Jiajie; Zhang, Zhiru; Hagmann, Jörg; Nahnsen, Sven; Prins, Pjotr; Garrison, Erik.
Afiliação
  • Heumos S; Quantitative Biology Center (QBiC), University of Tübingen, 72076 Tübingen, Germany.
  • Guarracino A; Biomedical Data Science, Department of Computer Science, University of Tübingen, 72076 Tübingen, Germany.
  • Schmelzle JM; M3 Research Center, University Hospital Tübingen, 72076 Tübingen, Germany.
  • Li J; Institute for Bioinformatics and Medical Informatics (IBMI), University of Tübingen, 72076 Tübingen, Germany.
  • Zhang Z; Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, United States.
  • Hagmann J; Genomics Research Centre, Human Technopole, 20157 Milan, Italy.
  • Nahnsen S; Department of Computer Engineering, School of Computation, Information and Technology (CIT), Technical University of Munich, 80333 Munich, Germany.
  • Prins P; School of Electrical and Computer Engineering, Cornell University, Ithaca, NY 14853, United States.
  • Garrison E; School of Electrical and Computer Engineering, Cornell University, Ithaca, NY 14853, United States.
Bioinformatics ; 40(7)2024 07 01.
Article em En | MEDLINE | ID: mdl-38960860
ABSTRACT
MOTIVATION The increasing availability of complete genomes demands for models to study genomic variability within entire populations. Pangenome graphs capture the full genomic similarity and diversity between multiple genomes. In order to understand them, we need to see them. For visualization, we need a human-readable graph layout a graph embedding in low (e.g. two) dimensional depictions. Due to a pangenome graph's potential excessive size, this is a significant challenge.

RESULTS:

In response, we introduce a novel graph layout algorithm the Path-Guided Stochastic Gradient Descent (PG-SGD). PG-SGD uses the genomes, represented in the pangenome graph as paths, as an embedded positional system to sample genomic distances between pairs of nodes. This avoids the quadratic cost seen in previous versions of graph drawing by SGD. We show that our implementation efficiently computes the low-dimensional layouts of gigabase-scale pangenome graphs, unveiling their biological features. AVAILABILITY AND IMPLEMENTATION We integrated PG-SGD in ODGI which is released as free software under the MIT open source license. Source code is available at https//github.com/pangenome/odgi.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Software Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Alemanha

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Software Limite: Humans Idioma: En Revista: Bioinformatics Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2024 Tipo de documento: Article País de afiliação: Alemanha