Your browser doesn't support javascript.
loading
Pangenome graph construction from genome alignments with Minigraph-Cactus.
Hickey, Glenn; Monlong, Jean; Ebler, Jana; Novak, Adam M; Eizenga, Jordan M; Gao, Yan; Marschall, Tobias; Li, Heng; Paten, Benedict.
Afiliación
  • Hickey G; UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA. glenn.hickey@gmail.com.
  • Monlong J; UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.
  • Ebler J; Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
  • Novak AM; Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
  • Eizenga JM; UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.
  • Gao Y; UC Santa Cruz Genomics Institute, University of California, Santa Cruz, Santa Cruz, CA, USA.
  • Li H; Institute for Medical Biometry and Bioinformatics, Medical Faculty, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
  • Paten B; Center for Digital Medicine, Heinrich Heine University Düsseldorf, Düsseldorf, Germany.
Nat Biotechnol ; 42(4): 663-673, 2024 Apr.
Article en En | MEDLINE | ID: mdl-37165083
ABSTRACT
Pangenome references address biases of reference genomes by storing a representative set of diverse haplotypes and their alignment, usually as a graph. Alternate alleles determined by variant callers can be used to construct pangenome graphs, but advances in long-read sequencing are leading to widely available, high-quality phased assemblies. Constructing a pangenome graph directly from assemblies, as opposed to variant calls, leverages the graph's ability to represent variation at different scales. Here we present the Minigraph-Cactus pangenome pipeline, which creates pangenomes directly from whole-genome alignments, and demonstrate its ability to scale to 90 human haplotypes from the Human Pangenome Reference Consortium. The method builds graphs containing all forms of genetic variation while allowing use of current mapping and genotyping tools. We measure the effect of the quality and completeness of reference genomes used for analysis within the pangenomes and show that using the CHM13 reference from the Telomere-to-Telomere Consortium improves the accuracy of our methods. We also demonstrate construction of a Drosophila melanogaster pangenome.
Asunto(s)

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Drosophila melanogaster / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Animals / Humans Idioma: En Revista: Nat Biotechnol Asunto de la revista: BIOTECNOLOGIA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Drosophila melanogaster / Secuenciación de Nucleótidos de Alto Rendimiento Límite: Animals / Humans Idioma: En Revista: Nat Biotechnol Asunto de la revista: BIOTECNOLOGIA Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos