Your browser doesn't support javascript.
loading
Pangenome graphs improve the analysis of structural variants in rare genetic diseases.
Groza, Cristian; Schwendinger-Schreck, Carl; Cheung, Warren A; Farrow, Emily G; Thiffault, Isabelle; Lake, Juniper; Rizzo, William B; Evrony, Gilad; Curran, Tom; Bourque, Guillaume; Pastinen, Tomi.
Afiliação
  • Groza C; Quantitative Life Sciences, McGill University, Montréal, QC, Canada.
  • Schwendinger-Schreck C; Genomic Medicine Center, Children's Mercy Hospital and Research Institute, KC, MO, USA.
  • Cheung WA; Genomic Medicine Center, Children's Mercy Hospital and Research Institute, KC, MO, USA.
  • Farrow EG; Genomic Medicine Center, Children's Mercy Hospital and Research Institute, KC, MO, USA.
  • Thiffault I; Genomic Medicine Center, Children's Mercy Hospital and Research Institute, KC, MO, USA.
  • Lake J; Pacific Biosciences, Menlo Park, CA, USA.
  • Rizzo WB; Child Health Research Institute, Department of Pediatrics, Nebraska Medical Center, Omaha, NE, USA.
  • Evrony G; Center for Human Genetics and Genomics, Department of Pediatrics, Neuroscience & Physiology, New York University Grossman School of Medicine, New York, NY, USA.
  • Curran T; Children's Mercy Research Institute, Kansas City, MO, USA.
  • Bourque G; Canadian Center for Computational Genomics, McGill University, Montréal, QC, Canada. guil.bourque@mcgill.ca.
  • Pastinen T; Department of Human Genetics, McGill University, Montréal, QC, Canada. guil.bourque@mcgill.ca.
Nat Commun ; 15(1): 657, 2024 Jan 22.
Article em En | MEDLINE | ID: mdl-38253606
ABSTRACT
Rare DNA alterations that cause heritable diseases are only partially resolvable by clinical next-generation sequencing due to the difficulty of detecting structural variation (SV) in all genomic contexts. Long-read, high fidelity genome sequencing (HiFi-GS) detects SVs with increased sensitivity and enables assembling personal and graph genomes. We leverage standard reference genomes, public assemblies (n = 94) and a large collection of HiFi-GS data from a rare disease program (Genomic Answers for Kids, GA4K, n = 574 assemblies) to build a graph genome representing a unified SV callset in GA4K, identify common variation and prioritize SVs that are more likely to cause genetic disease (MAF < 0.01). Using graphs, we obtain a higher level of reproducibility than the standard reference approach. We observe over 200,000 SV alleles unique to GA4K, including nearly 1000 rare variants that impact coding sequence. With improved specificity for rare SVs, we isolate 30 candidate SVs in phenotypically prioritized genes, including known disease SVs. We isolate a novel diagnostic SV in KMT2E, demonstrating use of personal assemblies coupled with pangenome graphs for rare disease genomics. The community may interrogate our pangenome with additional assemblies to discover new SVs within the allele frequency spectrum relevant to genetic diseases.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genômica / Doenças Raras Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Genômica / Doenças Raras Idioma: En Ano de publicação: 2024 Tipo de documento: Article