Your browser doesn't support javascript.
loading
Critical assessment of variant prioritization methods for rare disease diagnosis within the Rare Genomes Project.
Stenton, Sarah L; O'Leary, Melanie; Lemire, Gabrielle; VanNoy, Grace E; DiTroia, Stephanie; Ganesh, Vijay S; Groopman, Emily; O'Heir, Emily; Mangilog, Brian; Osei-Owusu, Ikeoluwa; Pais, Lynn S; Serrano, Jillian; Singer-Berk, Moriel; Weisburd, Ben; Wilson, Michael; Austin-Tse, Christina; Abdelhakim, Marwa; Althagafi, Azza; Babbi, Giulia; Bellazzi, Riccardo; Bovo, Samuele; Carta, Maria Giulia; Casadio, Rita; Coenen, Pieter-Jan; De Paoli, Federica; Floris, Matteo; Gajapathy, Manavalan; Hoehndorf, Robert; Jacobsen, Julius O B; Joseph, Thomas; Kamandula, Akash; Katsonis, Panagiotis; Kint, Cyrielle; Lichtarge, Olivier; Limongelli, Ivan; Lu, Yulan; Magni, Paolo; Mamidi, Tarun Karthik Kumar; Martelli, Pier Luigi; Mulargia, Marta; Nicora, Giovanna; Nykamp, Keith; Pejaver, Vikas; Peng, Yisu; Pham, Thi Hong Cam; Podda, Maurizio S; Rao, Aditya; Rizzo, Ettore; Saipradeep, Vangala G; Savojardo, Castrense.
Afiliación
  • Stenton SL; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • O'Leary M; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Lemire G; Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
  • VanNoy GE; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • DiTroia S; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Ganesh VS; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Groopman E; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • O'Heir E; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Mangilog B; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Osei-Owusu I; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Pais LS; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Serrano J; Department of Neurology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
  • Singer-Berk M; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Weisburd B; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Wilson M; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Austin-Tse C; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Abdelhakim M; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Althagafi A; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Babbi G; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Bellazzi R; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Bovo S; Division of Genetics and Genomics, Boston Children's Hospital, Harvard Medical School, Boston, MA, USA.
  • Carta MG; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Casadio R; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Coenen PJ; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • De Paoli F; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Floris M; Program in Medical and Population Genetics, Broad Institute of MIT and Harvard, Cambridge, MA, USA.
  • Gajapathy M; Center for Genomic Medicine, Massachusetts General Hospital, Boston, MA, USA.
  • Hoehndorf R; Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.
  • Jacobsen JOB; Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.
  • Joseph T; Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.
  • Kamandula A; Computer Science Department, College of Computers and Information Technology, Taif University, Taif, Saudi Arabia.
  • Katsonis P; Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy.
  • Kint C; enGenome Srl, Pavia, Italy.
  • Lichtarge O; Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
  • Limongelli I; Department of Agricultural and Food Sciences, University of Bologna, Bologna, Italy.
  • Lu Y; Department of Electrical, Computer and Biomedical Engineering, University of Pavia, Pavia, Italy.
  • Magni P; Biocomputing Group, Department of Pharmacy and Biotechnology, University of Bologna, Bologna, Italy.
  • Mamidi TKK; Invitae, San Francisco, California, USA.
  • Martelli PL; enGenome Srl, Pavia, Italy.
  • Mulargia M; Department of Biomedical Sciences, University of Sassari, Sassari, Italy.
  • Nicora G; Center for Computational Genomics and Data Science, The University of Alabama at Birmingham, Birmingham, AL, USA.
  • Nykamp K; Department of Genetics, Heersink School of Medicine, The University of Alabama at Birmingham, Birmingham, AL, USA.
  • Pejaver V; Hugh Kaul Precision Medicine Institute, The University of Alabama at Birmingham, Birmingham, AL, USA.
  • Peng Y; Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.
  • Pham THC; Computer, Electrical and Mathematical Sciences & Engineering Division (CEMSE), Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, Saudi Arabia.
  • Podda MS; William Harvey Research Institute, Barts & The London School of Medicine and Dentistry, Queen Mary University of London, Charterhouse Square, London, UK.
  • Rao A; TCS Research, Tata Consultancy Services (TCS) Ltd, Deccan Park, Madhapur, Hyderabad, India.
  • Rizzo E; Khoury College of Computer Sciences, Northeastern University, Boston, MA, USA.
  • Saipradeep VG; Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, TX, USA.
  • Savojardo C; Invitae, San Francisco, California, USA.
medRxiv ; 2023 Aug 04.
Article en En | MEDLINE | ID: mdl-37577678
ABSTRACT

Background:

A major obstacle faced by rare disease families is obtaining a genetic diagnosis. The average "diagnostic odyssey" lasts over five years, and causal variants are identified in under 50%. The Rare Genomes Project (RGP) is a direct-to-participant research study on the utility of genome sequencing (GS) for diagnosis and gene discovery. Families are consented for sharing of sequence and phenotype data with researchers, allowing development of a Critical Assessment of Genome Interpretation (CAGI) community challenge, placing variant prioritization models head-to-head in a real-life clinical diagnostic setting.

Methods:

Predictors were provided a dataset of phenotype terms and variant calls from GS of 175 RGP individuals (65 families), including 35 solved training set families, with causal variants specified, and 30 test set families (14 solved, 16 unsolved). The challenge tasked teams with identifying the causal variants in as many test set families as possible. Ranked variant predictions were submitted with estimated probability of causal relationship (EPCR) values. Model performance was determined by two metrics, a weighted score based on rank position of true positive causal variants and maximum F-measure, based on precision and recall of causal variants across EPCR thresholds.

Results:

Sixteen teams submitted predictions from 52 models, some with manual review incorporated. Top performing teams recalled the causal variants in up to 13 of 14 solved families by prioritizing high quality variant calls that were rare, predicted deleterious, segregating correctly, and consistent with reported phenotype. In unsolved families, newly discovered diagnostic variants were returned to two families following confirmatory RNA sequencing, and two prioritized novel disease gene candidates were entered into Matchmaker Exchange. In one example, RNA sequencing demonstrated aberrant splicing due to a deep intronic indel in ASNS, identified in trans with a frameshift variant, in an unsolved proband with phenotype overlap with asparagine synthetase deficiency.

Conclusions:

By objective assessment of variant predictions, we provide insights into current state-of-the-art algorithms and platforms for genome sequencing analysis for rare disease diagnosis and explore areas for future optimization. Identification of diagnostic variants in unsolved families promotes synergy between researchers with clinical and computational expertise as a means of advancing the field of clinical genome interpretation.
Palabras clave

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Diagnostic_studies / Guideline / Prognostic_studies Idioma: En Revista: MedRxiv Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Tipo de estudio: Diagnostic_studies / Guideline / Prognostic_studies Idioma: En Revista: MedRxiv Año: 2023 Tipo del documento: Article País de afiliación: Estados Unidos
...