Your browser doesn't support javascript.
loading
VarSight: prioritizing clinically reported variants with binary classification algorithms.
Holt, James M; Wilk, Brandon; Birch, Camille L; Brown, Donna M; Gajapathy, Manavalan; Moss, Alexander C; Sosonkina, Nadiya; Wilk, Melissa A; Anderson, Julie A; Harris, Jeremy M; Kelly, Jacob M; Shaterferdosian, Fariba; Uno-Antonison, Angelina E; Weborg, Arthur; Worthey, Elizabeth A.
Afiliação
  • Holt JM; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA. jholt@hudsonalpha.org.
  • Wilk B; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Birch CL; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Brown DM; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Gajapathy M; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Moss AC; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Sosonkina N; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Wilk MA; University of Alabama at Birmingham, Department of Genetics, 720 20th Street South, Birmingham, 35294, USA.
  • Anderson JA; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Harris JM; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Kelly JM; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Shaterferdosian F; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Uno-Antonison AE; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
  • Weborg A; HudsonAlpha Institute for Biotechnology, Software Development and Informatics, 601 Genome Way, Huntsville, 35806, USA.
BMC Bioinformatics ; 20(1): 496, 2019 Oct 15.
Article em En | MEDLINE | ID: mdl-31615419
BACKGROUND: When applying genomic medicine to a rare disease patient, the primary goal is to identify one or more genomic variants that may explain the patient's phenotypes. Typically, this is done through annotation, filtering, and then prioritization of variants for manual curation. However, prioritization of variants in rare disease patients remains a challenging task due to the high degree of variability in phenotype presentation and molecular source of disease. Thus, methods that can identify and/or prioritize variants to be clinically reported in the presence of such variability are of critical importance. METHODS: We tested the application of classification algorithms that ingest variant annotations along with phenotype information for predicting whether a variant will ultimately be clinically reported and returned to a patient. To test the classifiers, we performed a retrospective study on variants that were clinically reported to 237 patients in the Undiagnosed Diseases Network. RESULTS: We treated the classifiers as variant prioritization systems and compared them to four variant prioritization algorithms and two single-measure controls. We showed that the trained classifiers outperformed all other tested methods with the best classifiers ranking 72% of all reported variants and 94% of reported pathogenic variants in the top 20. CONCLUSIONS: We demonstrated how freely available binary classification algorithms can be used to prioritize variants even in the presence of real-world variability. Furthermore, these classifiers outperformed all other tested methods, suggesting that they may be well suited for working with real rare disease patient datasets.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Genômica / Doenças Raras / Doenças Genéticas Inatas / Mutação Tipo de estudo: Guideline / Observational_studies / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Algoritmos / Genômica / Doenças Raras / Doenças Genéticas Inatas / Mutação Tipo de estudo: Guideline / Observational_studies / Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2019 Tipo de documento: Article