Assessing performance of pathogenicity predictors using clinically relevant variant datasets.

Gunning, Adam C; Fryer, Verity; Fasham, James; Crosby, Andrew H; Ellard, Sian; Baple, Emma L; Wright, Caroline F

Affiliations

Gunning AC; College of Medicine and Health, University of Exeter Medical School Institute of Biomedical and Clinical Science, Exeter, Devon, UK.
Fryer V; Exeter Genomics Laboratory, Royal Devon & Exeter NHS Foundation Trust, Exeter, UK.
Fasham J; Exeter Genomics Laboratory, Royal Devon & Exeter NHS Foundation Trust, Exeter, UK.
Crosby AH; College of Medicine and Health, University of Exeter Medical School Institute of Biomedical and Clinical Science, Exeter, Devon, UK.
Ellard S; College of Medicine and Health, University of Exeter Medical School Institute of Biomedical and Clinical Science, Exeter, Devon, UK.
Baple EL; Exeter Genomics Laboratory, Royal Devon & Exeter NHS Foundation Trust, Exeter, UK.
Wright CF; College of Medicine and Health, University of Exeter Medical School Institute of Biomedical and Clinical Science, Exeter, Devon, UK.

J Med Genet; 58(8): 547-555, 2021 Aug.

Article in English | MEDLINE | ID: mdl-32843488

Abstract

BACKGROUND: Pathogenicity predictors are integral to genomic variant interpretation but, despite their widespread use, their performance has not been independently validated on a clinically relevant dataset.

METHODS: We derive two validation datasets: an 'open' dataset containing variants extracted from publicly available databases, similar to those commonly used in previous benchmarking exercises, and a 'clinically representative' dataset containing variants identified through research/diagnostic exome and panel sequencing. Using these datasets, we evaluate the performance of three recent meta-predictors (REVEL, GAVIN and ClinPred) and compare it against two commonly used in silico tools, SIFT and PolyPhen-2.

RESULTS: Although the newer meta-predictors outperform the older tools, the performance of all pathogenicity predictors is substantially lower in the clinically representative dataset. On this clinically representative dataset, REVEL performed best, with an area under the receiver operating characteristic curve of 0.82. A concordance-based approach, which relies on a consensus of multiple tools, reduces performance because of both discordance between tools and false concordance, where tools share the same misclassification. Analysis of tool feature usage may give insight into tool performance and misclassification.

CONCLUSION: Our results support the adoption of meta-predictors over traditional in silico tools, but do not support a consensus-based approach as in current practice.
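As an illustration of the two evaluation ideas named in the abstract, the following minimal Python sketch computes an area under the ROC curve from predictor scores and derives a consensus call from two tools. It is not the authors' pipeline: the scores, labels, tool names (`tool_a`, `tool_b`) and the 0.5 cut-off are all invented for illustration, and the AUROC is computed with the standard pairwise (Mann-Whitney) formulation.

```python
from itertools import product


def auroc(scores, labels):
    """Area under the ROC curve via pairwise comparison (Mann-Whitney U).

    labels: 1 = pathogenic, 0 = benign; a higher score means "more pathogenic".
    """
    pos = [s for s, y in zip(scores, labels) if y == 1]
    neg = [s for s, y in zip(scores, labels) if y == 0]
    if not pos or not neg:
        raise ValueError("need at least one pathogenic and one benign variant")
    # Each (pathogenic, benign) pair scores 1 if ranked correctly, 0.5 on a tie.
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0 for p, n in product(pos, neg)
    )
    return wins / (len(pos) * len(neg))


def consensus_calls(calls_per_tool):
    """Per variant: the shared call if every tool agrees, otherwise None."""
    return [c[0] if len(set(c)) == 1 else None for c in zip(*calls_per_tool)]


# Toy data, invented for illustration (not taken from the study).
labels = [1, 1, 1, 0, 0, 0]
tool_a = [0.9, 0.8, 0.4, 0.6, 0.2, 0.1]
tool_b = [0.7, 0.3, 0.6, 0.8, 0.4, 0.2]

auc_a = auroc(tool_a, labels)
auc_b = auroc(tool_b, labels)

# Hypothetical cut-off: scores of 0.5 or above are called pathogenic.
THRESHOLD = 0.5
calls_a = [int(s >= THRESHOLD) for s in tool_a]
calls_b = [int(s >= THRESHOLD) for s in tool_b]
consensus = consensus_calls([calls_a, calls_b])

print(f"AUROC tool_a={auc_a:.3f}, tool_b={auc_b:.3f}")
print("consensus:", consensus)
```

In this toy example the two tools disagree on the second and third variants, which therefore receive no consensus call (discordance), and both call the fourth, benign variant pathogenic (false concordance). These are the two failure modes the abstract attributes to concordance-based interpretation.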

Subjects

Computational Biology/methods; Genetic Variation/genetics; Genomics/methods; Exome/genetics; Humans; ROC Curve

Keywords

genetic testing; genetic variation; genetics; genomics; human genetics

Full text: 1 Collections: 01-international Database: MEDLINE Main subject: Genetic Variation / Computational Biology / Genomics Study type: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: J Med Genet Publication year: 2021 Document type: Article Country of publication: United Kingdom