Búsqueda | Portal Regional de la BVS

Performance of in silico tools for the evaluation of p16INK4a (CDKN2A) variants in CAGI.

Carraro, Marco; Minervini, Giovanni; Giollo, Manuel; Bromberg, Yana; Capriotti, Emidio; Casadio, Rita; Dunbrack, Roland; Elefanti, Lisa; Fariselli, Pietro; Ferrari, Carlo; Gough, Julian; Katsonis, Panagiotis; Leonardi, Emanuela; Lichtarge, Olivier; Menin, Chiara; Martelli, Pier Luigi; Niroula, Abhishek; Pal, Lipika R; Repo, Susanna; Scaini, Maria Chiara; Vihinen, Mauno; Wei, Qiong; Xu, Qifang; Yang, Yuedong; Yin, Yizhou; Zaucha, Jan; Zhao, Huiying; Zhou, Yaoqi; Brenner, Steven E; Moult, John; Tosatto, Silvio C E.

Hum Mutat ; 38(9): 1042-1050, 2017 09.

Artículo en Inglés | MEDLINE | ID: mdl-28440912

RESUMEN

Correct phenotypic interpretation of variants of unknown significance for cancer-associated genes is a diagnostic challenge as genetic screenings gain in popularity in the next-generation sequencing era. The Critical Assessment of Genome Interpretation (CAGI) experiment aims to test and define the state of the art of genotype-phenotype interpretation. Here, we present the assessment of the CAGI p16INK4a challenge. Participants were asked to predict the effect on cellular proliferation of 10 variants for the p16INK4a tumor suppressor, a cyclin-dependent kinase inhibitor encoded by the CDKN2A gene. Twenty-two pathogenicity predictors were assessed with a variety of accuracy measures for reliability in a medical context. Different assessment measures were combined in an overall ranking to provide more robust results. The R scripts used for assessment are publicly available from a GitHub repository for future use in similar assessment exercises. Despite a limited test-set size, our findings show a variety of results, with some methods performing significantly better. Methods combining different strategies frequently outperform simpler approaches. The best predictor, Yang&Zhou lab, uses a machine learning method combining an empirical energy function measuring protein stability with an evolutionary conservation term. The p16INK4a challenge highlights how subtle structural effects can neutralize otherwise deleterious variants.

Asunto(s)

Biología Computacional/métodos , Inhibidor p18 de las Quinasas Dependientes de la Ciclina/genética , Variación Genética , Línea Celular Tumoral , Proliferación Celular , Simulación por Computador , Inhibidor p16 de la Quinasa Dependiente de Ciclina , Inhibidor p18 de las Quinasas Dependientes de la Ciclina/química , Bases de Datos Genéticas , Predisposición Genética a la Enfermedad , Humanos , Aprendizaje Automático , Estabilidad Proteica

Prediction of coordination number and relative solvent accessibility in proteins.

Pollastri, Gianluca; Baldi, Pierre; Fariselli, Pietro; Casadio, Rita.

Proteins ; 47(2): 142-53, 2002 May 01.

Artículo en Inglés | MEDLINE | ID: mdl-11933061

RESUMEN

Knowing the coordination number and relative solvent accessibility of all the residues in a protein is crucial for deriving constraints useful in modeling protein folding and protein structure and in scoring remote homology searches. We develop ensembles of bidirectional recurrent neural network architectures to improve the state of the art in both contact and accessibility prediction, leveraging a large corpus of curated data together with evolutionary information. The ensembles are used to discriminate between two different states of residue contacts or relative solvent accessibility, higher or lower than a threshold determined by the average value of the residue distribution or the accessibility cutoff. For coordination numbers, the ensemble achieves performances ranging within 70.6-73.9% depending on the radius adopted to discriminate contacts (6A-12A). These performances represent gains of 16-20% over the baseline statistical predictor, always assigning an amino acid to the largest class, and are 4-7% better than any previous method. A combination of different radius predictors further improves performance. For accessibility thresholds in the relevant 15-30% range, the ensemble consistently achieves a performance above 77%, which is 10-16% above the baseline prediction and better than other existing predictors, by up to several percentage points. For both problems, we quantify the improvement due to evolutionary information in the form of PSI-BLAST-generated profiles over BLAST profiles. The prediction programs are implemented in the form of two web servers, CONpro and ACCpro, available at http://promoter.ics.uci.edu/BRNN-PRED/.

Asunto(s)

Redes Neurales de la Computación , Proteínas/química , Aminoácidos/química , Animales , Bases de Datos de Proteínas , Evolución Molecular , Predicción , Modelos Estadísticos , Estructura Secundaria de Proteína , Proteínas/genética , Reproducibilidad de los Resultados , Solventes/química

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA