Robust classification of protein variation using structural modelling and large-scale data integration.

Baugh, Evan H; Simmons-Edler, Riley; Müller, Christian L; Alford, Rebecca F; Volfovsky, Natalia; Lash, Alex E; Bonneau, Richard

Baugh, Evan H; Simmons-Edler, Riley; Müller, Christian L; Alford, Rebecca F; Volfovsky, Natalia; Lash, Alex E; Bonneau, Richard.

Afiliación

Baugh EH; Department of Biology, New York University, New York, NY 10003, USA New York University Center for Genomics and Systems Biology, New York, NY 10003, USA.
Simmons-Edler R; Department of Biology, New York University, New York, NY 10003, USA New York University Center for Genomics and Systems Biology, New York, NY 10003, USA.
Müller CL; New York University Center for Genomics and Systems Biology, New York, NY 10003, USA Computer Science Department, New York University, New York, NY 10003, USA Simons Center for Data Analysis, Simons Foundation, New York, NY 10010, USA.
Alford RF; Carnegie Mellon University Department of Chemistry, 5000 Forbes Ave, Pittsburgh, PA 15289, USA Commack High School, Commack, NY 11725, USA.
Volfovsky N; Simons Foundation, New York, NY 10010, USA.
Lash AE; Simons Foundation, New York, NY 10010, USA.
Bonneau R; Department of Biology, New York University, New York, NY 10003, USA New York University Center for Genomics and Systems Biology, New York, NY 10003, USA Computer Science Department, New York University, New York, NY 10003, USA Simons Center for Data Analysis, Simons Foundation, New York, NY 10010, U

Nucleic Acids Res ; 44(6): 2501-13, 2016 Apr 07.

Article en En | MEDLINE | ID: mdl-26926108

RESUMEN

Existing methods for interpreting protein variation focus on annotating mutation pathogenicity rather than detailed interpretation of variant deleteriousness and frequently use only sequence-based or structure-based information. We present VIPUR, a computational framework that seamlessly integrates sequence analysis and structural modelling (using the Rosetta protein modelling suite) to identify and interpret deleterious protein variants. To train VIPUR, we collected 9477 protein variants with known effects on protein function from multiple organisms and curated structural models for each variant from crystal structures and homology models. VIPUR can be applied to mutations in any organism's proteome with improved generalized accuracy (AUROC .83) and interpretability (AUPR .87) compared to other methods. We demonstrate that VIPUR's predictions of deleteriousness match the biological phenotypes in ClinVar and provide a clear ranking of prediction confidence. We use VIPUR to interpret known mutations associated with inflammation and diabetes, demonstrating the structural diversity of disrupted functional sites and improved interpretation of mutations associated with human diseases. Lastly, we demonstrate VIPUR's ability to highlight candidate variants associated with human diseases by applying VIPUR to de novo variants associated with autism spectrum disorders.

Asunto(s)

Trastorno del Espectro Autista/genética; Enfermedad Celíaca/genética; Enfermedad de Crohn/genética; Diabetes Mellitus/genética; Mutación; Proteínas/genética; Programas Informáticos; Animales; Trastorno del Espectro Autista/metabolismo; Trastorno del Espectro Autista/patología; Benchmarking; Enfermedad Celíaca/metabolismo; Enfermedad Celíaca/patología; Enfermedad de Crohn/metabolismo; Enfermedad de Crohn/patología; Minería de Datos; Bases de Datos de Proteínas; Diabetes Mellitus/metabolismo; Diabetes Mellitus/patología; Humanos; Inflamación; Modelos Moleculares; Anotación de Secuencia Molecular; Proteínas/química; Proteínas/metabolismo

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google

Texto completo: 1 Colección: 01-internacional Base de datos: MEDLINE Asunto principal: Programas Informáticos / Proteínas / Enfermedad de Crohn / Enfermedad Celíaca / Diabetes Mellitus / Trastorno del Espectro Autista / Mutación Tipo de estudio: Prognostic_studies Límite: Animals / Humans Idioma: En Revista: Nucleic Acids Res Año: 2016 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo

Añadir a Mi BVS

Imprimir

XML

PubMed Links

Buscar en Google