Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 20
Filtrar
1.
Genome Med ; 15(1): 51, 2023 Jul 13.
Artigo em Inglês | MEDLINE | ID: mdl-37443081

RESUMO

BACKGROUND: Curated databases of genetic variants assist clinicians and researchers in interpreting genetic variation. Yet, these databases contain some misclassified variants. It is unclear whether variant misclassification is abating as these databases rapidly grow and implement new guidelines. METHODS: Using archives of ClinVar and HGMD, we investigated how variant misclassification has changed over 6 years, across different ancestry groups. We considered inborn errors of metabolism (IEMs) screened in newborns as a model system because these disorders are often highly penetrant with neonatal phenotypes. We used samples from the 1000 Genomes Project (1KGP) to identify individuals with genotypes that were classified by the databases as pathogenic. Due to the rarity of IEMs, nearly all such classified pathogenic genotypes indicate likely variant misclassification in ClinVar or HGMD. RESULTS: While the false-positive rates of both ClinVar and HGMD have improved over time, HGMD variants currently imply two orders of magnitude more affected individuals in 1KGP than ClinVar variants. We observed that African ancestry individuals have a significantly increased chance of being incorrectly indicated to be affected by a screened IEM when HGMD variants are used. However, this bias affecting genomes of African ancestry was no longer significant once common variants were removed in accordance with recent variant classification guidelines. We discovered that ClinVar variants classified as Pathogenic or Likely Pathogenic are reclassified sixfold more often than DM or DM? variants in HGMD, which has likely resulted in ClinVar's lower false-positive rate. CONCLUSIONS: Considering misclassified variants that have since been reclassified reveals our increasing understanding of rare genetic variation. We found that variant classification guidelines and allele frequency databases comprising genetically diverse samples are important factors in reclassification. We also discovered that ClinVar variants common in European and South Asian individuals were more likely to be reclassified to a lower confidence category, perhaps due to an increased chance of these variants being classified by multiple submitters. We discuss features for variant classification databases that would support their continued improvement.


Assuntos
Bases de Dados Genéticas , Variação Genética , Frequência do Gene , Genótipo , Genômica
2.
Science ; 380(6648): eabn8153, 2023 06 02.
Artigo em Inglês | MEDLINE | ID: mdl-37262156

RESUMO

Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole-genome sequencing data for 809 individuals from 233 primate species and identified 4.3 million common protein-altering variants with orthologs in humans. We show that these variants can be inferred to have nondeleterious effects in humans based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases.


Assuntos
Variação Genética , Primatas , Animais , Humanos , Sequência de Bases , Frequência do Gene , Primatas/genética , Sequenciamento Completo do Genoma
3.
bioRxiv ; 2023 May 02.
Artigo em Inglês | MEDLINE | ID: mdl-37205491

RESUMO

Personalized genome sequencing has revealed millions of genetic differences between individuals, but our understanding of their clinical relevance remains largely incomplete. To systematically decipher the effects of human genetic variants, we obtained whole genome sequencing data for 809 individuals from 233 primate species, and identified 4.3 million common protein-altering variants with orthologs in human. We show that these variants can be inferred to have non-deleterious effects in human based on their presence at high allele frequencies in other primate populations. We use this resource to classify 6% of all possible human protein-altering variants as likely benign and impute the pathogenicity of the remaining 94% of variants with deep learning, achieving state-of-the-art accuracy for diagnosing pathogenic variants in patients with genetic diseases. One Sentence Summary: Deep learning classifier trained on 4.3 million common primate missense variants predicts variant pathogenicity in humans.

4.
Front Pediatr ; 9: 663752, 2021.
Artigo em Inglês | MEDLINE | ID: mdl-34350142

RESUMO

Newborn screening (NBS) is a population-based program with a goal of reducing the burden of disease for conditions with significant clinical impact on neonates. Screening tests were originally developed and implemented one at a time, but newer methods have allowed the use of multiplex technologies to expand additions more rapidly to standard panels. Recent improvements in next-generation sequencing are also evolving rapidly from first focusing on individual genes, then panels, and finally all genes as encompassed by whole exome and genome sequencing. The intersection of these two technologies brings the revolutionary possibility of identifying all genetic disorders in newborns, allowing implementation of therapies at the optimum time regardless of symptoms. This article reviews the history of newborn screening and early studies examining the use of whole genome and exome sequencing as a screening tool. Lessons learned from these studies are discussed, along with technical, ethical, and societal challenges to broad implementation.

5.
Am J Hum Genet ; 108(4): 535-548, 2021 04 01.
Artigo em Inglês | MEDLINE | ID: mdl-33798442

RESUMO

Genome sequencing is enabling precision medicine-tailoring treatment to the unique constellation of variants in an individual's genome. The impact of recurrent pathogenic variants is often understood, however there is a long tail of rare genetic variants that are uncharacterized. The problem of uncharacterized rare variation is especially acute when it occurs in genes of known clinical importance with functionally consequential variants and associated mechanisms. Variants of uncertain significance (VUSs) in these genes are discovered at a rate that outpaces current ability to classify them with databases of previous cases, experimental evaluation, and computational predictors. Clinicians are thus left without guidance about the significance of variants that may have actionable consequences. Computational prediction of the impact of rare genetic variation is increasingly becoming an important capability. In this paper, we review the technical and ethical challenges of interpreting the function of rare variants in two settings: inborn errors of metabolism in newborns and pharmacogenomics. We propose a framework for a genomic learning healthcare system with an initial focus on early-onset treatable disease in newborns and actionable pharmacogenomics. We argue that (1) a genomic learning healthcare system must allow for continuous collection and assessment of rare variants, (2) emerging machine learning methods will enable algorithms to predict the clinical impact of rare variants on protein function, and (3) ethical considerations must inform the construction and deployment of all rare-variation triage strategies, particularly with respect to health disparities arising from unbalanced ancestry representation.


Assuntos
Variação Genética/genética , Genética Médica , Genômica , Aprendizado de Máquina , Erros Inatos do Metabolismo/genética , Farmacogenética , Medicina de Precisão , Genoma Humano/genética , Humanos , Recém-Nascido
6.
Int J Neonatal Screen ; 6(2)2020 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-32802992

RESUMO

Short-chain acyl-CoA dehydrogenase deficiency (SCADD) is a rare autosomal recessive disorder of ß-oxidation caused by pathogenic variants in the ACADS gene. Analyte testing for SCADD in blood and urine, including newborn screening (NBS) using tandem mass spectrometry (MS/MS) on dried blood spots (DBSs), is complicated by the presence of two relatively common ACADS variants (c.625G>A and c.511C>T). Individuals homozygous for these variants or compound heterozygous do not have clinical disease but do have reduced short-chain acyl-CoA dehydrogenase (SCAD) activity, resulting in elevated blood and urine metabolites. As part of a larger study of the potential role of exome sequencing in NBS in California, we reviewed ACADS sequence and MS/MS data from DBSs from a cohort of 74 patients identified to have SCADD. Of this cohort, approximately 60% had one or more of the common variants and did not have the two rare variants, and thus would need no further testing. Retrospective analysis of ethylmalonic acid, glutaric acid, 2-hydroxyglutaric acid, 3-hydroxyglutaric acid, and methylsuccinic acid demonstrated that second-tier testing applied before the release of the newborn screening result could reduce referrals by over 50% and improve the positive predictive value for SCADD to above 75%.

7.
Nat Med ; 26(9): 1392-1397, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32778825

RESUMO

Public health newborn screening (NBS) programs provide population-scale ascertainment of rare, treatable conditions that require urgent intervention. Tandem mass spectrometry (MS/MS) is currently used to screen newborns for a panel of rare inborn errors of metabolism (IEMs)1-4. The NBSeq project evaluated whole-exome sequencing (WES) as an innovative methodology for NBS. We obtained archived residual dried blood spots and data for nearly all IEM cases from the 4.5 million infants born in California between mid-2005 and 2013 and from some infants who screened positive by MS/MS, but were unaffected upon follow-up testing. WES had an overall sensitivity of 88% and specificity of 98.4%, compared to 99.0% and 99.8%, respectively for MS/MS, although effectiveness varied among individual IEMs. Thus, WES alone was insufficiently sensitive or specific to be a primary screen for most NBS IEMs. However, as a secondary test for infants with abnormal MS/MS screens, WES could reduce false-positive results, facilitate timely case resolution and in some instances even suggest more appropriate or specific diagnosis than that initially obtained. This study represents the largest, to date, sequencing effort of an entire population of IEM-affected cases, allowing unbiased assessment of current capabilities of WES as a tool for population screening.


Assuntos
Sequenciamento do Exoma/métodos , Exoma/genética , Erros Inatos do Metabolismo/diagnóstico , Erros Inatos do Metabolismo/genética , Triagem Neonatal/métodos , Testes Genéticos , Humanos , Recém-Nascido , Erros Inatos do Metabolismo/epidemiologia , Espectrometria de Massas em Tandem
8.
Hum Mutat ; 40(9): 1373-1391, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31322791

RESUMO

Whole-genome sequencing (WGS) holds great potential as a diagnostic test. However, the majority of patients currently undergoing WGS lack a molecular diagnosis, largely due to the vast number of undiscovered disease genes and our inability to assess the pathogenicity of most genomic variants. The CAGI SickKids challenges attempted to address this knowledge gap by assessing state-of-the-art methods for clinical phenotype prediction from genomes. CAGI4 and CAGI5 participants were provided with WGS data and clinical descriptions of 25 and 24 undiagnosed patients from the SickKids Genome Clinic Project, respectively. Predictors were asked to identify primary and secondary causal variants. In addition, for CAGI5, groups had to match each genome to one of three disorder categories (neurologic, ophthalmologic, and connective), and separately to each patient. The performance of matching genomes to categories was no better than random but two groups performed significantly better than chance in matching genomes to patients. Two of the ten variants proposed by two groups in CAGI4 were deemed to be diagnostic, and several proposed pathogenic variants in CAGI5 are good candidates for phenotype expansion. We discuss implications for improving in silico assessment of genomic variants and identifying new disease genes.


Assuntos
Biologia Computacional/métodos , Variação Genética , Doenças não Diagnosticadas/diagnóstico , Adolescente , Criança , Pré-Escolar , Simulação por Computador , Bases de Dados Genéticas , Feminino , Predisposição Genética para Doença , Humanos , Masculino , Fenótipo , Doenças não Diagnosticadas/genética , Sequenciamento Completo do Genoma
9.
Hum Mutat ; 40(9): 1202-1214, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31283070

RESUMO

Genome sequencing identifies vast number of genetic variants. Predicting these variants' molecular and clinical effects is one of the preeminent challenges in human genetics. Accurate prediction of the impact of genetic variants improves our understanding of how genetic information is conveyed to molecular and cellular functions, and is an essential step towards precision medicine. Over one hundred tools/resources have been developed specifically for this purpose. We summarize these tools as well as their characteristics, in the genetic Variant Impact Predictor Database (VIPdb). This database will help researchers and clinicians explore appropriate tools, and inform the development of improved methods. VIPdb can be browsed and downloaded at https://genomeinterpretation.org/vipdb.


Assuntos
Bases de Dados Genéticas , Variação Genética , Proteínas/química , Proteínas/genética , Biologia Computacional , Predisposição Genética para Doença , Genoma Humano , Humanos , Fenótipo , Medicina de Precisão , Estrutura Secundária de Proteína , Interface Usuário-Computador
10.
Hum Mutat ; 40(9): 1507-1518, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31228295

RESUMO

We present a computational model for predicting mutational impact on enzymatic activity of human acid α-glucosidase (GAA), an enzyme associated with Pompe disease. Using a model that combines features specific to GAA with other general evolutionary and physiochemical features, we made blind predictions of enzymatic activity relative to wildtype human GAA for >300 GAA mutants, as part of the Critical Assessment of Genome Interpretation 5 GAA challenge. We found that gene-specific features can improve the performance of existing impact prediction tools that mostly rely on general features for pathogenicity prediction. Majority of the poorly predicted mutants that lower wildtype GAA enzyme activity occurred on the surface of the GAA protein. We also found that gene-specific features were uncorrelated with existing methods and provided orthogonal information for interpreting the origin of pathogenicity, particular in variants that are poorly predicted by existing general methods. Specific variants in GAA, when investigated in the context of its protein structure, suggested gene-specific information like the disruption of local backbone torsional geometry and disruption of particular sidechain-sidechain hydrogen bonds as some potential sources for pathogenicity.


Assuntos
Biologia Computacional/métodos , Doença de Depósito de Glicogênio Tipo II/genética , Mutação , alfa-Glucosidases/química , Algoritmos , Predisposição Genética para Doença , Humanos , Ligação de Hidrogênio , Modelos Moleculares , Conformação Proteica , alfa-Glucosidases/genética
11.
Hum Mutat ; 40(9): 1280-1291, 2019 09.
Artigo em Inglês | MEDLINE | ID: mdl-31106481

RESUMO

The integrative analysis of high-throughput reporter assays, machine learning, and profiles of epigenomic chromatin state in a broad array of cells and tissues has the potential to significantly improve our understanding of noncoding regulatory element function and its contribution to human disease. Here, we report results from the CAGI 5 regulation saturation challenge where participants were asked to predict the impact of nucleotide substitution at every base pair within five disease-associated human enhancers and nine disease-associated promoters. A library of mutations covering all bases was generated by saturation mutagenesis and altered activity was assessed in a massively parallel reporter assay (MPRA) in relevant cell lines. Reporter expression was measured relative to plasmid DNA to determine the impact of variants. The challenge was to predict the functional effects of variants on reporter expression. Comparative analysis of the full range of submitted prediction results identifies the most successful models of transcription factor binding sites, machine learning algorithms, and ways to choose among or incorporate diverse datatypes and cell-types for training computational models. These results have the potential to improve the design of future studies on more diverse sets of regulatory elements and aid the interpretation of disease-associated genetic variation.


Assuntos
DNA/química , Epigenômica/métodos , Mutação Puntual , Sítios de Ligação , Linhagem Celular , Cromatina/genética , DNA/metabolismo , Elementos Facilitadores Genéticos , Predisposição Genética para Doença , Humanos , Aprendizado de Máquina , Regiões Promotoras Genéticas , Fatores de Transcrição/metabolismo
12.
Hum Mutat ; 38(9): 1155-1168, 2017 09.
Artigo em Inglês | MEDLINE | ID: mdl-28397312

RESUMO

The CAGI-4 Hopkins clinical panel challenge was an attempt to assess state-of-the-art methods for clinical phenotype prediction from DNA sequence. Participants were provided with exonic sequences of 83 genes for 106 patients from the Johns Hopkins DNA Diagnostic Laboratory. Five groups participated in the challenge, predicting both the probability that each patient had each of the 14 possible classes of disease, as well as one or more causal variants. In cases where the Hopkins laboratory reported a variant, at least one predictor correctly identified the disease class in 36 of the 43 patients (84%). Even in cases where the Hopkins laboratory did not find a variant, at least one predictor correctly identified the class in 39 of the 63 patients (62%). Each prediction group correctly diagnosed at least one patient that was not successfully diagnosed by any other group. We discuss the causal variant predictions by different groups and their implications for further development of methods to assess variants of unknown significance. Our results suggest that clinically relevant variants may be missed when physicians order small panels targeted on a specific phenotype. We also quantify the false-positive rate of DNA-guided analysis in the absence of prior phenotypic indication.


Assuntos
Biologia Computacional/métodos , Análise de Sequência de DNA/métodos , Bases de Dados Genéticas , Predisposição Genética para Doença , Testes Genéticos , Humanos , Fenótipo
13.
Protein Sci ; 26(2): 208-217, 2017 02.
Artigo em Inglês | MEDLINE | ID: mdl-27750371

RESUMO

We determined the NMR structure of a highly aromatic (13%) protein of unknown function, Aq1974 from Aquifex aeolicus (PDB ID: 5SYQ). The unusual sequence of this protein has a tryptophan content five times the normal (six tryptophan residues of 114 or 5.2% while the average tryptophan content is 1.0%) with the tryptophans occurring in a WXW motif. It has no detectable sequence homology with known protein structures. Although its NMR spectrum suggested that the protein was rich in ß-sheet, upon resonance assignment and solution structure determination, the protein was found to be primarily α-helical with a small two-stranded ß-sheet with a novel fold that we have termed an Aromatic Claw. As this fold was previously unknown and the sequence unique, we submitted the sequence to CASP10 as a target for blind structural prediction. At the end of the competition, the sequence was classified a hard template based model; the structural relationship between the template and the experimental structure was small and the predictions all failed to predict the structure. CSRosetta was found to predict the secondary structure and its packing; however, it was found that there was little correlation between CSRosetta score and the RMSD between the CSRosetta structure and the NMR determined one. This work demonstrates that even in relatively small proteins, we do not yet have the capacity to accurately predict the fold for all primary sequences. The experimental discovery of new folds helps guide the improvement of structural prediction methods.


Assuntos
Bactérias/química , Proteínas de Bactérias/química , Dobramento de Proteína , Triptofano/química , Ressonância Magnética Nuclear Biomolecular , Estrutura Secundária de Proteína
14.
N Engl J Med ; 375(22): 2165-2176, 2016 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-27959755

RESUMO

BACKGROUND: Severe combined immunodeficiency (SCID) is characterized by arrested T-lymphocyte production and by B-lymphocyte dysfunction, which result in life-threatening infections. Early diagnosis of SCID through population-based screening of newborns can aid clinical management and help improve outcomes; it also permits the identification of previously unknown factors that are essential for lymphocyte development in humans. METHODS: SCID was detected in a newborn before the onset of infections by means of screening of T-cell-receptor excision circles, a biomarker for thymic output. On confirmation of the condition, the affected infant was treated with allogeneic hematopoietic stem-cell transplantation. Exome sequencing in the patient and parents was followed by functional analysis of a prioritized candidate gene with the use of human hematopoietic stem cells and zebrafish embryos. RESULTS: The infant had "leaky" SCID (i.e., a form of SCID in which a minimal degree of immune function is preserved), as well as craniofacial and dermal abnormalities and the absence of a corpus callosum; his immune deficit was fully corrected by hematopoietic stem-cell transplantation. Exome sequencing revealed a heterozygous de novo missense mutation, p.N441K, in BCL11B. The resulting BCL11B protein had dominant negative activity, which abrogated the ability of wild-type BCL11B to bind DNA, thereby arresting development of the T-cell lineage and disrupting hematopoietic stem-cell migration; this revealed a previously unknown function of BCL11B. The patient's abnormalities, when recapitulated in bcl11ba-deficient zebrafish, were reversed by ectopic expression of functionally intact human BCL11B but not mutant human BCL11B. CONCLUSIONS: Newborn screening facilitated the identification and treatment of a previously unknown cause of human SCID. Coupling exome sequencing with an evaluation of candidate genes in human hematopoietic stem cells and in zebrafish revealed that a constitutional BCL11B mutation caused human multisystem anomalies with SCID and also revealed a prethymic role for BCL11B in hematopoietic progenitors. (Funded by the National Institutes of Health and others.).


Assuntos
Anormalidades Múltiplas/genética , Células-Tronco Hematopoéticas/fisiologia , Mutação de Sentido Incorreto , Proteínas Repressoras/genética , Imunodeficiência Combinada Severa/genética , Proteínas Supressoras de Tumor/genética , Animais , Encéfalo/diagnóstico por imagem , Movimento Celular , Modelos Animais de Doenças , Regulação da Expressão Gênica , Transplante de Células-Tronco Hematopoéticas , Células-Tronco Hematopoéticas/metabolismo , Humanos , Técnicas In Vitro , Recém-Nascido , Imageamento por Ressonância Magnética , Masculino , Triagem Neonatal/métodos , Receptores de Antígenos de Linfócitos T , Proteínas Repressoras/deficiência , Proteínas Repressoras/metabolismo , Proteínas Supressoras de Tumor/deficiência , Proteínas Supressoras de Tumor/metabolismo , Peixe-Zebra/crescimento & desenvolvimento
15.
Proc Natl Acad Sci U S A ; 112(27): 8302-7, 2015 Jul 07.
Artigo em Inglês | MEDLINE | ID: mdl-26100906

RESUMO

Experimental and computational folding studies of Proteins L & G and NuG2 typically find that sequence differences determine which of the two hairpins is formed in the transition state ensemble (TSE). However, our recent work on Protein L finds that its TSE contains both hairpins, compelling a reassessment of the influence of sequence on the folding behavior of the other two homologs. We characterize the TSEs for Protein G and NuG2b, a triple mutant of NuG2, using ψ analysis, a method for identifying contacts in the TSE. All three homologs are found to share a common and near-native TSE topology with interactions between all four strands. However, the helical content varies in the TSE, being largely absent in Proteins G & L but partially present in NuG2b. The variability likely arises from competing propensities for the formation of nonnative ß turns in the naturally occurring proteins, as observed in our TerItFix folding algorithm. All-atom folding simulations of NuG2b recapitulate the observed TSEs with four strands for 5 of 27 transition paths [Lindorff-Larsen K, Piana S, Dror RO, Shaw DE (2011) Science 334(6055):517-520]. Our data support the view that homologous proteins have similar folding mechanisms, even when nonnative interactions are present in the transition state. These findings emphasize the ongoing challenge of accurately characterizing and predicting TSEs, even for relatively simple proteins.


Assuntos
Simulação de Dinâmica Molecular , Dobramento de Proteína , Estrutura Secundária de Proteína , Proteínas/química , Algoritmos , Sequência de Aminoácidos , Cinética , Dados de Sequência Molecular , Mutação , Proteínas/genética , Termodinâmica
16.
Phys Rev Lett ; 111(2): 028103, 2013 Jul 12.
Artigo em Inglês | MEDLINE | ID: mdl-23889448

RESUMO

We demonstrate the ability of simultaneously determining a protein's folding pathway and structure using a properly formulated model without prior knowledge of the native structure. Our model employs a natural coordinate system for describing proteins and a search strategy inspired by the observation that real proteins fold in a sequential fashion by incrementally stabilizing nativelike substructures or "foldons." Comparable folding pathways and structures are obtained for the twelve proteins recently studied using atomistic molecular dynamics simulations [K. Lindorff-Larsen, S. Piana, R. O. Dror, D. E. Shaw, Science 334, 517 (2011)], with our calculations running several orders of magnitude faster. We find that nativelike propensities in the unfolded state do not necessarily determine the order of structure formation, a departure from a major conclusion of the molecular dynamics study. Instead, our results support a more expansive view wherein intrinsic local structural propensities may be enhanced or overridden in the folding process by environmental context. The success of our search strategy validates it as an expedient mechanism for folding both in silico and in vivo.


Assuntos
Aminoácidos/química , Modelos Químicos , Simulação de Dinâmica Molecular , Proteínas/química , Sequência de Aminoácidos , Dobramento de Proteína , Relação Estrutura-Atividade
17.
Proc Natl Acad Sci U S A ; 109(43): 17442-7, 2012 Oct 23.
Artigo em Inglês | MEDLINE | ID: mdl-23045636

RESUMO

Motivated by the relationship between the folding mechanism and the native structure, we develop a unified approach for predicting folding pathways and tertiary structure using only the primary sequence as input. Simulations begin from a realistic unfolded state devoid of secondary structure and use a chain representation lacking explicit side chains, rendering the simulations many orders of magnitude faster than molecular dynamics simulations. The multiple round nature of the algorithm mimics the authentic folding process and tests the effectiveness of sequential stabilization (SS) as a search strategy wherein 2° structural elements add onto existing structures in a process of progressive learning and stabilization of structure found in prior rounds of folding. Because no a priori knowledge is used, we can identify kinetically significant non-native interactions and intermediates, sometimes generated by only two mutations, while the evolution of contact matrices is often consistent with experiments. Moreover, structure prediction improves substantially by incorporating information from prior rounds. The success of our simple, homology-free approach affirms the validity of our description of the primary determinants of folding pathways and structure, and the effectiveness of SS as a search strategy.


Assuntos
Dobramento de Proteína , Estrutura Secundária de Proteína , Algoritmos , Cinética , Simulação de Dinâmica Molecular , Método de Monte Carlo
18.
J Mol Biol ; 420(3): 220-34, 2012 Jul 13.
Artigo em Inglês | MEDLINE | ID: mdl-22522126

RESUMO

Progress in understanding protein folding relies heavily upon an interplay between experiment and theory. In particular, readily interpretable experimental data that can be meaningfully compared to simulations are required. According to standard mutational ϕ analysis, the transition state for Protein L contains only a single hairpin. However, we demonstrate here using ψ analysis with engineered metal ion binding sites that the transition state is extensive, containing the entire four-stranded ß sheet. Underreporting of the structural content of the transition state by ϕ analysis also occurs for acyl phosphatase [Pandit, A. D., Jha, A., Freed, K. F. & Sosnick, T. R., (2006). Small proteins fold through transition states with native-like topologies. J. Mol. Biol.361, 755-770], ubiquitin [Sosnick, T. R., Dothager, R. S. & Krantz, B. A., (2004). Differences in the folding transition state of ubiquitin indicated by ϕ and ψ analyses. Proc. Natl Acad. Sci. USA 101, 17377-17382] and BdpA [Baxa, M., Freed, K. F. & Sosnick, T. R., (2008). Quantifying the structural requirements of the folding transition state of protein A and other systems. J. Mol. Biol.381, 1362-1381]. The carboxy-terminal hairpin in the transition state of Protein L is found to be nonnative, a significant result that agrees with our Protein Data Bank-based backbone sampling and all-atom simulations. The nonnative character partially explains the failure of accepted experimental and native-centric computational approaches to adequately describe the transition state. Hence, caution is required even when an apparent agreement exists between experiment and theory, thus highlighting the importance of having alternative methods for characterizing transition states.


Assuntos
Proteínas de Bactérias/química , Dobramento de Proteína , Sequência de Aminoácidos , Proteínas de Bactérias/metabolismo , Sítios de Ligação , Cinética , Metais/metabolismo , Modelos Moleculares , Simulação de Dinâmica Molecular , Dados de Sequência Molecular
19.
Protein Sci ; 21(1): 107-21, 2012 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-22095743

RESUMO

Template-based methods for predicting protein structure provide models for a significant portion of the protein but often contain insertions or chain ends (InsEnds) of indeterminate conformation. The local structure prediction "problem" entails modeling the InsEnds onto the rest of the protein. A well-known limit involves predicting loops of ≤12 residues in crystal structures. However, InsEnds may contain as many as ~50 amino acids, and the template-based model of the protein itself may be imperfect. To address these challenges, we present a free modeling method for predicting the local structure of loops and large InsEnds in both crystal structures and template-based models. The approach uses single amino acid torsional angle "pivot" moves of the protein backbone with a C(ß) level representation. Nevertheless, our accuracy for loops is comparable to existing methods. We also apply a more stringent test, the blind structure prediction and refinement categories of the CASP9 tournament, where we improve the quality of several homology based models by modeling InsEnds as long as 45 amino acids, sizes generally inaccessible to existing loop prediction methods. Our approach ranks as one of the best in the CASP9 refinement category that involves improving template-based models so that they can function as molecular replacement models to solve the phase problem for crystallographic structure determination.


Assuntos
Proteínas/química , Algoritmos , Sequência de Aminoácidos , Análise por Conglomerados , Simulação por Computador , Modelos Moleculares , Dados de Sequência Molecular , Conformação Proteica , Dobramento de Proteína , Estrutura Terciária de Proteína , Proteínas/metabolismo , Software
20.
J Chem Phys ; 127(11): 114508, 2007 Sep 21.
Artigo em Inglês | MEDLINE | ID: mdl-17887858

RESUMO

We studied the temperature dependence of the structural relaxation in poly(vinyl acetate) near the glass transition temperature with single molecule spectroscopy from Tg-1 K to Tg+12 K. The temperature dependence of the observed relaxation times matches results from bulk experiments; the observed relaxation times are, however, 80-fold slower than those from bulk experiments at the same temperature. We attribute this factor to the size of the probe molecule. The individual relaxation times of the single molecule environments are distributed normally on a logarithmic time scale, confirming that the dynamics in poly(vinyl acetate) is heterogeneous. The width of the distribution of individual relaxation times is essentially independent of temperature. The observed full width at half maximum (FWHM) on a logarithmic time axis is approximately 0.7, corresponding to a factor of about 5-fold, significantly narrower than the dielectric spectrum of the same material with a FWHM of about 2.0 on a logarithmic time axis, corresponding to a factor of about 100-fold. We explain this narrow width as the effect of temporal averaging of single molecule fluorescence signals over numerous environments due to a limited lifetime of the probed heterogeneities, indicating that heterogeneities are dynamic. We determine a loose upper limit for the ratio of the structural relaxation time to the lifetime of the heterogeneities (the rate memory parameter) of Q<80 for the range of investigated temperatures.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA