Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 66
Filter
Add more filters

Country/Region as subject
Publication year range
1.
J Med Genet ; 59(4): 385-392, 2022 04.
Article in English | MEDLINE | ID: mdl-33766936

ABSTRACT

BACKGROUND: Improving the clinical interpretation of missense variants can increase the diagnostic yield of genomic testing and lead to personalised management strategies. Currently, due to the imprecision of bioinformatic tools that aim to predict variant pathogenicity, their role in clinical guidelines remains limited. There is a clear need for more accurate prediction algorithms and this study aims to improve performance by harnessing structural biology insights. The focus of this work is missense variants in a subset of genes associated with X linked disorders. METHODS: We have developed a protein-specific variant interpreter (ProSper) that combines genetic and protein structural data. This algorithm predicts missense variant pathogenicity by applying machine learning approaches to the sequence and structural characteristics of variants. RESULTS: ProSper outperformed seven previously described tools, including meta-predictors, in correctly evaluating whether or not variants are pathogenic; this was the case for 11 of the 21 genes associated with X linked disorders that met the inclusion criteria for this study. We also determined gene-specific pathogenicity thresholds that improved the performance of VEST4, REVEL and ClinPred, the three best-performing tools out of the seven that were evaluated; this was the case in 11, 11 and 12 different genes, respectively. CONCLUSION: ProSper can form the basis of a molecule-specific prediction tool that can be implemented into diagnostic strategies. It can allow the accurate prioritisation of missense variants associated with X linked disorders, aiding precise and timely diagnosis. In addition, we demonstrate that gene-specific pathogenicity thresholds for a range of missense prioritisation tools can lead to an increase in prediction accuracy.


Subject(s)
Genes, X-Linked , Mutation, Missense , Algorithms , Computational Biology , Humans , Mutation, Missense/genetics
2.
J Biol Chem ; 294(10): 3794-3805, 2019 03 08.
Article in English | MEDLINE | ID: mdl-30651349

ABSTRACT

Protein sequences of members of the plasminogen activation system are present throughout the entire vertebrate phylum. This important and well-described proteolytic cascade is governed by numerous protease-substrate and protease-inhibitor interactions whose conservation is crucial to maintaining unchanged protein function throughout evolution. The pressure to preserve protein-protein interactions may lead to either co-conservation or covariation of binding interfaces. Here, we combined covariation analysis and structure-based prediction to analyze the binding interfaces of urokinase (uPA):plasminogen activator inhibitor-1 (PAI-1) and uPA:plasminogen complexes. We detected correlated variation between the S3-pocket-lining residues of uPA and the P3 residue of both PAI-1 and plasminogen. These residues are known to form numerous polar interactions in the human uPA:PAI-1 Michaelis complex. To test the effect of mutations that correlate with each other and have occurred during mammalian diversification on protein-protein interactions, we produced uPA, PAI-1, and plasminogen from human and zebrafish to represent mammalian and nonmammalian orthologs. Using single amino acid point substitutions in these proteins, we found that the binding interfaces of uPA:plasminogen and uPA:PAI-1 may have coevolved to maintain tight interactions. Moreover, we conclude that although the interaction areas between protease-substrate and protease-inhibitor are shared, the two interactions are mechanistically different. Compared with a protease cleaving its natural substrate, the interaction between a protease and its inhibitor is more complex and involves a more fine-tuned mechanism. Understanding the effects of evolution on specific protein interactions may help further pharmacological interventions of the plasminogen activation system and other proteolytic systems.


Subject(s)
Evolution, Molecular , Plasminogen Activator Inhibitor 1/metabolism , Plasminogen Activators/metabolism , Amino Acid Sequence , Animals , Humans , Models, Molecular , Plasminogen Activators/antagonists & inhibitors , Plasminogen Activators/chemistry , Protein Binding , Protein Conformation , Urokinase-Type Plasminogen Activator/metabolism
3.
PLoS Genet ; 13(10): e1007068, 2017 Oct.
Article in English | MEDLINE | ID: mdl-29084269

ABSTRACT

The coronary vasculature is an essential vessel network providing the blood supply to the heart. Disruptions in coronary blood flow contribute to cardiac disease, a major cause of premature death worldwide. The generation of treatments for cardiovascular disease will be aided by a deeper understanding of the developmental processes that underpin coronary vessel formation. From an ENU mutagenesis screen, we have isolated a mouse mutant displaying embryonic hydrocephalus and cardiac defects (EHC). Positional cloning and candidate gene analysis revealed that the EHC phenotype results from a point mutation in a splice donor site of the Myh10 gene, which encodes NMHC IIB. Complementation testing confirmed that the Myh10 mutation causes the EHC phenotype. Characterisation of the EHC cardiac defects revealed abnormalities in myocardial development, consistent with observations from previously generated NMHC IIB null mouse lines. Analysis of the EHC mutant hearts also identified defects in the formation of the coronary vasculature. We attribute the coronary vessel abnormalities to defective epicardial cell function, as the EHC epicardium displays an abnormal cell morphology, reduced capacity to undergo epithelial-mesenchymal transition (EMT), and impaired migration of epicardial-derived cells (EPDCs) into the myocardium. Our studies on the EHC mutant demonstrate a requirement for NMHC IIB in epicardial function and coronary vessel formation, highlighting the importance of this protein in cardiac development and ultimately, embryonic survival.


Subject(s)
Coronary Vessels/growth & development , Embryonic Development/genetics , Myosin Heavy Chains/genetics , Nonmuscle Myosin Type IIB/genetics , Pericardium/growth & development , Animals , Cell Differentiation/genetics , Coronary Vessels/metabolism , Embryo, Mammalian , Epithelial-Mesenchymal Transition/genetics , Humans , Hydrocephalus/genetics , Hydrocephalus/metabolism , Hydrocephalus/pathology , Mice , Mice, Knockout , Mutation , Myocardium/metabolism , Pericardium/metabolism
4.
Proc Biol Sci ; 284(1861)2017 Aug 30.
Article in English | MEDLINE | ID: mdl-28835561

ABSTRACT

Duplication of genes or genomes provides the raw material for evolutionary innovation. After duplication a gene may be lost, recombine with another gene, have its function modified or be retained in an unaltered state. The fate of duplication is usually studied by comparing extant genomes and reconstructing the most likely ancestral states. Valuable as this approach is, it may miss the most rapid evolutionary events. Here, we engineered strains of Saccharomyces cerevisiae carrying tandem and non-tandem duplications of the singleton gene IFA38 to monitor (i) the fate of the duplicates in different conditions, including time scale and asymmetry of gene loss, and (ii) the changes in fitness and transcriptome of the strains immediately after duplication and after experimental evolution. We found that the duplication brings widespread transcriptional changes, but a fitness advantage is only present in fermentable media. In respiratory conditions, the yeast strains consistently lose the non-tandem IFA38 gene copy in a surprisingly short time, within only a few generations. This gene loss appears to be asymmetric and dependent on genome location, since the original IFA38 copy and the tandem duplicate are retained. Overall, this work shows for the first time that gene loss can be extremely rapid and context dependent.


Subject(s)
Evolution, Molecular , Gene Duplication , Saccharomyces cerevisiae/genetics , Genetic Fitness , Genome, Fungal , Microorganisms, Genetically-Modified/genetics , Transcriptome
5.
BMC Evol Biol ; 16: 40, 2016 Feb 18.
Article in English | MEDLINE | ID: mdl-26892785

ABSTRACT

BACKGROUND: Physical interactions between proteins are essential for almost all biological functions and systems. To understand the evolution of function it is therefore important to understand the evolution of molecular interactions. Of key importance is the evolution of binding specificity, the set of interactions made by a protein, since change in specificity can lead to "rewiring" of interaction networks. Unfortunately, the interfaces through which proteins interact are complex, typically containing many amino-acid residues that collectively must contribute to binding specificity as well as binding affinity, structural integrity of the interface and solubility in the unbound state. RESULTS: In order to study the relationship between interface composition and binding specificity, we make use of paralogous pairs of yeast proteins. Immediately after duplication these paralogues will have identical sequences and protein products that make an identical set of interactions. As the sequences diverge, we can correlate amino-acid change in the interface with any change in the specificity of binding. We show that change in interface regions correlates only weakly with change in specificity, and many variants in interfaces are functionally equivalent. We show that many of the residue replacements within interfaces are silent with respect to their contribution to binding specificity. CONCLUSIONS: We conclude that such functionally-equivalent change has the potential to contribute to evolutionary plasticity in interfaces by creating cryptic variation, which in turn may provide the raw material for functional innovation and coevolution.


Subject(s)
Evolution, Molecular , Saccharomyces cerevisiae Proteins/chemistry , Amino Acids/genetics , Binding Sites , Biological Evolution , Gene Duplication , Genome, Fungal , Protein Interaction Domains and Motifs , Protein Interaction Mapping , Saccharomyces cerevisiae Proteins/genetics
6.
Proteins ; 84(4): 411-26, 2016 Apr.
Article in English | MEDLINE | ID: mdl-26799916

ABSTRACT

Energy functions, fragment libraries, and search methods constitute three key components of fragment-assembly methods for protein structure prediction, which are all crucial for their ability to generate high-accuracy predictions. All of these components are tightly coupled; efficient searching becomes more important as the quality of fragment libraries decreases. Given these relationships, there is currently a poor understanding of the strengths and weaknesses of the sampling approaches currently used in fragment-assembly techniques. Here, we determine how the performance of search techniques can be assessed in a meaningful manner, given the above problems. We describe a set of techniques that aim to reduce the impact of the energy function, and assess exploration in view of the search space defined by a given fragment library. We illustrate our approach using Rosetta and EdaFold, and show how certain features of these methods encourage or limit conformational exploration. We demonstrate that individual trajectories of Rosetta are susceptible to local minima in the energy landscape, and that this can be linked to non-uniform sampling across the protein chain. We show that EdaFold's novel approach can help balance broad exploration with locating good low-energy conformations. This occurs through two mechanisms which cannot be readily differentiated using standard performance measures: exclusion of false minima, followed by an increasingly focused search in low-energy regions of conformational space. Measures such as ours can be helpful in characterizing new fragment-based methods in terms of the quality of conformational exploration realized.


Subject(s)
Algorithms , Gene Library , Peptide Fragments/chemistry , Computer Simulation , Models, Molecular , Peptide Fragments/genetics , Protein Conformation , Protein Folding , Thermodynamics
7.
Mol Biol Evol ; 32(9): 2456-68, 2015 Sep.
Article in English | MEDLINE | ID: mdl-25944916

ABSTRACT

Recent developments in the analysis of amino acid covariation are leading to breakthroughs in protein structure prediction, protein design, and prediction of the interactome. It is assumed that observed patterns of covariation are caused by molecular coevolution, where substitutions at one site affect the evolutionary forces acting at neighboring sites. Our theoretical and empirical results cast doubt on this assumption. We demonstrate that the strongest coevolutionary signal is a decrease in evolutionary rate and that unfeasibly long times are required to produce coordinated substitutions. We find that covarying substitutions are mostly found on different branches of the phylogenetic tree, indicating that they are independent events that may or may not be attributable to coevolution. These observations undermine the hypothesis that molecular coevolution is the primary cause of the covariation signal. In contrast, we find that the pairs of residues with the strongest covariation signal tend to have low evolutionary rates, and that it is this low rate that gives rise to the covariation signal. Slowly evolving residue pairs are disproportionately located in the protein's core, which explains covariation methods' ability to detect pairs of residues that are close in three dimensions. These observations lead us to propose the "coevolution paradox": The strength of coevolution required to cause coordinated changes means the evolutionary rate is so low that such changes are highly unlikely to occur. As modern covariation methods may lead to breakthroughs in structural genomics, it is critical to recognize their biases and limitations.


Subject(s)
Evolution, Molecular , Markov Chains , Models, Genetic , Mutation Rate , Phylogeny , Protein Folding , Proteins/genetics
8.
Bioinformatics ; 31(3): 416-7, 2015 Feb 01.
Article in English | MEDLINE | ID: mdl-25294920

ABSTRACT

SUMMARY: Gene duplication and loss are important processes in the evolution of gene families. Moreover, growth of families by duplication and retention is an important mechanism by which organisms gain new functions. Therefore the ability to infer the evolutionary histories of families is an important step in understanding the evolution of function. We have recently developed DupliPHY, a software tool to infer gene family histories using parsimony and maximum likelihood. Here, we present DupliPHY-Web a web server for DupliPHY that implements additional maximum likelihood functionality and provides users an intuitive interface to run DupliPHY. AVAILABILITY AND IMPLEMENTATION: DupliPHY-Web is available at www.bioinf.manchester.ac.uk/dupliphy/ CONTACT: : ryan.ames@manchester.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Computational Biology/methods , Evolution, Molecular , Gene Duplication , Software , Humans , Internet , Likelihood Functions , Phylogeny
9.
Evol Comput ; 24(4): 577-607, 2016.
Article in English | MEDLINE | ID: mdl-26908350

ABSTRACT

Computational approaches to de novo protein tertiary structure prediction, including those based on the preeminent "fragment-assembly" technique, have failed to scale up fully to larger proteins (on the order of 100 residues and above). A number of limiting factors are thought to contribute to the scaling problem over and above the simple combinatorial explosion, but the key ones relate to the lack of exploration of properly diverse protein folds, and to an acute form of "deception" in the energy function, whereby low-energy conformations do not reliably equate with native structures. In this article, solutions to both of these problems are investigated through a multistage memetic algorithm incorporating the successful Rosetta method as a local search routine. We found that specialised genetic operators significantly add to structural diversity and that this translates well to reaching low energies. The use of a generalised stochastic ranking procedure for selection enables the memetic algorithm to handle and traverse deep energy wells that can be considered deceptive, which further adds to the ability of the algorithm to obtain a much-improved diversity of folds. The results should translate to a tangible improvement in the performance of protein structure prediction algorithms in blind experiments such as CASP, and potentially to a further step towards the more challenging problem of predicting the three-dimensional shape of large proteins.


Subject(s)
Algorithms , Proteins/chemistry , Computational Biology , Evolution, Molecular , Molecular Dynamics Simulation , Peptide Fragments/chemistry , Peptide Fragments/genetics , Protein Conformation , Protein Structure, Secondary , Protein Structure, Tertiary , Proteins/genetics , Stochastic Processes
10.
J Hum Genet ; 60(4): 199-202, 2015 Apr.
Article in English | MEDLINE | ID: mdl-25589041

ABSTRACT

Agnathia-otocephaly complex is a malformation characterized by absent/hypoplastic mandible and abnormally positioned ears. Mutations in two genes, PRRX1 and OTX2, have been described in a small number of families with this disorder. We performed clinical and genetic testing in an additional family. The proband is a healthy female with a complicated pregnancy history that includes two offspring diagnosed with agnathia-otocephaly during prenatal ultrasound scans. Exome sequencing was performed in fetal DNA from one of these two offspring revealing a heterozygous duplication in OTX2: c.271_273dupCAG, p.(Gln91dup). This change leads to the insertion of a glutamine within the OTX2 homeodomain region, and is predicted to alter this signaling molecule's ability to interact with DNA. The same variant was also identified in the proband's clinically unaffected 38-year-old husband and their 9-year-old daughter, who presented with a small mandible, normal ears and velopharyngeal insufficiency due to a short hemi-palate. This unusual presentation of OTX2-related disease suggests that OTX2 might have a role in palatal hypoplasia cases. A previously unreported OTX2 variant associated with extreme intrafamilial variability is described and the utility of exome sequencing as a tool to confirm the diagnosis of agnathia-otocephaly and to inform the reproductive decisions of affected families is highlighted.


Subject(s)
Abnormalities, Multiple/genetics , Gene Duplication , Otx Transcription Factors/genetics , Reading Frames , Velopharyngeal Insufficiency/genetics , Abnormalities, Multiple/diagnosis , Adult , Child , Female , Genetic Association Studies , Heterozygote , Humans , Male , Models, Molecular , Mutation , Otx Transcription Factors/chemistry , Pedigree , Phenotype , Protein Conformation , Velopharyngeal Insufficiency/diagnosis
11.
Sci Rep ; 14(1): 9199, 2024 04 22.
Article in English | MEDLINE | ID: mdl-38649399

ABSTRACT

The distinctive nature of cancer as a disease prompts an exploration of the special characteristics the genes implicated in cancer exhibit. The identification of cancer-associated genes and their characteristics is crucial to further our understanding of this disease and enhanced likelihood of therapeutic drug targets success. However, the rate at which cancer genes are being identified experimentally is slow. Applying predictive analysis techniques, through the building of accurate machine learning models, is potentially a useful approach in enhancing the identification rate of these genes and their characteristics. Here, we investigated gene essentiality scores and found that they tend to be higher for cancer-associated genes compared to other protein-coding human genes. We built a dataset of extended gene properties linked to essentiality and used it to train a machine-learning model; this model reached 89% accuracy and > 0.85 for the Area Under Curve (AUC). The model showed that essentiality, evolutionary-related properties, and properties arising from protein-protein interaction networks are particularly effective in predicting cancer-associated genes. We were able to use the model to identify potential candidate genes that have not been previously linked to cancer. Prioritising genes that score highly by our methods could aid scientists in their cancer genes research.


Subject(s)
Genes, Essential , Machine Learning , Neoplasms , Humans , Neoplasms/genetics , Protein Interaction Maps/genetics , Evolution, Molecular , Computational Biology/methods
12.
Bioinformatics ; 28(1): 48-55, 2012 Jan 01.
Article in English | MEDLINE | ID: mdl-22039210

ABSTRACT

MOTIVATION: Recent large-scale studies of individuals within a population have demonstrated that there is widespread variation in copy number in many gene families. In addition, there is increasing evidence that the variation in gene copy number can give rise to substantial phenotypic effects. In some cases, these variations have been shown to be adaptive. These observations show that a full understanding of the evolution of biological function requires an understanding of gene gain and gene loss. Accurate, robust evolutionary models of gain and loss events are, therefore, required. RESULTS: We have developed weighted parsimony and maximum likelihood methods for inferring gain and loss events. To test these methods, we have used Markov models of gain and loss to simulate data with known properties. We examine three models: a simple birth-death model, a single rate model and a birth-death innovation model with parameters estimated from Drosophila genome data. We find that for all simulations maximum likelihood-based methods are very accurate for reconstructing the number of duplication events on the phylogenetic tree, and that maximum likelihood and weighted parsimony have similar accuracy for reconstructing the ancestral state. Our implementations are robust to different model parameters and provide accurate inferences of ancestral states and the number of gain and loss events. For ancestral reconstruction, we recommend weighted parsimony because it has similar accuracy to maximum likelihood, but is much faster. For inferring the number of individual gene loss or gain events, maximum likelihood is noticeably more accurate, albeit at greater computational cost. AVAILABILITY: www.bioinf.manchester.ac.uk/dupliphy CONTACT: simon.lovell@manchester.ac.uk; simon.whelan@manchester.ac.uk SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Subject(s)
Drosophila/genetics , Evolution, Molecular , Models, Genetic , Animals , Computer Simulation , Drosophila/classification , Genome, Insect , Likelihood Functions , Markov Chains
13.
BMC Evol Biol ; 12: 238, 2012 Dec 06.
Article in English | MEDLINE | ID: mdl-23217198

ABSTRACT

BACKGROUND: The correlation of genetic distances between pairs of protein sequence alignments has been used to infer protein-protein interactions. It has been suggested that these correlations are based on the signal of co-evolution between interacting proteins. However, although mutations in different proteins associated with maintaining an interaction clearly occur (particularly in binding interfaces and neighbourhoods), many other factors contribute to correlated rates of sequence evolution. Proteins in the same genome are usually linked by shared evolutionary history and so it would be expected that there would be topological similarities in their phylogenetic trees, whether they are interacting or not. For this reason the underlying species tree is often corrected for. Moreover processes such as expression level, are known to effect evolutionary rates. However, it has been argued that the correlated rates of evolution used to predict protein interaction explicitly includes shared evolutionary history; here we test this hypothesis. RESULTS: In order to identify the evolutionary mechanisms giving rise to the correlations between interaction proteins, we use phylogenetic methods to distinguish similarities in tree topologies from similarities in genetic distances. We use a range of datasets of interacting and non-interacting proteins from Saccharomyces cerevisiae. We find that the signal of correlated evolution between interacting proteins is predominantly a result of shared evolutionary rates, rather than similarities in tree topology, independent of evolutionary divergence. CONCLUSIONS: Since interacting proteins do not have tree topologies that are more similar than the control group of non-interacting proteins, it is likely that coevolution does not contribute much to, if any, of the observed correlations.


Subject(s)
Evolution, Molecular , Phylogeny , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/metabolism , Chi-Square Distribution , Databases, Protein , Genetic Variation , Genome, Fungal/genetics , Protein Binding , Saccharomyces cerevisiae/genetics , Saccharomyces cerevisiae Proteins/classification , Saccharomyces cerevisiae Proteins/genetics
14.
Proteins ; 80(2): 490-504, 2012 Feb.
Article in English | MEDLINE | ID: mdl-22095594

ABSTRACT

In fragment-assembly techniques for protein structure prediction, models of protein structure are assembled from fragments of known protein structures. This process is typically guided by a knowledge-based energy function and uses a heuristic optimization method. The fragments play two important roles in this process: they define the set of structural parameters available, and they also assume the role of the main variation operators that are used by the optimiser. Previous analysis has typically focused on the first of these roles. In particular, the relationship between local amino acid sequence and local protein structure has been studied by a range of authors. The correlation between the two has been shown to vary with the window length considered, and the results of these analyses have informed directly the choice of fragment length in state-of-the-art prediction techniques. Here, we focus on the second role of fragments and aim to determine the effect of fragment length from an optimization perspective. We use theoretical analyses to reveal how the size and structure of the search space changes as a function of insertion length. Furthermore, empirical analyses are used to explore additional ways in which the size of the fragment insertion influences the search both in a simulation model and for the fragment-assembly technique, Rosetta.


Subject(s)
Models, Molecular , Peptide Fragments/chemistry , Proteins/chemistry , Algorithms , Markov Chains , Protein Conformation
15.
Mol Biol Evol ; 28(12): 3331-44, 2011 Dec.
Article in English | MEDLINE | ID: mdl-21693437

ABSTRACT

Evolution of new cellular functions can be achieved both by changes in protein coding sequences and by alteration of expression patterns. Variation of expression may lead to changes in cellular function with relatively little change in genomic sequence. We therefore hypothesize that one of the first signals of functional divergence should be evolution of transcription factor-binding sites (TFBSs). This adaptation should be detectable as substantial variation in the TFBSs of alleles. New data sets allow the first analyses of intraspecies variation from large number of whole-genome sequences. Using data from the Saccharomyces Genome Resequencing Project, we have analyzed variation in TFBSs. We find a large degree of variation both between these closely related strains and between pairs of duplicated genes. There is a correlation between changes in promoter regions and changes in coding sequences, indicating a coupling of changes in expression and function. We show that 1) the types genes with diverged promoters vary between strains from different environments and 2) that patterns of divergence in promoters consistent with positive selection are detectable in alleles between strains and on duplicate promoters. This variation is likely to reflect adaptation to each strain's natural environment. We conclude that, even within a species, we detect signs of selection acting on promoter regions that may act to alter expression patterns. These changes may indicate functional innovation in multiple genes and across the whole genome. Change in function could represent adaptation to the environment and be a precursor to speciation.


Subject(s)
Adaptation, Biological/genetics , Saccharomyces cerevisiae/genetics , Transcription Factors/chemistry , Transcription Factors/metabolism , Base Sequence , Binding Sites/genetics , Biological Evolution , Evolution, Molecular , Genes, Fungal , Genetic Variation , Genome, Fungal , Open Reading Frames/genetics , Promoter Regions, Genetic , Protein Binding/genetics , Regulatory Sequences, Nucleic Acid/genetics , Saccharomyces cerevisiae/classification , Saccharomyces cerevisiae/metabolism , Transcription Factors/genetics , Transcription, Genetic
16.
J Mol Diagn ; 24(12): 1232-1239, 2022 12.
Article in English | MEDLINE | ID: mdl-36191840

ABSTRACT

Small in-frame insertion-deletion (indel) variants are a common form of genomic variation whose impact on rare disease phenotypes has been understudied. The prediction of the pathogenicity of such variants remains challenging. X-linked incomplete congenital stationary night blindness type 2 (CSNB2) is a nonprogressive, inherited retinal disorder caused by variants in CACNA1F, encoding the Cav1.4α1 channel protein. Here, structural analysis was used through homology modeling to interpret 10 disease-correlated and 10 putatively benign CACNA1F in-frame indel variants. CSNB2-correlated changes were found to be more highly conserved compared with putative benign variants. Notably, all 10 disease-correlated variants but none of the benign changes were within modeled regions of the protein. Structural analysis revealed that disease-correlated variants are predicted to destabilize the structure and function of the Cav1.4α1 channel protein. Overall, the use of structural information to interpret the consequences of in-frame indel variants provides an important adjunct that can improve the diagnosis for individuals with CSNB2.


Subject(s)
Eye Diseases, Hereditary , Night Blindness , Humans , Virulence , Calcium Channels, L-Type/genetics , Night Blindness/genetics , Night Blindness/metabolism , Eye Diseases, Hereditary/genetics , Eye Diseases, Hereditary/metabolism , Mutation
17.
J Biol Chem ; 285(28): 21600-6, 2010 Jul 09.
Article in English | MEDLINE | ID: mdl-20430899

ABSTRACT

Kar2p, an essential Hsp70 chaperone in the endoplasmic reticulum of Saccharomyces cerevisiae, facilitates the transport and folding of nascent polypeptides within the endoplasmic reticulum lumen. The chaperone activity of Kar2p is regulated by its intrinsic ATPase activity that can be stimulated by two different nucleotide exchange factors, namely Sil1p and Lhs1p. Here, we demonstrate that the binding requirements for Lhs1p are complex, requiring both the nucleotide binding domain plus the linker domain of Kar2p. In contrast, the IIB domain of Kar2p is sufficient for binding of Sil1p, and point mutations within IIB specifically blocked Sil1p-dependent activation while remaining competent for activation by Lhs1p. Taken together, these results demonstrate that the interactions between Kar2p and its two nucleotide exchange factors can be functionally resolved and are thus mechanistically distinct.


Subject(s)
Fungal Proteins/metabolism , Gene Expression Regulation, Fungal , HSP70 Heat-Shock Proteins/metabolism , Membrane Transport Proteins/metabolism , Saccharomyces cerevisiae Proteins/metabolism , Saccharomyces cerevisiae/genetics , Adenosine Triphosphatases/metabolism , Amino Acid Sequence , Glutathione Transferase/metabolism , Models, Biological , Molecular Sequence Data , Mutation , Protein Binding , Protein Conformation , Protein Structure, Tertiary , Saccharomyces cerevisiae/metabolism , Sequence Homology, Amino Acid
18.
Mol Biol Evol ; 27(11): 2567-75, 2010 Nov.
Article in English | MEDLINE | ID: mdl-20551042

ABSTRACT

Protein-protein interactions effectively mediate molecular function. They are the result of specific interactions between protein interfaces and are maintained by the action of evolutionary pressure on the regions of the interacting proteins that contribute to binding. For the most part, selection restricts amino acid replacements, accounting for the conservation of binding interfaces. However, in some cases, change in one protein will be mitigated by compensatory change in its binding partner, maintaining function in the face of evolutionary change. There have been several attempts to use correlations in sequence evolution to predict interactions of proteins. Most commonly, these approaches use the entire sequence to identify correlations and so infer probable binding. However, other factors such as shared evolutionary history and similarities in the rates of evolution confound these whole-sequence-based approaches. Here, we discuss recent work on this topic and argue that both site-specific coevolutionary change and whole-sequence evolution contribute to evolutionary signals in sets of interacting proteins. We discuss the relative effects of both types of selection and how they might be identified. This permits an integrated view of protein-protein interactions, their evolution, and coevolution.


Subject(s)
Evolution, Molecular , Proteins/genetics , Proteins/metabolism , Amino Acid Sequence , Animals , Humans , Molecular Sequence Data , Phylogeny , Protein Binding/genetics , Proteins/chemistry
19.
J Virol ; 84(24): 12995-3003, 2010 Dec.
Article in English | MEDLINE | ID: mdl-20881050

ABSTRACT

The high rate of HIV-1 evolution contributes to immune escape, enables the virus to escape drug therapy, and may underlie the difficulty of producing an effective vaccine. Identifying constraints on HIV evolution is therefore of prime importance. To investigate this problem, we examined the relationships between sequence diversity, selection, and protein structure. We found that while there was an increase in sequence diversity over time, this variation had a tendency to be limited to specific structural regions. When individual sites were analyzed, there was, in contrast, substantial and widespread evolutionary constraint over gag and env. This constraint was present even in the highly variable envelope proteins. The evolutionary significance of an individual site is indicated by the change in selection pressure along the time course: increasing entropy indicates that the site is evolving predominantly in a more "clock"-like manner, low entropy values with no increase indicate a high degree of constraint, and high entropy values indicate a lack of constraint. Few sites display high degrees of turnover. Mapping these sites onto the three-dimensional protein structure, we found a significant difference between evolutionary rates for regions buried in the core of the protein and those on the surface. This constraint did not change over the time period analyzed and was not subtype dependent, as similar results were found for subtypes B and C. This link between sequence and structure not only demonstrates the limits of recent HIV-1 evolution but also highlights the origins of evolutionary constraint on viral change.


Subject(s)
Evolution, Molecular , Genetic Variation , HIV Infections/genetics , HIV-1/genetics , env Gene Products, Human Immunodeficiency Virus/chemistry , gag Gene Products, Human Immunodeficiency Virus/chemistry , HIV Infections/virology , HIV-1/chemistry , Humans , Phylogeny , Selection, Genetic , Sequence Analysis, DNA , env Gene Products, Human Immunodeficiency Virus/genetics , gag Gene Products, Human Immunodeficiency Virus/genetics
20.
Mol Biol Evol ; 26(5): 1055-65, 2009 May.
Article in English | MEDLINE | ID: mdl-19193735

ABSTRACT

The complex constraints imposed by protein structure and function result in varied rates of sequence and structural divergence in proteins. Analysis of sequence differences between homologous proteins can advance our understanding of structural divergence and some of the constraints that govern the evolution of these molecules. Here, we assess the relationship between amino acid sequence and structural divergence. Firstly, we demonstrate that the relationship between protein sequence and structural divergence is governed by a variety of evolutionary constraints, including solvent exposure and secondary structure. Secondly, although compensatory substitutions are widespread, we find many radical size-changing mutations that are not compensated by neighboring complementary changes. Instead, these noncompensated substitutions are mitigated by alteration of protein structure. These results suggest a combined mechanism of accommodating substitutions in proteins, involving both coevolution and structural accommodation. Such a mechanism can explain previously observed correlated substitutions of residues that are distant both in sequence and structure, allowing an integrated view of sequence and structural divergence of proteins.


Subject(s)
Evolution, Molecular , Genetic Variation , Proteins/chemistry , Amino Acid Sequence , Amino Acid Substitution , Amino Acids/chemistry , Molecular Sequence Data , Mutation/genetics , Protein Structure, Secondary , Sequence Homology, Amino Acid
SELECTION OF CITATIONS
SEARCH DETAIL