Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 29
Filtrar
1.
Mol Biol Evol ; 38(2): 368-379, 2021 01 23.
Artículo en Inglés | MEDLINE | ID: mdl-32871012

RESUMEN

The distribution of fitness effects (DFEs) of new mutations across different environments quantifies the potential for adaptation in a given environment and its cost in others. So far, results regarding the cost of adaptation across environments have been mixed, and most studies have sampled random mutations across different genes. Here, we quantify systematically how costs of adaptation vary along a large stretch of protein sequence by studying the distribution of fitness effects of the same ≈2,300 amino-acid changing mutations obtained from deep mutational scanning of 119 amino acids in the middle domain of the heat shock protein Hsp90 in five environments. This region is known to be important for client binding, stabilization of the Hsp90 dimer, stabilization of the N-terminal-Middle and Middle-C-terminal interdomains, and regulation of ATPase-chaperone activity. Interestingly, we find that fitness correlates well across diverse stressful environments, with the exception of one environment, diamide. Consistent with this result, we find little cost of adaptation; on average only one in seven beneficial mutations is deleterious in another environment. We identify a hotspot of beneficial mutations in a region of the protein that is located within an allosteric center. The identified protein regions that are enriched in beneficial, deleterious, and costly mutations coincide with residues that are involved in the stabilization of Hsp90 interdomains and stabilization of client-binding interfaces, or residues that are involved in ATPase-chaperone activity of Hsp90. Thus, our study yields information regarding the role and adaptive potential of a protein sequence that complements and extends known structural information.


Asunto(s)
Adaptación Biológica , Interacción Gen-Ambiente , Aptitud Genética , Proteínas HSP90 de Choque Térmico/genética , Mutación , Saccharomyces cerevisiae
2.
Mol Biol Evol ; 36(4): 798-810, 2019 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-30721995

RESUMEN

The evolution of HIV-1 protein sequences should be governed by a combination of factors including nucleotide mutational probabilities, the genetic code, and fitness. The impact of these factors on protein sequence evolution is interdependent, making it challenging to infer the individual contribution of each factor from phylogenetic analyses alone. We investigated the protein sequence evolution of HIV-1 by determining an experimental fitness landscape of all individual amino acid changes in protease. We compared our experimental results to the frequency of protease variants in a publicly available data set of 32,163 sequenced isolates from drug-naïve individuals. The most common amino acids in sequenced isolates supported robust experimental fitness, indicating that the experimental fitness landscape captured key features of selection acting on protease during viral infections of hosts. Amino acid changes requiring multiple mutations from the likely ancestor were slightly less likely to support robust experimental fitness than single mutations, consistent with the genetic code favoring chemically conservative amino acid changes. Amino acids that were common in sequenced isolates were predominantly accessible by single mutations from the likely protease ancestor. Multiple mutations commonly observed in isolates were accessible by mutational walks with highly fit single mutation intermediates. Our results indicate that the prevalence of multiple-base mutations in HIV-1 protease is strongly influenced by mutational sampling.


Asunto(s)
Evolución Molecular , Proteasa del VIH/genética , VIH-1/genética , Mutación Puntual , Código Genético , Selección Genética
3.
J Virol ; 93(2)2019 01 15.
Artículo en Inglés | MEDLINE | ID: mdl-30381484

RESUMEN

Influenza A virus (IAV), a major cause of human morbidity and mortality, continuously evolves in response to selective pressures. Stem-directed, broadly neutralizing antibodies (sBnAbs) targeting the influenza virus hemagglutinin (HA) are a promising therapeutic strategy, but neutralization escape mutants can develop. We used an integrated approach combining viral passaging, deep sequencing, and protein structural analyses to define escape mutations and mechanisms of neutralization escape in vitro for the F10 sBnAb. IAV was propagated with escalating concentrations of F10 over serial passages in cultured cells to select for escape mutations. Viral sequence analysis revealed three mutations in HA and one in neuraminidase (NA). Introduction of these specific mutations into IAV through reverse genetics confirmed their roles in resistance to F10. Structural analyses revealed that the selected HA mutations (S123G, N460S, and N203V) are away from the F10 epitope but may indirectly impact influenza virus receptor binding, endosomal fusion, or budding. The NA mutation E329K, which was previously identified to be associated with antibody escape, affects the active site of NA, highlighting the importance of the balance between HA and NA function for viral survival. Thus, whole-genome population sequencing enables the identification of viral resistance mutations responding to antibody-induced selective pressure.IMPORTANCE Influenza A virus is a public health threat for which currently available vaccines are not always effective. Broadly neutralizing antibodies that bind to the highly conserved stem region of the influenza virus hemagglutinin (HA) can neutralize many influenza virus strains. To understand how influenza virus can become resistant or escape such antibodies, we propagated influenza A virus in vitro with escalating concentrations of antibody and analyzed viral populations by whole-genome sequencing. We identified HA mutations near and distal to the antibody binding epitope that conferred resistance to antibody neutralization. Additionally, we identified a neuraminidase (NA) mutation that allowed the virus to grow in the presence of high concentrations of the antibody. Virus carrying dual mutations in HA and NA also grew under high antibody concentrations. We show that NA mutations mediate the escape of neutralization by antibodies against HA, highlighting the importance of a balance between HA and NA for optimal virus function.


Asunto(s)
Farmacorresistencia Viral , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Subtipo H1N1 del Virus de la Influenza A/genética , Mutación , Neuraminidasa/genética , Animales , Anticuerpos Neutralizantes/farmacología , Anticuerpos Antivirales/farmacología , Perros , Glicoproteínas Hemaglutininas del Virus de la Influenza/química , Subtipo H1N1 del Virus de la Influenza A/efectos de los fármacos , Vacunas contra la Influenza , Células de Riñón Canino Madin Darby , Modelos Moleculares , Neuraminidasa/química , Pruebas de Neutralización , Genética Inversa , Análisis de Secuencia de ARN , Proteínas Virales/química , Proteínas Virales/genética
4.
Mol Biol Evol ; 35(1): 211-224, 2018 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-29106597

RESUMEN

Prokaryotes evolved to thrive in an extremely diverse set of habitats, and their proteomes bear signatures of environmental conditions. Although correlations between amino acid usage and environmental temperature are well-documented, understanding of the mechanisms of thermal adaptation remains incomplete. Here, we couple the energetic costs of protein folding and protein homeostasis to build a microscopic model explaining both the overall amino acid composition and its temperature trends. Low biosynthesis costs lead to low diversity of physical interactions between amino acid residues, which in turn makes proteins less stable and drives up chaperone activity to maintain appropriate levels of folded, functional proteins. Assuming that the cost of chaperone activity is proportional to the fraction of unfolded client proteins, we simulated thermal adaptation of model proteins subject to minimization of the total cost of amino acid synthesis and chaperone activity. For the first time, we predicted both the proteome-average amino acid abundances and their temperature trends simultaneously, and found strong correlations between model predictions and 402 genomes of bacteria and archaea. The energetic constraint on protein evolution is more apparent in highly expressed proteins, selected by codon adaptation index. We found that in bacteria, highly expressed proteins are similar in composition to thermophilic ones, whereas in archaea no correlation between predicted expression level and thermostability was observed. At the same time, thermal adaptations of highly expressed proteins in bacteria and archaea are nearly identical, suggesting that universal energetic constraints prevail over the phylogenetic differences between these domains of life.


Asunto(s)
Adaptación Fisiológica/fisiología , Células Procariotas/metabolismo , Proteostasis/fisiología , Aclimatación/genética , Aclimatación/fisiología , Adaptación Fisiológica/genética , Aminoácidos/genética , Archaea/genética , Archaea/metabolismo , Bacterias/genética , Bacterias/metabolismo , Evolución Biológica , Codón/metabolismo , Simulación por Computador , Evolución Molecular , Calor , Filogenia , Células Procariotas/fisiología , Proteoma/genética , Temperatura
5.
PLoS Genet ; 10(2): e1004185, 2014 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-24586206

RESUMEN

The challenge of distinguishing genetic drift from selection remains a central focus of population genetics. Time-sampled data may provide a powerful tool for distinguishing these processes, and we here propose approximate Bayesian, maximum likelihood, and analytical methods for the inference of demography and selection from time course data. Utilizing these novel statistical and computational tools, we evaluate whole-genome datasets of an influenza A H1N1 strain in the presence and absence of oseltamivir (an inhibitor of neuraminidase) collected at thirteen time points. Results reveal a striking consistency amongst the three estimation procedures developed, showing strongly increased selection pressure in the presence of drug treatment. Importantly, these approaches re-identify the known oseltamivir resistance site, successfully validating the approaches used. Enticingly, a number of previously unknown variants have also been identified as being positively selected. Results are interpreted in the light of Fisher's Geometric Model, allowing for a quantification of the increased distance to optimum exerted by the presence of drug, and theoretical predictions regarding the distribution of beneficial fitness effects of contending mutations are empirically tested. Further, given the fit to expectations of the Geometric Model, results suggest the ability to predict certain aspects of viral evolution in response to changing host environments and novel selective pressures.


Asunto(s)
Farmacorresistencia Viral/genética , Genética de Población , Subtipo H1N1 del Virus de la Influenza A/genética , Selección Genética , Teorema de Bayes , Flujo Genético , Humanos , Subtipo H1N1 del Virus de la Influenza A/efectos de los fármacos , Gripe Humana/genética , Gripe Humana/virología , Mutación , Oseltamivir/farmacología
6.
Mol Biol Evol ; 32(6): 1519-32, 2015 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-25713211

RESUMEN

Influenza A virus (IAV) has a segmented genome that allows for the exchange of genome segments between different strains. This reassortment accelerates evolution by breaking linkage, helping IAV cross species barriers to potentially create highly virulent strains. Challenges associated with monitoring the process of reassortment in molecular detail have limited our understanding of its evolutionary implications. We applied a novel deep sequencing approach with quantitative analysis to assess the in vitro temporal evolution of genomic reassortment in IAV. The combination of H1N1 and H3N2 strains reproducibly generated a new H1N2 strain with the hemagglutinin and nucleoprotein segments originating from H1N1 and the remaining six segments from H3N2. By deep sequencing the entire viral genome, we monitored the evolution of reassortment, quantifying the relative abundance of all IAV genome segments from the two parent strains over time and measuring the selection coefficients of the reassorting segments. Additionally, we observed several mutations coemerging with reassortment that were not found during passaging of pure parental IAV strains. Our results demonstrate how reassortment of the segmented genome can accelerate viral evolution in IAV, potentially enabled by the emergence of a small number of individual mutations.


Asunto(s)
Alphainfluenzavirus/genética , Genoma Viral , Virus Reordenados/genética , Selección Genética , Animales , Biología Computacional , Perros , Evolución Molecular , Frecuencia de los Genes , Genotipo , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Secuenciación de Nucleótidos de Alto Rendimiento , Subtipo H1N1 del Virus de la Influenza A/genética , Subtipo H1N2 del Virus de la Influenza A/genética , Subtipo H3N2 del Virus de la Influenza A/genética , Límite de Detección , Células de Riñón Canino Madin Darby , Nucleoproteínas/genética , Análisis de Secuencia de ARN
7.
PLoS Genet ; 9(6): e1003600, 2013 Jun.
Artículo en Inglés | MEDLINE | ID: mdl-23825969

RESUMEN

In natural systems, selection acts on both protein sequence and expression level, but it is unclear how selection integrates over these two dimensions. We recently developed the EMPIRIC approach to systematically determine the fitness effects of all possible point mutants for important regions of essential genes in yeast. Here, we systematically investigated the fitness effects of point mutations in a putative substrate binding loop of yeast Hsp90 (Hsp82) over a broad range of expression strengths. Negative epistasis between reduced expression strength and amino acid substitutions was common, and the endogenous expression strength frequently obscured mutant defects. By analyzing fitness effects at varied expression strengths, we were able to uncover all mutant effects on function. The majority of mutants caused partial functional defects, consistent with this region of Hsp90 contributing to a mutation sensitive and critical process. These results demonstrate that important functional regions of proteins can tolerate mutational defects without experimentally observable impacts on fitness.


Asunto(s)
Aptitud Genética , Proteínas HSP90 de Choque Térmico/genética , Proteínas de Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/genética , Selección Genética/genética , Secuencia de Aminoácidos , Sustitución de Aminoácidos/genética , Regulación Fúngica de la Expresión Génica , Mutación , Mutación Puntual/genética , Especificidad por Sustrato
8.
J Virol ; 88(1): 272-81, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24155392

RESUMEN

Influenza A virus (IAV) is a major cause of morbidity and mortality throughout the world. Current antiviral therapies include oseltamivir, a neuraminidase inhibitor that prevents the release of nascent viral particles from infected cells. However, the IAV genome can evolve rapidly, and oseltamivir resistance mutations have been detected in numerous clinical samples. Using an in vitro evolution platform and whole-genome population sequencing, we investigated the population genomics of IAV during the development of oseltamivir resistance. Strain A/Brisbane/59/2007 (H1N1) was grown in Madin-Darby canine kidney cells with or without escalating concentrations of oseltamivir over serial passages. Following drug treatment, the H274Y resistance mutation fixed reproducibly within the population. The presence of the H274Y mutation in the viral population, at either a low or a high frequency, led to measurable changes in the neuraminidase inhibition assay. Surprisingly, fixation of the resistance mutation was not accompanied by alterations of viral population diversity or differentiation, and oseltamivir did not alter the selective environment. While the neighboring K248E mutation was also a target of positive selection prior to H274Y fixation, H274Y was the primary beneficial mutation in the population. In addition, once evolved, the H274Y mutation persisted after the withdrawal of the drug, even when not fixed in viral populations. We conclude that only selection of H274Y is required for oseltamivir resistance and that H274Y is not deleterious in the absence of the drug. These collective results could offer an explanation for the recent reproducible rise in oseltamivir resistance in seasonal H1N1 IAV strains in humans.


Asunto(s)
Antivirales/farmacología , Farmacorresistencia Viral/genética , Evolución Molecular , Genoma Viral , Subtipo H1N1 del Virus de la Influenza A/genética , Oseltamivir/farmacología , Animales , Línea Celular , Perros , Ensayos Analíticos de Alto Rendimiento , Técnicas In Vitro , Subtipo H1N1 del Virus de la Influenza A/crecimiento & desarrollo , Concentración 50 Inhibidora , Mutación , Ensayo de Placa Viral
9.
J Chem Phys ; 143(5): 055101, 2015 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-26254668

RESUMEN

Evolution of proteins in bacteria and archaea living in different conditions leads to significant correlations between amino acid usage and environmental temperature. The origins of these correlations are poorly understood, and an important question of protein theory, physics-based prediction of types of amino acids overrepresented in highly thermostable proteins, remains largely unsolved. Here, we extend the random energy model of protein folding by weighting the interaction energies of amino acids by their frequencies in protein sequences and predict the energy gap of proteins designed to fold well at elevated temperatures. To test the model, we present a novel scalable algorithm for simultaneous energy calculation for many sequences in many structures, targeting massively parallel computing architectures such as graphics processing unit. The energy calculation is performed by multiplying two matrices, one representing the complete set of sequences, and the other describing the contact maps of all structural templates. An implementation of the algorithm for the CUDA platform is available at http://www.github.com/kzeldovich/galeprot and calculates protein folding energies over 250 times faster than a single central processing unit. Analysis of amino acid usage in 64-mer cubic lattice proteins designed to fold well at different temperatures demonstrates an excellent agreement between theoretical and simulated values of energy gap. The theoretical predictions of temperature trends of amino acid frequencies are significantly correlated with bioinformatics data on 191 bacteria and archaea, and highlight protein folding constraints as a fundamental selection pressure during thermal adaptation in biological evolution.


Asunto(s)
Adaptación Fisiológica , Modelos Moleculares , Proteínas/metabolismo , Temperatura , Algoritmos , Evolución Molecular , Pliegue de Proteína , Proteínas/química , Factores de Tiempo
10.
Phys Rev Lett ; 110(9): 098104, 2013 Mar 01.
Artículo en Inglés | MEDLINE | ID: mdl-23496749

RESUMEN

The genome of influenza A virus consists of eight separate RNA segments, which are selectively packaged into virions prior to virus budding. The microscopic mechanism of highly selective packaging involves molecular interactions between packaging signals in the genome segments and remains poorly understood. We propose that the condition of proper packaging can be formulated as a large gap between RNA-RNA interaction energies in the viable virion with eight unique segments and in improperly packed assemblages lacking the complete genome. We then demonstrate that selective packaging of eight unique segments into an infective influenza virion can be achieved by self-repulsion of identical segments at the virion assembly stage, rather than by previously hypothesized intricate molecular recognition of particular segments. Using Monte Carlo simulations to maximize the energy gap, without any other assumptions, we generated model eight-segment virions, which all display specific packaging, strong self-repulsion of the segments, and reassortment patterns similar to natural influenza. The model provides a biophysical foundation of influenza genome packaging and reassortment and serves as an important step towards robust sequence-driven prediction of reassortment patterns of the influenza virus.


Asunto(s)
Genoma Viral , Virus de la Influenza A/química , Virus de la Influenza A/genética , Modelos Genéticos , ARN Viral/química , ARN Viral/genética , Termodinámica , Virión/química , Virión/genética
11.
Vaccine ; 41(12): 2022-2034, 2023 03 17.
Artículo en Inglés | MEDLINE | ID: mdl-36803901

RESUMEN

Seasonal influenza remains a serious public health concern as the viral infection spreads easily from person to person and due to antigenic drift of neutralizing epitopes. Vaccination is the best method for disease prevention, however current seasonal influenza vaccines stimulate antibodies which are often effective against only antigenically similar strains. To boost the immune responses and increase vaccine effectiveness, adjuvants have been used for the past 20 years. The current study explores the use of oil-in-water adjuvant, AF03 to improve an immunogenicity of 2 licensed vaccines. A standard-dose inactivated quadrivalent influenza vaccine (IIV4-SD), containing both hemagglutinin (HA) and neuraminidase (NA) antigens, and recombinant quadrivalent influenza vaccine (RIV4), containing only HA-antigen were adjuvanted with AF03 in naïve BALB/c mouse model. Functional HA-specific antibody titers against all four homologous vaccine strains were enhanced by AF03, indicating potential increase in protective immunity. An increase in HA-specific total immunoglobulin G (IgG) binding titers were detected against homologous HAs, heterologous panel of 30 H3 HAs and seven Influenza B HAs. The neuraminidase inhibition (NAI) activity was significantly higher in IIV4-SD-AF03 group. Use of AF03 adjuvant improved the immune response to two influenza vaccines in a mouse model via an increase in functional and total antibodies against NA and a broad panel of HA-antigens.


Asunto(s)
Vacunas contra la Influenza , Gripe Humana , Infecciones por Orthomyxoviridae , Animales , Ratones , Humanos , Gripe Humana/prevención & control , Hemaglutininas , Neuraminidasa , Estaciones del Año , Anticuerpos Antivirales , Adyuvantes Inmunológicos , Inmunidad , Vacunas de Productos Inactivados , Infecciones por Orthomyxoviridae/prevención & control
12.
Protein Sci ; 29(9): 1911-1923, 2020 09.
Artículo en Inglés | MEDLINE | ID: mdl-32643222

RESUMEN

Protein evolution proceeds by a complex response of organismal fitness to mutations that can simultaneously affect protein stability, structure, and enzymatic activity. To probe the relationship between genotype and phenotype, we chose a fundamental paradigm for protein evolution, folding, and design, the (ßα)8 TIM barrel fold. Here, we demonstrate the role of long-range allosteric interactions in the adaptation of an essential hyperthermophilic TIM barrel enzyme to mesophilic conditions in a yeast host. Beneficial fitness effects observed with single and double mutations of the canonical ßα-hairpin clamps and the α-helical shell distal to the active site revealed an underlying energy network between opposite faces of the cylindrical ß-barrel. We experimentally determined the fitness of multiple mutants in the energetic phase plane, contrasting the energy barrier of the chemical reaction and the folding free energy of the protein. For the system studied, the reaction energy barrier was the primary determinant of organism fitness. Our observations of long-range epistatic interactions uncovered an allosteric pathway in an ancient and ubiquitous enzyme that may provide a novel way of designing proteins with a desired activity and stability profile.


Asunto(s)
Modelos Moleculares , Pliegue de Proteína , Proteínas de Saccharomyces cerevisiae/química , Saccharomyces cerevisiae/enzimología , Regulación Alostérica , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/metabolismo
13.
J Chem Theory Comput ; 16(2): 1284-1299, 2020 Feb 11.
Artículo en Inglés | MEDLINE | ID: mdl-31877249

RESUMEN

Over the past several decades, atomistic simulations of biomolecules, whether carried out using molecular dynamics or Monte Carlo techniques, have provided detailed insights into their function. Comparing the results of such simulations for a few closely related systems has guided our understanding of the mechanisms by which changes such as ligand binding or mutation can alter the function. The general problem of detecting and interpreting such mechanisms from simulations of many related systems, however, remains a challenge. This problem is addressed here by applying supervised and unsupervised machine learning techniques to a variety of thermodynamic observables extracted from molecular dynamics simulations of different systems. As an important test case, these methods are applied to understand the evasion by human immunodeficiency virus type-1 (HIV-1) protease of darunavir, a potent inhibitor to which resistance can develop via the simultaneous mutation of multiple amino acids. Complex mutational patterns have been observed among resistant strains, presenting a challenge to developing a mechanistic picture of resistance in the protease. In order to dissect these patterns and gain mechanistic insight into the role of specific mutations, molecular dynamics simulations were carried out on a collection of HIV-1 protease variants, chosen to include highly resistant strains and susceptible controls, in complex with darunavir. Using a machine learning approach that takes advantage of the hierarchical nature in the relationships among the sequence, structure, and function, an integrative analysis of these trajectories reveals key details of the resistance mechanism, including changes in the protein structure, hydrogen bonding, and protein-ligand contacts.


Asunto(s)
Farmacorresistencia Viral , Proteasa del VIH/metabolismo , VIH-1/enzimología , Ligandos , Aprendizaje Automático , Proteasa del VIH/química , Proteasa del VIH/genética , Inhibidores de la Proteasa del VIH/química , Inhibidores de la Proteasa del VIH/metabolismo , Humanos , Enlace de Hidrógeno , Simulación de Dinámica Molecular , Método de Montecarlo , Mutación , Unión Proteica , Electricidad Estática
14.
J Mol Biol ; 368(3): 884-93, 2007 May 04.
Artículo en Inglés | MEDLINE | ID: mdl-17368671

RESUMEN

An increasing number of proteins are being discovered with a remarkable and somewhat surprising feature, a knot in their native structures. How the polypeptide chain is able to "knot" itself during the folding process to form these highly intricate protein topologies is not known. Here we perform a computational study on the 160-amino-acid homodimeric protein YibK, which, like other proteins in the SpoU family of MTases, contains a deep trefoil knot in its C-terminal region. In this study, we use a coarse-grained C(alpha)-chain representation and Langevin dynamics to study folding kinetics. We find that specific, attractive nonnative interactions are critical for knot formation. In the absence of these interactions, i.e., in an energetics driven entirely by native interactions, knot formation is exceedingly unlikely. Further, we find, in concert with recent experimental data on YibK, two parallel folding pathways that we attribute to an early and a late formation of the trefoil knot, respectively. For both pathways, knot formation occurs before dimerization. A bioinformatics analysis of the SpoU family of proteins reveals further that the critical nonnative interactions may originate from evolutionary conserved hydrophobic segments around the knotted region.


Asunto(s)
Simulación por Computador , Metiltransferasas/química , Modelos Moleculares , Pliegue de Proteína , Secuencia de Aminoácidos , Dimerización , Datos de Secuencia Molecular
15.
PLoS Comput Biol ; 3(1): e5, 2007 Jan 12.
Artículo en Inglés | MEDLINE | ID: mdl-17222055

RESUMEN

There have been considerable attempts in the past to relate phenotypic trait--habitat temperature of organisms--to their genotypes, most importantly compositions of their genomes and proteomes. However, despite accumulation of anecdotal evidence, an exact and conclusive relationship between the former and the latter has been elusive. We present an exhaustive study of the relationship between amino acid composition of proteomes, nucleotide composition of DNA, and optimal growth temperature (OGT) of prokaryotes. Based on 204 complete proteomes of archaea and bacteria spanning the temperature range from -10 degrees C to 110 degrees C, we performed an exhaustive enumeration of all possible sets of amino acids and found a set of amino acids whose total fraction in a proteome is correlated, to a remarkable extent, with the OGT. The universal set is Ile, Val, Tyr, Trp, Arg, Glu, Leu (IVYWREL), and the correlation coefficient is as high as 0.93. We also found that the G + C content in 204 complete genomes does not exhibit a significant correlation with OGT (R = -0.10). On the other hand, the fraction of A + G in coding DNA is correlated with temperature, to a considerable extent, due to codon patterns of IVYWREL amino acids. Further, we found strong and independent correlation between OGT and the frequency with which pairs of A and G nucleotides appear as nearest neighbors in genome sequences. This adaptation is achieved via codon bias. These findings present a direct link between principles of proteins structure and stability and evolutionary mechanisms of thermophylic adaptation. On the nucleotide level, the analysis provides an example of how nature utilizes codon bias for evolutionary adaptation to extreme conditions. Together these results provide a complete picture of how compositions of proteomes and genomes in prokaryotes adjust to the extreme conditions of the environment.


Asunto(s)
Adaptación Fisiológica/genética , Archaea/genética , Proteínas Bacterianas/genética , ADN/genética , Evolución Molecular , Proteoma/genética , Temperatura , Secuencia de Aminoácidos , Mapeo Cromosómico , Secuencia Conservada , ADN/química , Datos de Secuencia Molecular , Análisis de Secuencia de ADN , Análisis de Secuencia de Proteína , Homología de Secuencia
16.
PLoS Comput Biol ; 3(3): e52, 2007 Mar 23.
Artículo en Inglés | MEDLINE | ID: mdl-17381236

RESUMEN

The aim of this work is to elucidate how physical principles of protein design are reflected in natural sequences that evolved in response to the thermal conditions of the environment. Using an exactly solvable lattice model, we design sequences with selected thermal properties. Compositional analysis of designed model sequences and natural proteomes reveals a specific trend in amino acid compositions in response to the requirement of stability at elevated environmental temperature: the increase of fractions of hydrophobic and charged amino acid residues at the expense of polar ones. We show that this "from both ends of the hydrophobicity scale" trend is due to positive (to stabilize the native state) and negative (to destabilize misfolded states) components of protein design. Negative design strengthens specific repulsive non-native interactions that appear in misfolded structures. A pressure to preserve specific repulsive interactions in non-native conformations may result in correlated mutations between amino acids that are far apart in the native state but may be in contact in misfolded conformations. Such correlated mutations are indeed found in TIM barrel and other proteins.


Asunto(s)
Proteínas Arqueales/química , Proteínas Arqueales/ultraestructura , Modelos Químicos , Modelos Moleculares , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Simulación por Computador , Estabilidad de Medicamentos , Interacciones Hidrofóbicas e Hidrofílicas , Datos de Secuencia Molecular , Desnaturalización Proteica , Pliegue de Proteína , Alineación de Secuencia/métodos , Relación Estructura-Actividad , Temperatura
17.
PLoS Comput Biol ; 3(7): e139, 2007 Jul.
Artículo en Inglés | MEDLINE | ID: mdl-17630830

RESUMEN

In this work we develop a microscopic physical model of early evolution where phenotype--organism life expectancy--is directly related to genotype--the stability of its proteins in their native conformations-which can be determined exactly in the model. Simulating the model on a computer, we consistently observe the "Big Bang" scenario whereby exponential population growth ensues as soon as favorable sequence-structure combinations (precursors of stable proteins) are discovered. Upon that, random diversity of the structural space abruptly collapses into a small set of preferred proteins. We observe that protein folds remain stable and abundant in the population at timescales much greater than mutation or organism lifetime, and the distribution of the lifetimes of dominant folds in a population approximately follows a power law. The separation of evolutionary timescales between discovery of new folds and generation of new sequences gives rise to emergence of protein families and superfamilies whose sizes are power-law distributed, closely matching the same distributions for real proteins. On the population level we observe emergence of species--subpopulations that carry similar genomes. Further, we present a simple theory that relates stability of evolving proteins to the sizes of emerging genomes. Together, these results provide a microscopic first-principles picture of how first-gene families developed in the course of early evolution.


Asunto(s)
Evolución Biológica , Genoma/fisiología , Modelos Genéticos , Familia de Multigenes/fisiología , Proteínas/clasificación , Proteínas/genética , Animales , Fenómenos Cronobiológicos/genética , Simulación por Computador , Bases de Datos de Proteínas , Humanos , Esperanza de Vida , Mutación/fisiología , Crecimiento Demográfico , Conformación Proteica , Pliegue de Proteína , Selección Genética , Alineación de Secuencia , Análisis de Secuencia de Proteína , Homología de Secuencia de Aminoácido , Procesos Estocásticos , Relación Estructura-Actividad
18.
J Mol Biol ; 430(8): 1098-1115, 2018 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-29466705

RESUMEN

The fitness effects of synonymous mutations can provide insights into biological and evolutionary mechanisms. We analyzed the experimental fitness effects of all single-nucleotide mutations, including synonymous substitutions, at the beginning of the influenza A virus hemagglutinin (HA) gene. Many synonymous substitutions were deleterious both in bulk competition and for individually isolated clones. Investigating protein and RNA levels of a subset of individually expressed HA variants revealed that multiple biochemical properties contribute to the observed experimental fitness effects. Our results indicate that a structural element in the HA segment viral RNA may influence fitness. Examination of naturally evolved sequences in human hosts indicates a preference for the unfolded state of this structural element compared to that found in swine hosts. Our overall results reveal that synonymous mutations may have greater fitness consequences than indicated by simple models of sequence conservation, and we discuss the implications of this finding for commonly used evolutionary tests and analyses.


Asunto(s)
Aptitud Genética , Glicoproteínas Hemaglutininas del Virus de la Influenza/química , Glicoproteínas Hemaglutininas del Virus de la Influenza/genética , Subtipo H1N1 del Virus de la Influenza A/crecimiento & desarrollo , Mutación Silenciosa , Sustitución de Aminoácidos , Animales , Perros , Evolución Molecular , Células HEK293 , Humanos , Subtipo H1N1 del Virus de la Influenza A/genética , Células de Riñón Canino Madin Darby , Modelos Moleculares , Filogenia , Pliegue del ARN , Porcinos , Replicación Viral
19.
J Mol Biol ; 357(4): 1335-43, 2006 Apr 07.
Artículo en Inglés | MEDLINE | ID: mdl-16483605

RESUMEN

In this work, we discovered a fundamental connection between selection for protein stability and emergence of preferred structures of proteins. Using a standard exact three-dimensional lattice model we evolve sequences starting from random ones and determine the exact native structure after each mutation. Acceptance of mutations is biased to select for stable proteins. We found that certain structures, "wonderfolds", are independently discovered numerous times as native states of stable proteins in many unrelated runs of selection. The strong dependence of lattice fold usage on the structural determinant of designability quantitatively reproduces uneven fold usage in natural proteins. Diversity of sequences that fold into wonderfold structures gives rise to superfamilies, i.e. sets of dissimilar sequences that fold into the same or very similar structures. The present work establishes a model of pre-biotic structure selection, which identifies dominant structural patterns emerging upon optimization of proteins for survival in a hot environment. Convergently discovered pre-biotic initial superfamilies with wonderfold structures could have served as a seed for subsequent biological evolution involving gene duplications and divergence.


Asunto(s)
Evolución Molecular , Conformación Proteica , Pliegue de Proteína , Proteínas/química , Secuencia de Aminoácidos , Simulación por Computador , Modelos Moleculares , Mutación , Proteínas/genética , Proteínas/metabolismo
20.
Nat Commun ; 8: 14614, 2017 03 06.
Artículo en Inglés | MEDLINE | ID: mdl-28262665

RESUMEN

Sequence divergence of orthologous proteins enables adaptation to environmental stresses and promotes evolution of novel functions. Limits on evolution imposed by constraints on sequence and structure were explored using a model TIM barrel protein, indole-3-glycerol phosphate synthase (IGPS). Fitness effects of point mutations in three phylogenetically divergent IGPS proteins during adaptation to temperature stress were probed by auxotrophic complementation of yeast with prokaryotic, thermophilic IGPS. Analysis of beneficial mutations pointed to an unexpected, long-range allosteric pathway towards the active site of the protein. Significant correlations between the fitness landscapes of distant orthologues implicate both sequence and structure as primary forces in defining the TIM barrel fitness landscape and suggest that fitness landscapes can be translocated in sequence space. Exploration of fitness landscapes in the context of a protein fold provides a strategy for elucidating the sequence-structure-fitness relationships in other common motifs.


Asunto(s)
Indol-3-Glicerolfosfato Sintasa/química , Mutación , Sulfolobus solfataricus/química , Thermotoga maritima/química , Thermus thermophilus/química , Secuencia de Aminoácidos , Sitios de Unión , Clonación Molecular , Evolución Molecular , Expresión Génica , Vectores Genéticos/química , Vectores Genéticos/metabolismo , Indol-3-Glicerolfosfato Sintasa/genética , Indol-3-Glicerolfosfato Sintasa/metabolismo , Cinética , Modelos Moleculares , Unión Proteica , Conformación Proteica en Hélice alfa , Conformación Proteica en Lámina beta , Dominios y Motivos de Interacción de Proteínas , Proteínas Recombinantes/química , Proteínas Recombinantes/genética , Proteínas Recombinantes/metabolismo , Saccharomyces cerevisiae/genética , Saccharomyces cerevisiae/metabolismo , Homología Estructural de Proteína , Especificidad por Sustrato , Sulfolobus solfataricus/enzimología , Termodinámica , Thermotoga maritima/enzimología , Thermus thermophilus/enzimología
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA