Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Proc Natl Acad Sci U S A ; 121(3): e2312029121, 2024 Jan 16.
Artigo em Inglês | MEDLINE | ID: mdl-38194446

RESUMO

Understanding natural protein evolution and designing novel proteins are motivating interest in development of high-throughput methods to explore large sequence spaces. In this work, we demonstrate the application of multisite λ dynamics (MSλD), a rigorous free energy simulation method, and chemical denaturation experiments to quantify evolutionary selection pressure from sequence-stability relationships and to address questions of design. This study examines a mesophilic phylogenetic clade of ribonuclease H (RNase H), furthering its extensive characterization in earlier studies, focusing on E. coli RNase H (ecRNH) and a more stable consensus sequence (AncCcons) differing at 15 positions. The stabilities of 32,768 chimeras between these two sequences were computed using the MSλD framework. The most stable and least stable chimeras were predicted and tested along with several other sequences, revealing a designed chimera with approximately the same stability increase as AncCcons, but requiring only half the mutations. Comparing the computed stabilities with experiment for 12 sequences reveals a Pearson correlation of 0.86 and root mean squared error of 1.18 kcal/mol, an unprecedented level of accuracy well beyond less rigorous computational design methods. We then quantified selection pressure using a simple evolutionary model in which sequences are selected according to the Boltzmann factor of their stability. Selection temperatures from 110 to 168 K are estimated in three ways by comparing experimental and computational results to evolutionary models. These estimates indicate selection pressure is high, which has implications for evolutionary dynamics and for the accuracy required for design, and suggests accurate high-throughput computational methods like MSλD may enable more effective protein design.


Assuntos
Escherichia coli , Ribonuclease H , Escherichia coli/genética , Filogenia , Simulação por Computador , Sequência Consenso , Ribonuclease H/genética
2.
Biochemistry ; 60(3): 170-181, 2021 01 26.
Artigo em Inglês | MEDLINE | ID: mdl-33433210

RESUMO

In addition to encoding the tertiary fold and stability, the primary sequence of a protein encodes the folding trajectory and kinetic barriers that determine the speed of folding. How these kinetic barriers are encoded is not well understood. Here, we use evolutionary sequence variation in the α-lytic protease (αLP) protein family to probe the relationship between sequence and energy landscape. αLP has an unusual energy landscape: the native state of αLP is not the most thermodynamically favored conformation and, instead, remains folded due to a large kinetic barrier preventing unfolding. To fold, αLP utilizes an N-terminal pro region similar in size to the protease itself that functions as a folding catalyst. Once folded, the pro region is removed, and the native state does not unfold on a biologically relevant time scale. Without the pro region, αLP folds on the order of millennia. A phylogenetic search uncovers αLP homologs with a wide range of pro region sizes, including some with no pro region at all. In the resulting phylogenetic tree, these homologs cluster by pro region size. By studying homologs naturally lacking a pro region, we demonstrate they can be thermodynamically stable, fold much faster than αLP, yet retain the same fold as αLP. Key amino acids thought to contribute to αLP's extreme kinetic stability are lost in these homologs, supporting their role in kinetic stability. This study highlights how the entire energy landscape plays an important role in determining the evolutionary pressures on the protein sequence.


Assuntos
Proteínas de Bactérias/química , Evolução Molecular , Modelos Moleculares , Filogenia , Dobramento de Proteína , Serina Endopeptidases/química , Proteínas de Bactérias/genética , Estabilidade Enzimática , Cinética , Serina Endopeptidases/genética
3.
Protein Sci ; 33(6): e5011, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38747388

RESUMO

A protein sequence encodes its energy landscape-all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs-questioning the differences between them and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the consensus protein derived from our full Ribonuclease H sequence alignment is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted set of sequences is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order sequence correlations using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.


Assuntos
Evolução Molecular , Ribonuclease H/química , Ribonuclease H/genética , Ribonuclease H/metabolismo , Sequência Consenso , Alinhamento de Sequência , Filogenia , Sequência de Aminoácidos , Modelos Moleculares , Dobramento de Proteína , Conformação Proteica
4.
bioRxiv ; 2023 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-37425932

RESUMO

A protein sequence encodes its energy landscape - all the accessible conformations, energetics, and dynamics. The evolutionary relationship between sequence and landscape can be probed phylogenetically by compiling a multiple sequence alignment of homologous sequences and generating common ancestors via Ancestral Sequence Reconstruction or a consensus protein containing the most common amino acid at each position. Both ancestral and consensus proteins are often more stable than their extant homologs - questioning the differences and suggesting that both approaches serve as general methods to engineer thermostability. We used the Ribonuclease H family to compare these approaches and evaluate how the evolutionary relationship of the input sequences affects the properties of the resulting consensus protein. While the overall consensus protein is structured and active, it neither shows properties of a well-folded protein nor has enhanced stability. In contrast, the consensus protein derived from a phylogenetically-restricted region is significantly more stable and cooperatively folded, suggesting that cooperativity may be encoded by different mechanisms in separate clades and lost when too many diverse clades are combined to generate a consensus protein. To explore this, we compared pairwise covariance scores using a Potts formalism as well as higher-order couplings using singular value decomposition (SVD). We find the SVD coordinates of a stable consensus sequence are close to coordinates of the analogous ancestor sequence and its descendants, whereas the unstable consensus sequences are outliers in SVD space.

5.
J Mol Biol ; 428(1): 153-164, 2016 Jan 16.
Artigo em Inglês | MEDLINE | ID: mdl-26608811

RESUMO

Many bacteria employ a protein organelle, the carboxysome, to catalyze carbon dioxide fixation in the Calvin Cycle. Only 10 genes from Halothiobacillus neapolitanus are sufficient for heterologous expression of carboxysomes in Escherichia coli, opening the door to detailed mechanistic analysis of the assembly process of this complex (more than 200MDa). One of these genes, csoS2, has been implicated in assembly but ascribing a molecular function is confounded by the observation that the single csoS2 gene yields expression of two gene products and both display an apparent molecular weight incongruent with the predicted amino acid sequence. Here, we elucidate the co-translational mechanism responsible for the expression of the two protein isoforms. Specifically, csoS2 was found to possess -1 frameshifting elements that lead to the production of the full-length protein, CsoS2B, and a truncated protein, CsoS2A, which possesses a C-terminus translated from the alternate frame. The frameshifting elements comprise both a ribosomal slippery sequence and a 3' secondary structure, and ablation of either sequence is sufficient to eliminate the slip. Using these mutants, we investigated the individual roles of CsoS2B and CsoS2A on carboxysome formation. In this in vivo formation assay, cells expressing only the CsoS2B isoform were capable of producing intact carboxysomes, while those with only CsoS2A were not. Thus, we have answered a long-standing question about the nature of CsoS2 in this model microcompartment and demonstrate that CsoS2B is functionally distinct from CsoS2A in the assembly of α-carboxysomes.


Assuntos
Proteínas de Bactérias/biossíntese , Proteínas de Bactérias/genética , Mudança da Fase de Leitura do Gene Ribossômico , Regulação Bacteriana da Expressão Gênica , Halothiobacillus/genética , Isoformas de Proteínas/biossíntese , Isoformas de Proteínas/genética , Escherichia coli/genética , Substâncias Macromoleculares/metabolismo , Multimerização Proteica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA