Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 10 de 10
Filtrar
1.
Brief Bioinform ; 25(3)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38557677

RESUMEN

Protein design is central to nearly all protein engineering problems, as it can enable the creation of proteins with new biological functions, such as improving the catalytic efficiency of enzymes. One key facet of protein design, fixed-backbone protein sequence design, seeks to design new sequences that will conform to a prescribed protein backbone structure. Nonetheless, existing sequence design methods present limitations, such as low sequence diversity and shortcomings in experimental validation of the designed functional proteins. These inadequacies obstruct the goal of functional protein design. To improve these limitations, we initially developed the Graphormer-based Protein Design (GPD) model. This model utilizes the Transformer on a graph-based representation of three-dimensional protein structures and incorporates Gaussian noise and a sequence random masks to node features, thereby enhancing sequence recovery and diversity. The performance of the GPD model was significantly better than that of the state-of-the-art ProteinMPNN model on multiple independent tests, especially for sequence diversity. We employed GPD to design CalB hydrolase and generated nine artificially designed CalB proteins. The results show a 1.7-fold increase in catalytic activity compared to that of the wild-type CalB and strong substrate selectivity on p-nitrophenyl acetate with different carbon chain lengths (C2-C16). Thus, the GPD method could be used for the de novo design of industrial enzymes and protein drugs. The code was released at https://github.com/decodermu/GPD.


Asunto(s)
Ingeniería de Proteínas , Proteínas , Proteínas/química , Secuencia de Aminoácidos , Ingeniería de Proteínas/métodos
2.
Bioinformatics ; 40(2)2024 02 01.
Artículo en Inglés | MEDLINE | ID: mdl-38261649

RESUMEN

MOTIVATION: Proteins found in nature represent only a fraction of the vast space of possible proteins. Protein design presents an opportunity to explore and expand this protein landscape. Within protein design, protein sequence design plays a crucial role, and numerous successful methods have been developed. Notably, deep learning-based protein sequence design methods have experienced significant advancements in recent years. However, a comprehensive and systematic comparison and evaluation of these methods have been lacking, with indicators provided by different methods often inconsistent or lacking effectiveness. RESULTS: To address this gap, we have designed a diverse set of indicators that cover several important aspects, including sequence recovery, diversity, root-mean-square deviation of protein structure, secondary structure, and the distribution of polar and nonpolar amino acids. In our evaluation, we have employed an improved weighted inferiority-superiority distance method to comprehensively assess the performance of eight widely used deep learning-based protein sequence design methods. Our evaluation not only provides rankings of these methods but also offers optimization suggestions by analyzing the strengths and weaknesses of each method. Furthermore, we have developed a method to select the best temperature parameter and proposed solutions for the common issue of designing sequences with consecutive repetitive amino acids, which is often encountered in protein design methods. These findings can greatly assist users in selecting suitable protein sequence design methods. Overall, our work contributes to the field of protein sequence design by providing a comprehensive evaluation system and optimization suggestions for different methods.


Asunto(s)
Aprendizaje Profundo , Secuencia de Aminoácidos , Proteínas/química , Aminoácidos/química , Estructura Secundaria de Proteína
3.
Proc Natl Acad Sci U S A ; 119(15): e2120787119, 2022 04 12.
Artículo en Inglés | MEDLINE | ID: mdl-35385357

RESUMEN

T cell acute lymphoblastic leukemia (T-ALL) is an aggressive hematological malignancy of T cell progenitors, known to be a heterogeneous disease in pediatric and adult patients. Here we attempted to better understand the disease at the molecular level based on the transcriptomic landscape of 707 T-ALL patients (510 pediatric, 190 adult patients, and 7 with unknown age; 599 from published cohorts and 108 newly investigated). Leveraging the information of gene expression enabled us to identify 10 subtypes (G1­G10), including the previously undescribed one characterized by GATA3 mutations, with GATA3R276Q capable of affecting lymphocyte development in zebrafish. Through associating with T cell differentiation stages, we found that high expression of LYL1/LMO2/SPI1/HOXA (G1­G6) might represent the early T cell progenitor, pro/precortical/cortical stage with a relatively high age of disease onset, and lymphoblasts with TLX3/TLX1 high expression (G7­G8) could be blocked at the cortical/postcortical stage, while those with high expression of NKX2-1/TAL1/LMO1 (G9­G10) might correspond to cortical/postcortical/mature stages of T cell development. Notably, adult patients harbored more cooperative mutations among epigenetic regulators, and genes involved in JAK-STAT and RAS signaling pathways, with 44% of patients aged 40 y or above in G1 bearing DNMT3A/IDH2 mutations usually seen in acute myeloid leukemia, suggesting the nature of mixed phenotype acute leukemia.


Asunto(s)
Leucemia-Linfoma Linfoblástico de Células T Precursoras , Transcriptoma , Niño , Humanos , Mutación , Leucemia-Linfoma Linfoblástico de Células T Precursoras/genética
4.
BMC Biol ; 19(1): 117, 2021 06 04.
Artículo en Inglés | MEDLINE | ID: mdl-34088313

RESUMEN

BACKGROUND: The vacuole/lysosome is the final destination of autophagic pathways, but can also itself be degraded in whole or in part by selective macroautophagic or microautophagic processes. Diverse molecular mechanisms are involved in these processes, the characterization of which has lagged behind those of ATG-dependent macroautophagy and ESCRT-dependent endosomal multivesicular body pathways. RESULTS: Here we show that as yeast cells gradually exhaust available nutrients and approach stationary phase, multiple vacuolar integral membrane proteins with unrelated functions are degraded in the vacuolar lumen. This degradation depends on the ESCRT machinery, but does not strictly require ubiquitination of cargos or trafficking of cargos out of the vacuole. It is also temporally and mechanistically distinct from NPC-dependent microlipophagy. The turnover is facilitated by Atg8, an exception among autophagy proteins, and an Atg8-interacting vacuolar membrane protein, Hfl1. Lack of Atg8 or Hfl1 led to the accumulation of enlarged lumenal membrane structures in the vacuole. We further show that a key function of Hfl1 is the membrane recruitment of Atg8. In the presence of Hfl1, lipidation of Atg8 is not required for efficient cargo turnover. The need for Hfl1 can be partially bypassed by blocking Atg8 delipidation. CONCLUSIONS: Our data reveal a vacuolar membrane protein degradation process with a unique dependence on vacuole-associated Atg8 downstream of ESCRTs, and we identify a specific role of Hfl1, a protein conserved from yeast to plants and animals, in membrane targeting of Atg8.


Asunto(s)
Saccharomyces cerevisiae , Vacuolas , Animales , Autofagia , Familia de las Proteínas 8 Relacionadas con la Autofagia/genética , Complejos de Clasificación Endosomal Requeridos para el Transporte/genética , Proteínas de la Membrana/genética , Saccharomyces cerevisiae/genética , Proteínas de Saccharomyces cerevisiae/genética
5.
J Chem Inf Model ; 61(10): 5141-5151, 2021 10 25.
Artículo en Inglés | MEDLINE | ID: mdl-34546059

RESUMEN

Intrinsically disordered proteins (IDPs) have no fixed three-dimensional (3D) structures under physiological conditions, with the content being about 51% in human proteomics. IDPs are associated with many human diseases, such as cancer, diabetes, and neurodegenerative diseases. Because IDPs do not crystallize and have diverse conformers, traditional experimental methods such as crystallization and NMR can hardly capture their conformation ensemble and just provide average structural characters of IDPs. Therefore, molecular dynamics (MD) simulations become a valuable complement to the experimental data. However, the accuracy of molecular dynamics simulation for IDPs depends on the combination of force fields and solvent models. Recently, we released an environment-specific force field (ESFF1) for IDPs, which can well reproduce the local structural properties (such as J-coupling and secondary chemical shifts). However, there is still a large deviation for the radius of gyration (Rg). Therefore, a solvent model combined with ESFF1 is necessary to capture the local and global characters for IDPs and ordered proteins. Here, we investigated the underestimation or overestimation of the solvent interaction for four solvent models (TIP3P, TIP4P-Ew, TIP4P-D, OPC) under ESFF1 and found the important ε parameter of the solvent model to play a key role in scaling Rg. A near-linear relationship between the simulation Rg and the ε parameter was used to develop the new solvent model, named TIP4P-B. The results indicate that the simulated Rg with TIP4P-B is in better agreement with the experimental observations than the other four solvent models. Simultaneously, TIP4P-B can also maintain the advantages of the ESFF1 force field for the local structural properties. Additionally, TIP4P-B can successfully sample the conformation of ordered proteins. These findings confirm that TIP4P-B is a balanced solvent model and can improve sampling Rg performance for folded proteins and IDPs.


Asunto(s)
Proteínas Intrínsecamente Desordenadas , Humanos , Espectroscopía de Resonancia Magnética , Simulación de Dinámica Molecular , Conformación Proteica , Solventes
6.
J Chem Inf Model ; 61(3): 1037-1047, 2021 03 22.
Artículo en Inglés | MEDLINE | ID: mdl-33591749

RESUMEN

Intrinsically disordered proteins (IDPs) are widely distributed across eukaryotic cells, playing important roles in molecular recognition, molecular assembly, post-translational modification, and other biological processes. IDPs are also associated with many diseases such as cancers, cardiovascular diseases, and neurodegenerative diseases. Due to their structural flexibility, conventional experimental methods cannot reliably capture their heterogeneous structures. Molecular dynamics simulation becomes an important complementary tool to quantify IDP structures. This review covers recent force field strategies proposed for more accurate molecular dynamics simulations of IDPs. The strategies include adjusting dihedral parameters, adding grid-based energy correction map (CMAP) parameters, refining protein-water interactions, and others. Different force fields were found to perform well on specific observables of specific IDPs but also are limited in reproducing all available experimental observables consistently for all tested IDPs. We conclude the review with perspective areas for improvements for future force fields for IDPs.


Asunto(s)
Proteínas Intrínsecamente Desordenadas , Simulación de Dinámica Molecular , Conformación Proteica , Agua
7.
J Chem Theory Comput ; 20(14): 6082-6097, 2024 Jul 23.
Artículo en Inglés | MEDLINE | ID: mdl-38980289

RESUMEN

Excited-ground-state transition and strand slippage of RNA play key roles in transcription and translation of central dogma. Due to limitation of current experimental techniques, the dynamic structure ensembles of RNA remain inadequately understood. Molecular dynamics simulations offer a promising complementary approach, whose accuracy depends on the force field. Here, we develop the new version of RNA base-specific force field (BSFF2) to address underestimation of base pairing stability and artificial backbone conformations. Extensive evaluations on typical RNA systems have comprehensively confirmed the accuracy of BSFF2. Furthermore, BSFF2 demonstrates exceptional efficiency in de novo folding of tetraloops and reproducing base pair reshuffling transition between RNA excited and ground states. Then, we explored the RNA strand slippage mechanism with BSFF2. We conducted a comprehensive three-dimensional structural investigation into the strand slippage of the most complex r(G4C2)9 repeat element and presented the molecular details in the dynamic transition along with the underlying mechanism. Our results of capturing the strand slippage, excited-ground transition, de novo folding, and simulations for various typical RNA motifs indicate that BSFF2 should be one of valuable tools for dynamic conformation research and structure prediction of RNA, and a future contribution to RNA-targeted drug design as well as RNA therapy development.


Asunto(s)
Emparejamiento Base , Simulación de Dinámica Molecular , Conformación de Ácido Nucleico , ARN , ARN/química
8.
J Chem Theory Comput ; 19(15): 4837-4850, 2023 Aug 08.
Artículo en Inglés | MEDLINE | ID: mdl-37452752

RESUMEN

Intrinsically disordered proteins (IDPs) play a critical role in many biological processes. Due to the inherent structural flexibility of IDPs, experimental methods present significant challenges for sampling their conformational information at the atomic level. Therefore, molecular dynamics (MD) simulations have emerged as the primary tools for modeling IDPs whose accuracy depend on force field and water model. To enhance the accuracy of physical modeling of IDPs, several force fields have been developed. However, current water models lack precision and underestimate the interaction between water molecules and proteins. Here, we used Monte-Carlo re-weighting method to re-parameterize a three-point water model based on OPC3 for IDPs (named OPC3-B). We benchmarked the performance of OPC3-B compared with nine different water models for 10 IDPs and three ordered proteins. The results indicate that the performance of OPC3-B is better than other water models for both IDPs and ordered proteins. At the same time, OPC3-B possess the power of transferability with other force field to simulate IDPs. This newly developed water model can be used to insight into the research of sequence-disordered-function paradigm for IDPs.


Asunto(s)
Proteínas Intrínsecamente Desordenadas , Agua , Conformación Proteica , Agua/química , Proteínas Intrínsecamente Desordenadas/química , Simulación de Dinámica Molecular , Benchmarking
9.
Int J Biol Macromol ; 222(Pt A): 680-690, 2022 Dec 01.
Artículo en Inglés | MEDLINE | ID: mdl-36167105

RESUMEN

RNA plays a key role in numerous biological processes. Traditional experimental methods have difficulties capturing the structure and dynamic conformation of RNA. Thus, Molecular dynamic simulations (MDs) has become an essential complementary for RNA experiment. However, state-of-the-art RNA force fields have two major limitations of overestimation base stacking propensity and generation of a high ratio of intercalated conformations. Therefore, a two-step strategy was used to optimize the parameters of ff99bsc0χOL3 (named BSFF1) to improve these limitations, which as well adjusted the unbonded parameters of nucleobase heavy atoms and added ζ/α grid-based energy correction map energy term with reweighting. MD simulations of tetranucleotides indicate that BSFF1 can significantly decrease the ratio of intercalated conformations. Tests of single-strand RNA and kink-turn show that BSFF1 force field can reproduce more accurate conformers than ff99bsc0χOL3 force field. BSFF1 can also stabilize the conformers of duplex and riboswitch. The successful ab initio folding of tetraloop further supports the performance of BSFF1. These findings confirm that the newly developed force field BSFF1 can improve the conformer sampling of RNA.


Asunto(s)
ARN , Riboswitch , ARN/química , Nucleótidos , Conformación Molecular , Simulación de Dinámica Molecular , Conformación de Ácido Nucleico
10.
Comput Struct Biotechnol J ; 20: 368-379, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35035789

RESUMEN

The Arabidopsis Serine/Threonine Kinase 1 (SIK1) is a Sterile 20 (STE20)/Hippo orthologue that is also categorized as a Mitogen-Activated Protein Kinase Kinase Kinase Kinase (MAP4K). Like its animal and fungi orthologues, SIK1 is required for cell cycle exit, cell expansion, polarity establishment, as well as pathogenic response. The catalytic activity of SIK1, like other MAPKs, is presumably regulated by its phosphorylation states. Since no crystal structure for SIK1 has been reported yet, we built structural models for SIK1 kinase domain in different phosphorylation states with different pocket conformation to see how this kinase may be regulated. Using computational structural biology methods, we outlined a conduction path in which a phosphorylation site on the A-loop regulates the catalytic activity of SIK1 by controlling the closing or opening of the catalytic pocket at the G-loop. Furthermore, with analyses on the dynamic motions and in vitro kinase assay, we confirmed that three key residues in this conduction path, Lys278, Glu295, and Arg370, are indeed important for the kinase activity of SIK1. Since these residues are conserved in all STE20 kinases examined, the regulatory mechanism that we discovered may be common in STE20 kinases.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA