Search | BVS IEC

1.

Structural basis of DNA recognition by BEN domain proteins reveals a role for oligomerization in unmethylated DNA selection by BANP.

Ren, Jiahao; Wang, Junmeng; Ren, Yanpeng; Zhang, Yuyang; Wei, Pengshuai; Wang, Meng; Zhang, Yimeng; Li, Meng; Yuan, Chuyan; Gong, Haipeng; Jiang, Junyi; Wang, Zhanxin.

Nucleic Acids Res ; 52(18): 11349-11361, 2024 Oct 14.

Article in English | MEDLINE | ID: mdl-39225042

ABSTRACT

The BEN domain is a newly discovered type of DNA-binding domain that exists in a variety of species. There are nine BEN domain-containing proteins in humans, and most have been shown to have chromatin-related functions. NACC1 preferentially binds to CATG motif-containing sequences and functions primarily as a transcriptional coregulator. BANP and BEND3 preferentially bind DNA bearing unmethylated CpG motifs, and they function as CpG island-binding proteins. To date, the DNA recognition mechanism of quite a few of these proteins remains to be determined. In this study, we solved the crystal structures of the BEN domains of NACC1 and BANP in complex with their cognate DNA substrates. We revealed the details of DNA binding by these BEN domain proteins and unexpectedly revealed that oligomerization is required for BANP to select unmethylated CGCG motif-containing DNA substrates. Our study clarifies the controversies surrounding DNA recognition by BANP and demonstrates a new mechanism by which BANP selects unmethylated CpG motifs and functions as a CpG island-binding protein. This understanding will facilitate further exploration of the physiological functions of the BEN domain proteins in the future.

Subjects

CpG Islands , DNA-Binding Proteins , DNA , Protein Binding , Humans , Binding Sites , Crystallography, X-Ray , DNA/chemistry , DNA/genetics , DNA Methylation , DNA-Binding Proteins/chemistry , DNA-Binding Proteins/genetics , Models, Molecular , Protein Domains , Protein Multimerization , Repressor Proteins/chemistry , Repressor Proteins/genetics , Nuclear Proteins/chemistry , Nuclear Proteins/genetics , Cell Cycle Proteins/chemistry , Cell Cycle Proteins/genetics

2.

Protein design via deep learning.

Ding, Wenze; Nakai, Kenta; Gong, Haipeng.

Brief Bioinform ; 23(3), 2022 May 13.

Article in English | MEDLINE | ID: mdl-35348602

ABSTRACT

Proteins with desired functions and properties are important in fields like nanotechnology and biomedicine. De novo protein design enables the production of previously unseen proteins from the ground up and is believed to be key to addressing real societal challenges. The recent introduction of deep learning into design methods has had a transformative influence and is expected to represent a promising and exciting future direction. In this review, we survey the major aspects of current advances in deep-learning-based design procedures and illustrate their novelty in comparison with conventional knowledge-based approaches through notable cases. We not only describe deep learning developments in structure-based protein design and direct sequence design, but also highlight recent applications of deep reinforcement learning in protein design. Future perspectives on design goals, challenges and opportunities are also comprehensively discussed.

Subjects

Deep Learning , Knowledge Bases , Proteins

3.

Study on the diagnostic value of MDCT extramural vascular invasion in preoperative N staging of gastric cancer patients.

Zhu, Zhengqi; Mao, Mimi; Song, Anyi; Gong, Haipeng; Gu, Jianan; Dai, Yongfeng; Feng, Feng.

BMC Med Imaging ; 24(1): 20, 2024 Jan 19.

Article in English | MEDLINE | ID: mdl-38243288

ABSTRACT

BACKGROUND: To explore the diagnostic value of multidetector computed tomography (MDCT) extramural vascular invasion (EMVI) in preoperative N staging of gastric cancer patients. METHODS: According to the MR-defined EMVI scoring standard of rectal cancer, we developed a 5-point scale scoring system to evaluate the status of CT-detected extramural vascular invasion (ctEMVI): scores of 0-2 were ctEMVI-negative, and scores of 3-4 were ctEMVI-positive. Patients were divided into a ctEMVI-positive group and a ctEMVI-negative group. The correlation between ctEMVI and clinical features was analyzed. A receiver operating characteristic (ROC) curve was used to evaluate the diagnostic efficacy of ctEMVI for pathological metastatic lymph nodes and N staging. The sensitivity, specificity, accuracy, positive predictive value (PPV), and negative predictive value (NPV) of pathological N staging using ctEMVI and short-axis diameter were generated and compared. RESULTS: The occurrence rate of lymphovascular invasion (LVI) and the proportion of tumors with a greatest diameter > 6 cm were higher in the ctEMVI-positive group than in the ctEMVI-negative group (P < 0.05). Spearman correlation analysis showed a positive correlation between ctEMVI and LVI, N stage, and tumor size (P < 0.05). For ctEMVI scores ≥ 3, the AUCs of ctEMVI for diagnosing lymph node metastasis, N stage ≥ N2, and N3 stage were 0.857, 0.802, and 0.758, respectively. The sensitivity, NPV and accuracy of ctEMVI for diagnosing N stage ≥ N2 were superior to those of short-axis diameter (P < 0.05), while sensitivity, specificity, PPV, NPV, and accuracy of ctEMVI for diagnosing N3 stage were superior to those of short-axis diameter (P < 0.05). CONCLUSION: ctEMVI has important value in diagnosing metastatic lymph nodes and advanced N staging. As an important imaging marker, ctEMVI can be included in the preoperative imaging evaluation of patients, providing important assistance for clinical guidance and treatment.

Subjects

Multidetector Computed Tomography , Stomach Neoplasms , Humans , Stomach Neoplasms/diagnostic imaging , Stomach Neoplasms/surgery , Stomach Neoplasms/pathology , Neoplasm Invasiveness/diagnostic imaging , Neoplasm Invasiveness/pathology , Retrospective Studies , Lymph Nodes/pathology , Neoplasm Staging

4.

Vision-Based Real-Time Bolt Loosening Detection by Identifying Anti-Loosening Lines.

Lei, Wenyang; Yuan, Fang; Guo, Jiang; Wang, Haoyang; Geng, Zaiming; Wu, Tao; Gong, Haipeng.

Sensors (Basel) ; 24(20), 2024 Oct 20.

Article in English | MEDLINE | ID: mdl-39460227

ABSTRACT

Bolt loosening detection is crucial for ensuring the safe operation of equipment. This paper presents a vision-based real-time detection method that identifies bolt loosening by recognizing anti-loosening line markers at bolt connections. The method employs the YOLOv10-S deep learning model for high-precision, real-time bolt detection, followed by a two-step Fast-SCNN image segmentation technique. This approach effectively isolates the bolt and nut regions, enabling accurate extraction of the anti-loosening line markers. Key intersection points are calculated using ellipse and line fitting techniques, and the loosening angle is determined through spatial projection transformation. The experimental results demonstrate that, for high-resolution images of 2048 × 1024 pixels, the proposed method achieves an average angle detection error of 1.145° with a detection speed of 32 FPS. Compared to traditional methods and other vision-based approaches, this method offers non-contact measurement, real-time detection capabilities, reduced detection error, and general adaptability to various bolt types and configurations, indicating significant application potential.

5.

BERT-Kcr: prediction of lysine crotonylation sites by a transfer learning method with pre-trained BERT models.

Qiao, Yanhua; Zhu, Xiaolei; Gong, Haipeng.

Bioinformatics ; 38(3): 648-654, 2022 Jan 12.

Article in English | MEDLINE | ID: mdl-34643684

ABSTRACT

MOTIVATION: As one of the most important post-translational modifications (PTMs), protein lysine crotonylation (Kcr), which is involved in important physiological activities such as cell differentiation and metabolism, has attracted wide attention. However, experimental methods for Kcr identification are expensive and time-consuming. Instead, computational methods can predict Kcr sites in silico with high efficiency and low cost. RESULTS: In this study, we proposed a novel predictor, BERT-Kcr, for protein Kcr site prediction, which was developed by using a transfer learning method with pre-trained bidirectional encoder representations from transformers (BERT) models. These models were originally used for natural language processing (NLP) tasks, such as sentence classification. Here, we converted each amino acid into a word as the input information to the pre-trained BERT model. The features encoded by BERT were extracted and then fed to a BiLSTM network to build our final model. Compared with the models built by other machine learning and deep learning classifiers, BERT-Kcr achieved the best performance with AUROC of 0.983 for 10-fold cross validation. Further evaluation on the independent test set indicates that BERT-Kcr outperforms the state-of-the-art model Deep-Kcr with an improvement of about 5% for AUROC. The results of our experiment indicate that the direct use of sequence information and advanced pre-trained models of NLP could be an effective way for identifying PTM sites of proteins. AVAILABILITY AND IMPLEMENTATION: The BERT-Kcr model is publicly available on http://zhulab.org.cn/BERT-Kcr_models/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Lysine , Machine Learning , Lysine/metabolism , Language , Natural Language Processing , Protein Processing, Post-Translational

6.

SAMF: a self-adaptive protein modeling framework.

Ding, Wenze; Xu, Qijiang; Liu, Siyuan; Wang, Tong; Shao, Bin; Gong, Haipeng; Liu, Tie-Yan.

Bioinformatics ; 37(22): 4075-4082, 2021 Nov 18.

Article in English | MEDLINE | ID: mdl-34042965

ABSTRACT

MOTIVATION: Gradient descent-based protein modeling is a popular protein structure prediction approach that takes as input the predicted inter-residue distances and other necessary constraints and folds protein structures by minimizing protein-specific energy potentials. The constraints from multiple predicted protein properties provide redundant and sometimes conflicting information that can trap the optimization process in local minima and impair the modeling efficiency. RESULTS: To address these issues, we developed a self-adaptive protein modeling framework, SAMF. It eliminates redundancy of constraints and resolves conflicts, folds protein structures in an iterative way, and picks the best structures by a deep quality analysis system. Without a large amount of complicated domain knowledge and numerous patches as barriers, SAMF achieves state-of-the-art performance by exploiting the power of cutting-edge techniques of deep learning. SAMF has a modular design and can be easily customized and extended. As the quality of input constraints is ever growing, the superiority of SAMF will be amplified over time. AVAILABILITY AND IMPLEMENTATION: The source code and data for reproducing the results are available at https://msracb.blob.core.windows.net/pub/psp/SAMF.zip. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Proteins , Software , Proteins/metabolism

7.

RDb2C2: an improved method to identify the residue-residue pairing in β strands.

Shao, Di; Mao, Wenzhi; Xing, Yaoguang; Gong, Haipeng.

BMC Bioinformatics ; 21(1): 133, 2020 Apr 03.

Article in English | MEDLINE | ID: mdl-32245403

ABSTRACT

BACKGROUND: Despite the great advance of protein structure prediction, accurate prediction of the structures of mainly β proteins is still highly challenging, but could be assisted by the knowledge of residue-residue pairing in β strands. Previously, we proposed a ridge-detection-based algorithm RDb2C that adopted a multi-stage random forest framework to predict the β-β pairing given the amino acid sequence of a protein. RESULTS: In this work, we developed a second version of this algorithm, RDb2C2, by employing the residual neural network to further enhance the prediction accuracy. In the benchmark test, this new algorithm improves the F1-score by > 10 percentage points, reaching impressively high values of ~ 72% and ~ 73% in the BetaSheet916 and BetaSheet1452 sets, respectively. CONCLUSION: Our new method promotes the prediction accuracy of β-β pairing to a new level and the prediction results could better assist the structure modeling of mainly β proteins. We prepared an online server of RDb2C2 at http://structpred.life.tsinghua.edu.cn/rdb2c2.html.

Subjects

Algorithms , Protein Conformation, beta-Strand , Sequence Analysis, Protein/methods , Neural Networks, Computer

8.

Identification of residue pairing in interacting β-strands from a predicted residue contact map.

Mao, Wenzhi; Wang, Tong; Zhang, Wenxuan; Gong, Haipeng.

BMC Bioinformatics ; 19(1): 146, 2018 Apr 19.

Article in English | MEDLINE | ID: mdl-29673311

ABSTRACT

BACKGROUND: Despite the rapid progress of protein residue contact prediction, predicted residue contact maps frequently contain many errors. However, information of residue pairing in β strands could be extracted from a noisy contact map, due to the presence of characteristic contact patterns in β-β interactions. This information may benefit the tertiary structure prediction of mainly β proteins. In this work, we propose a novel ridge-detection-based β-β contact predictor to identify residue pairing in β strands from any predicted residue contact map. RESULTS: Our algorithm RDb2C adopts ridge detection, a well-developed technique in computer image processing, to capture consecutive residue contacts, and then utilizes a novel multi-stage random forest framework to integrate the ridge information and additional features for prediction. Starting from the predicted contact map of CCMpred, RDb2C remarkably outperforms all state-of-the-art methods on two conventional test sets of β proteins (BetaSheet916 and BetaSheet1452), and achieves F1-scores of ~ 62% and ~ 76% at the residue level and strand level, respectively. Taking the prediction of the more advanced RaptorX-Contact as input, RDb2C achieves impressively higher performance, with F1-scores reaching ~ 76% and ~ 86% at the residue level and strand level, respectively. In a test of structural modeling using the top 1 L predicted contacts as constraints, for 61 mainly β proteins, the average TM-score achieves 0.442 when using the raw RaptorX-Contact prediction, but increases to 0.506 when using the improved prediction by RDb2C. CONCLUSION: Our method can significantly improve the prediction of β-β contacts from any predicted residue contact maps. Prediction results of our algorithm could be directly applied to effectively facilitate the practical structure prediction of mainly β proteins. AVAILABILITY: All source data and codes are available at http://166.111.152.91/Downloads.html or the GitHub address of https://github.com/wzmao/RDb2C.

Subjects

Amino Acids/chemistry , Computational Biology/methods , Proteins/chemistry , Algorithms , Models, Molecular , Protein Conformation, beta-Strand , Protein Structure, Tertiary , Reproducibility of Results

9.

A deep learning framework for improving long-range residue-residue contact prediction using a hierarchical strategy.

Xiong, Dapeng; Zeng, Jianyang; Gong, Haipeng.

Bioinformatics ; 33(17): 2675-2683, 2017 Sep 01.

Article in English | MEDLINE | ID: mdl-28472263

ABSTRACT

MOTIVATION: Residue-residue contacts are of great value for protein structure prediction, since contact information, especially from those long-range residue pairs, can significantly reduce the complexity of conformational sampling for protein structure prediction in practice. Despite progress in the past decade on protein targets with abundant homologous sequences, accurate contact prediction for proteins with limited sequence information is still far from satisfactory. Methodologies for these hard targets still need further improvement. RESULTS: We presented a computational program DeepConPred, which includes a pipeline of two novel deep-learning-based methods (DeepCCon and DeepRCon) as well as a contact refinement step, to improve the prediction of long-range residue contacts from primary sequences. When compared with previous prediction approaches, our framework employed an effective scheme to identify optimal and important features for contact prediction, and was only trained with coevolutionary information derived from a limited number of homologous sequences to ensure robustness and usefulness for hard targets. Independent tests showed that 59.33%/49.97%, 64.39%/54.01% and 70.00%/59.81% of the top L/5, top L/10 and top 5 predictions were correct for CASP10/CASP11 proteins, respectively. In general, our algorithm ranked as one of the best methods for CASP targets. AVAILABILITY AND IMPLEMENTATION: All source data and codes are available at http://166.111.152.91/Downloads.html. CONTACT: hgong@tsinghua.edu.cn or zengjy321@tsinghua.edu.cn. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Subjects

Computational Biology/methods , Machine Learning , Models, Molecular , Protein Conformation , Software , Databases, Protein

10.

LRFragLib: an effective algorithm to identify fragments for de novo protein structure prediction.

Wang, Tong; Yang, Yuedong; Zhou, Yaoqi; Gong, Haipeng.

Bioinformatics ; 33(5): 677-684, 2017 Mar 01.

Article in English | MEDLINE | ID: mdl-27797773

ABSTRACT

Motivation: The quality of fragment library determines the efficiency of fragment assembly, an approach that is widely used in most de novo protein-structure prediction algorithms. Conventional fragment libraries are constructed mainly based on the identities of amino acids, sometimes facilitated by predicted information including dihedral angles and secondary structures. However, it remains challenging to identify near-native fragment structures with low sequence homology. Results: We introduce a novel fragment-library-construction algorithm, LRFragLib, to improve the detection of near-native low-homology fragments of 7-10 residues, using a multi-stage, flexible selection protocol. Based on logistic regression scoring models, LRFragLib outperforms existing techniques by achieving a significantly higher precision and a comparable coverage on recent CASP protein sets in sampling near-native structures. The method also has a comparable computational efficiency to the fastest existing techniques with substantially reduced memory usage. Availability and Implementation: The source code is available for download at http://166.111.152.91/Downloads.html. Contact: hgong@tsinghua.edu.cn. Supplementary information: Supplementary data are available at Bioinformatics online.

Subjects

Computational Biology/methods , Proteins/chemistry , Software , Algorithms , Caspases/chemistry , Protein Structure, Secondary

11.

Molecular determinants for the thermodynamic and functional divergence of uniporter GLUT1 and proton symporter XylE.

Ke, Meng; Yuan, Yafei; Jiang, Xin; Yan, Nieng; Gong, Haipeng.

PLoS Comput Biol ; 13(6): e1005603, 2017 Jun.

Article in English | MEDLINE | ID: mdl-28617850

ABSTRACT

GLUT1 facilitates the down-gradient translocation of D-glucose across cell membrane in mammals. XylE, an Escherichia coli homolog of GLUT1, utilizes proton gradient as an energy source to drive uphill D-xylose transport. Previous studies of XylE and GLUT1 suggest that the variation between an acidic residue (Asp27 in XylE) and a neutral one (Asn29 in GLUT1) is a key element for their mechanistic divergence. In this work, we combined computational and biochemical approaches to investigate the mechanism of proton coupling by XylE and the functional divergence between GLUT1 and XylE. Using molecular dynamics simulations, we evaluated the free energy profiles of the transition between inward- and outward-facing conformations for the apo proteins. Our results revealed the correlation between the protonation state and conformational preference in XylE, which is supported by the crystal structures. In addition, our simulations suggested a thermodynamic difference between XylE and GLUT1 that cannot be explained by the single residue variation at the protonation site. To understand the molecular basis, we applied Bayesian network models to analyze the alteration in the architecture of the hydrogen bond networks during conformational transition. The models and subsequent experimental validation suggest that multiple residue substitutions are required to produce the thermodynamic and functional distinction between XylE and GLUT1. Despite the lack of simulation studies with substrates, these computational and biochemical characterizations provide unprecedented insight into the mechanistic difference between proton symporters and uniporters.

Subjects

Escherichia coli Proteins/chemistry , Escherichia coli Proteins/ultrastructure , Glucose Transporter Type 1/chemistry , Glucose Transporter Type 1/ultrastructure , Models, Chemical , Molecular Dynamics Simulation , Symporters/chemistry , Symporters/ultrastructure , Energy Transfer , Humans , Protein Binding , Protein Conformation , Structure-Activity Relationship , Thermodynamics

12.

A deep learning framework for modeling structural features of RNA-binding protein targets.

Zhang, Sai; Zhou, Jingtian; Hu, Hailin; Gong, Haipeng; Chen, Ligong; Cheng, Chao; Zeng, Jianyang.

Nucleic Acids Res ; 44(4): e32, 2016 Feb 29.

Article in English | MEDLINE | ID: mdl-26467480

ABSTRACT

RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and accurately predict RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides new evidence to support the view that RBPs may have specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfactory results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found at https://github.com/thucombio/deepnet-rbp.

Subjects

Polypyrimidine Tract-Binding Protein/chemistry , RNA, Messenger/chemistry , RNA-Binding Proteins/chemistry , Ribosomes/chemistry , Binding Sites , Computational Biology , Gene Expression Regulation , Nucleic Acid Conformation , Polypyrimidine Tract-Binding Protein/genetics , RNA Processing, Post-Transcriptional/genetics , RNA, Messenger/metabolism , RNA-Binding Proteins/genetics , Ribosomes/genetics

13.

Structural and Dynamic Insights into the Mechanism of Allosteric Signal Transmission in ERK2-Mediated MKP3 Activation.

Lu, Chang; Liu, Xin; Zhang, Chen-Song; Gong, Haipeng; Wu, Jia-Wei; Wang, Zhi-Xin.

Biochemistry ; 56(46): 6165-6175, 2017 Nov 21.

Article in English | MEDLINE | ID: mdl-29077400

ABSTRACT

The mitogen-activated protein kinases (MAPKs) are key components of cellular signal transduction pathways, which are down-regulated by the MAPK phosphatases (MKPs). Catalytic activity of the MKPs is controlled both by their ability to recognize selective MAPKs and by allosteric activation upon binding to MAPK substrates. Here, we use a combination of experimental and computational techniques to elucidate the molecular mechanism for the ERK2-induced MKP3 activation. Mutational and kinetic study shows that the 334FNFM337 motif in the MKP3 catalytic domain is essential for MKP3-mediated ERK2 inactivation and is responsible for ERK2-mediated MKP3 activation. The long-term molecular dynamics (MD) simulations further reveal a complete dynamic process in which the catalytic domain of MKP3 gradually changes to a conformation that resembles an active MKP catalytic domain over the time scale of the simulation, providing a direct time-dependent observation of allosteric signal transmission in ERK2-induced MKP3 activation.

Subjects

Dual Specificity Phosphatase 6/metabolism , Enzyme Activation , Mitogen-Activated Protein Kinase 1/metabolism , Signal Transduction , Allosteric Regulation , Animals , Catalytic Domain , Dual Specificity Phosphatase 6/chemistry , Humans , Mice , Mitogen-Activated Protein Kinase 1/chemistry , Molecular Dynamics Simulation , Protein Binding , Protein Conformation , Rats

14.

Predicting the helix-helix interactions from correlated residue mutations.

Xiong, Dapeng; Mao, Wenzhi; Gong, Haipeng.

Proteins ; 85(12): 2162-2169, 2017 Dec.

Article in English | MEDLINE | ID: mdl-28833538

ABSTRACT

Helix-helix interactions are crucial in the structure assembly, stability and function of helix-rich proteins including many membrane proteins. In spite of remarkable progress over the past decades, the accuracy of predicting protein structures from their amino acid sequences is still far from satisfactory. In this work, we focused on a simpler problem, the prediction of helix-helix interactions, the results of which could facilitate practical protein structure prediction by constraining the sampling space. Specifically, we started from the noisy 2D residue contact maps derived from correlated residue mutations, and utilized ridge detection to identify the characteristic residue contact patterns for helix-helix interactions. The ridge information as well as a few additional features were then fed into a machine learning model HHConPred to predict interactions between helix pairs. In an independent test, our method achieved an F-measure of ~60% for predicting helix-helix interactions. Moreover, although the model was trained mainly using soluble proteins, it could be extended to membrane proteins with at least comparable performance relative to previous approaches that were generated purely using membrane proteins. All data and source codes are available at http://166.111.152.91/Downloads.html or https://github.com/dpxiong/HHConPred.

Subjects

Computational Biology/methods , Machine Learning , Membrane Proteins/chemistry , Amino Acid Sequence , Binding Sites , Protein Binding , Protein Conformation, alpha-Helical , Protein Interaction Domains and Motifs

15.

Coupling between ATP hydrolysis and protein conformational change in maltose transporter.

Lv, Xiaoying; Liu, Hao; Chen, Haifeng; Gong, Haipeng.

Proteins ; 85(2): 207-220, 2017 Feb.

Article in English | MEDLINE | ID: mdl-27616441

ABSTRACT

As the intracellular part of maltose transporter, MalK dimer utilizes the energy of ATP hydrolysis to drive protein conformational change, which then facilitates substrate transport. Free energy evaluation of the complete conformational change before and after ATP hydrolysis is helpful to elucidate the mechanism of chemical-to-mechanical energy conversion in MalK dimer, but is lacking in previous studies. In this work, we used molecular dynamics simulations to investigate the structural transition of MalK dimer among closed, semi-open and open states. We observed spontaneous structural transition from closed to open state in the ADP-bound system and partial closure of MalK dimer from the semi-open state in the ATP-bound system. Subsequently, we calculated the reaction pathways connecting the closed and open states for the ATP- and ADP-bound systems and evaluated the free energy profiles along the paths. Our results suggested that the closed state is stable in the presence of ATP but is markedly destabilized when ATP is hydrolyzed to ADP, which thus explains the coupling between ATP hydrolysis and protein conformational change of MalK dimer in thermodynamics. Proteins 2017; 85:207-220. © 2016 Wiley Periodicals, Inc.

Subjects

ATP-Binding Cassette Transporters/chemistry , Adenosine Triphosphate/chemistry , Escherichia coli Proteins/chemistry , Escherichia coli/genetics , ATP-Binding Cassette Transporters/genetics , ATP-Binding Cassette Transporters/metabolism , Adenosine Triphosphate/metabolism , Binding Sites , Cloning, Molecular , Escherichia coli/metabolism , Escherichia coli Proteins/genetics , Escherichia coli Proteins/metabolism , Gene Expression , Hydrolysis , Molecular Dynamics Simulation , Protein Binding , Protein Interaction Domains and Motifs , Protein Multimerization , Protein Structure, Secondary , Recombinant Proteins/chemistry , Recombinant Proteins/genetics , Recombinant Proteins/metabolism , Thermodynamics

16.

Molecular dynamics study of ion transport through an open model of voltage-gated sodium channel.

Li, Yang; Sun, Ruining; Liu, Huihui; Gong, Haipeng.

Biochim Biophys Acta Biomembr ; 1859(5): 879-887, 2017 May.

Article in English | MEDLINE | ID: mdl-28188741

ABSTRACT

Voltage-gated sodium (NaV) channels are critical in the signal transduction of excitable cells. In this work, we modeled the open conformation for the pore domain of a prokaryotic NaV channel (NaVRh), and used molecular dynamics simulations to track the translocation of dozens of Na+ ions through the channel in the presence of a physiological transmembrane ion concentration gradient and a transmembrane electrical field that was closer to the physiological one than in previous studies. Channel conductance was then estimated from simulations on the wild-type and DEKA mutant of NaVRh. Interestingly, the conductivity predicted from the DEKA mutant agrees well with experimental measurement on the eukaryotic NaV1.4 channel. Moreover, the wild-type and DEKA mutant of NaVRh exhibited markedly distinct ion permeation patterns, which implies a mechanistic difference between prokaryotic and eukaryotic NaV channels.

Subjects

Ion Transport , Molecular Dynamics Simulation , Voltage-Gated Sodium Channels/physiology , Binding Sites , Membrane Potentials , Protein Conformation , Voltage-Gated Sodium Channels/chemistry

17.

Data construction for phosphorylation site prediction.

Gong, Haipeng; Liu, Xiaoqing; Wu, Jun; He, Zengyou.

Brief Bioinform ; 15(5): 839-55, 2014 Sep.

Article in English | MEDLINE | ID: mdl-23543354

ABSTRACT

Protein phosphorylation is one of the most pervasive post-translational modifications, regulating diverse cellular processes in various organisms. As mass spectrometry-based experimental approaches for identifying phosphorylation events are resource-intensive, many computational methods have been proposed, in which phosphorylation site prediction is formulated as a classification problem. They differ in several ways, and one crucial issue is the construction of training data and test data for unbiased performance evaluation. In this article, we categorize the existing data construction methods and try to answer three questions: (i) Is it equivalent to use different data construction methods in the assessment of phosphorylation site prediction algorithms? (ii) What kind of test data set is unbiased for assessing the prediction performance of a trained algorithm in different real world scenarios? (iii) Among the summarized training data construction methods, which one(s) has better generalization performance for most scenarios? To answer these questions, we conduct comprehensive experimental studies for both non-kinase-specific and kinase-specific prediction tasks. The experimental results show that: (i) different data construction methods can lead to significantly different prediction performance; (ii) there can be different test data construction methods that are unbiased with respect to different real world scenarios; and (iii) different data construction methods have different generalization performance in different real world scenarios. Therefore, when developing new algorithms in future research, people should concentrate on what kind of scenario their algorithm will work for, what the corresponding unbiased test data are and which training data construction method can generate best generalization performance.

Subjects

Proteins/metabolism , Algorithms , Phosphorylation

18.

Structure of a fucose transporter in an outward-open conformation.

Dang, Shangyu; Sun, Linfeng; Huang, Yongjian; Lu, Feiran; Liu, Yufeng; Gong, Haipeng; Wang, Jiawei; Yan, Nieng.

Nature ; 467(7316): 734-8, 2010 Oct 07.

Article in English | MEDLINE | ID: mdl-20877283

ABSTRACT

The major facilitator superfamily (MFS) transporters are an ancient and widespread family of secondary active transporters. In Escherichia coli, the uptake of L-fucose, a source of carbon for microorganisms, is mediated by an MFS proton symporter, FucP. Despite intensive study of the MFS transporters, atomic structure information is only available on three proteins and the outward-open conformation has yet to be captured. Here we report the crystal structure of FucP at 3.1 Å resolution, which shows that it contains an outward-open, amphipathic cavity. The similarly folded amino and carboxyl domains of FucP have contrasting surface features along the transport path, with negative electrostatic potential on the N domain and hydrophobic surface on the C domain. FucP only contains two acidic residues along the transport path, Asp 46 and Glu 135, which can undergo cycles of protonation and deprotonation. Their essential role in active transport is supported by both in vivo and in vitro experiments. Structure-based biochemical analyses provide insights into energy coupling, substrate recognition and the transport mechanism of FucP.

Subjects

Escherichia coli Proteins/chemistry , Escherichia coli/chemistry , Monosaccharide Transport Proteins/chemistry , Symporters/chemistry , X-Ray Crystallography , Escherichia coli Proteins/metabolism , Fucose/metabolism , Hydrophobic and Hydrophilic Interactions , Biological Models , Molecular Models , Monosaccharide Transport Proteins/metabolism , Protein Conformation , Protons , Rotation , Static Electricity , Symporters/metabolism

19.

Protonation of Glu(135) Facilitates the Outward-to-Inward Structural Transition of Fucose Transporter.

Liu, Yufeng; Ke, Meng; Gong, Haipeng.

Biophys J ; 109(3): 542-51, 2015 Aug 04.

Article in English | MEDLINE | ID: mdl-26244736

ABSTRACT

Major facilitator superfamily (MFS) transporters typically need to alternate between outward-facing and inward-facing conformations in order to transport the substrate across the membrane. To understand the mechanism, in this work, we focused on one MFS member, the L-fucose/H(+) symporter (FucP), whose crystal structure exhibits an outward-open conformation. Previous experiments implicate several residues as critical to substrate/proton binding and the structural transition of FucP, among which Glu(135), located in the periplasm-accessible vestibule, is thought to be involved in both proton translocation and conformational change of the protein. Here, the structural transition of FucP in the presence of substrate was investigated using molecular-dynamics simulations. By combining equilibrium and accelerated simulations with thermodynamic calculations, not only was the large-scale conformational change from the outward-facing to the inward-facing state directly observed, but the free energy change during the structural transition was also calculated. The simulations confirm the critical role of Glu(135), whose protonation facilitates the outward-to-inward structural transition both thermodynamically, by energetically favoring the inward-facing conformation, and kinetically, by reducing the free energy barrier along the reaction pathway. Our results may help mechanistic studies of both FucP and other MFS transporters.
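The thermodynamic and kinetic effects described in this abstract can be restated with standard textbook relations. The symbols below (the state populations P, the barrier height ΔG‡) are illustrative assumptions, not notation or values taken from the article.

```latex
\Delta G_{\mathrm{out}\to\mathrm{in}}
  = G_{\mathrm{in}} - G_{\mathrm{out}}
  = -k_{\mathrm{B}} T \,\ln\frac{P_{\mathrm{in}}}{P_{\mathrm{out}}},
\qquad
k_{\mathrm{out}\to\mathrm{in}} \;\propto\;
  \exp\!\left(-\frac{\Delta G^{\ddagger}}{k_{\mathrm{B}} T}\right)
```

In these terms, a lower free energy difference upon protonation raises the equilibrium population of the inward-facing state, while a smaller barrier increases the transition rate.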

Subjects

Molecular Dynamics Simulation , Monosaccharide Transport Proteins/chemistry , Protons , Amino Acid Sequence , Glutamic Acid/chemistry , Molecular Sequence Data , Monosaccharide Transport Proteins/metabolism

20.

RBRIdent: An algorithm for improved identification of RNA-binding residues in proteins from primary sequences.

Xiong, Dapeng; Zeng, Jianyang; Gong, Haipeng.

Proteins ; 83(6): 1068-77, 2015 Jun.

Article in English | MEDLINE | ID: mdl-25846271

ABSTRACT

Rapid and correct identification of RNA-binding residues based on protein primary sequences is of great importance. In most prevalent machine-learning-based identification methods, however, either some features are inefficiently represented or the redundancy between features is not effectively removed. Both problems may weaken the performance of a classifier system and raise its computational complexity. Here, we addressed these problems and developed a better classifier (RBRIdent) to identify RNA-binding residues. In an independent benchmark test, RBRIdent achieved an accuracy of 76.79%, a Matthews correlation coefficient of 0.3819 and an F-measure of 75.58%, remarkably outperforming all prevalent methods. These results suggest the necessity of proper feature description and the essential role of feature selection in this task. All source data and code are freely available at http://166.111.152.91/RBRIdent.
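For readers unfamiliar with the three scores quoted above, the following Python sketch computes accuracy, Matthews correlation coefficient and F-measure from binary confusion-matrix counts using their standard definitions. The function name and the example counts are hypothetical; the abstract does not specify how the F-measure was averaged, so this may not match the article's exact protocol.

```python
import math
from typing import Dict


def binary_metrics(tp: int, fp: int, tn: int, fn: int) -> Dict[str, float]:
    """Standard binary-classification scores from confusion counts."""
    total = tp + fp + tn + fn
    accuracy = (tp + tn) / total if total else 0.0

    precision = tp / (tp + fp) if (tp + fp) else 0.0
    recall = tp / (tp + fn) if (tp + fn) else 0.0
    # F-measure (F1): harmonic mean of precision and recall.
    f_measure = (
        2 * precision * recall / (precision + recall)
        if (precision + recall)
        else 0.0
    )

    # MCC is conventionally set to 0 when any marginal total is zero.
    denom = math.sqrt((tp + fp) * (tp + fn) * (tn + fp) * (tn + fn))
    mcc = (tp * tn - fp * fn) / denom if denom else 0.0

    return {"accuracy": accuracy, "mcc": mcc, "f_measure": f_measure}


if __name__ == "__main__":
    # Hypothetical counts, not data from the article.
    print(binary_metrics(tp=40, fp=10, tn=45, fn=5))
```

Unlike accuracy, the MCC uses all four cells of the confusion matrix, which is why it is often reported alongside accuracy for residue-level prediction, where binding residues are typically the minority class.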

Subjects

Algorithms , Computational Biology/methods , RNA-Binding Proteins/chemistry , RNA-Binding Proteins/metabolism , Protein Sequence Analysis/methods , Software , Binding Sites , Protein Databases , Machine Learning , Molecular Models

