Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 58
Filtrar
1.
Biochim Biophys Acta Proteins Proteom ; 1866(2): 379-386, 2018 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-28911812

RESUMO

BACKGROUND: Functional and biophysical constraints can cause different levels of sequence conservation in proteins. Previously, structural properties, e.g., relative solvent accessibility (RSA) and packing density of the weighted contact number (WCN), have been found to be related to protein sequence conservation (CS). The Voronoi volume has recently been recognized as a new structural property of the local protein structural environment reflecting CS. However, for surface residues, it is sensitive to water molecules surrounding the protein structure. Herein, we present a simple structural determinant termed the relative space of Voronoi volume (RSV); it uses the Voronoi volume and the van der Waals volume of particular residues to quantify the local structural environment. METHODS: RSV (range, 0-1) is defined as (Voronoi volume-van der Waals volume)/Voronoi volume of the target residue. The concept of RSV describes the extent of available space for every protein residue. RESULTS: RSV and Voronoi profiles with and without water molecules (RSVw, RSV, VOw, and VO) were compared for 554 non-homologous proteins. RSV (without water) showed better Pearson's correlations with CS than did RSVw, VO, or VOw values. The mean correlation coefficient between RSV and CS was 0.51, which is comparable to the correlation between RSA and CS (0.49) and that between WCN and CS (0.56). CONCLUSIONS: RSV is a robust structural descriptor with and without water molecules and can quantitatively reflect evolutionary information in a single protein structure. Therefore, it may represent a practical structural determinant to study protein sequence, structure, and function relationships.


Assuntos
Proteínas/química , Análise de Sequência de Proteína , Domínios Proteicos , Proteínas/genética , Relação Estrutura-Atividade
2.
Proteins ; 85(9): 1713-1723, 2017 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-28597979

RESUMO

Residues that are crucial to protein function or structure are usually evolutionarily conserved. To identify the important residues in protein, sequence conservation is estimated, and current methods rely upon the unbiased collection of homologous sequences. Surprisingly, our previous studies have shown that the sequence conservation is closely correlated with the weighted contact number (WCN), a measure of packing density for residue's structural environment, calculated only based on the Cα positions of a protein structure. Moreover, studies have shown that sequence conservation is correlated with environment-related structural properties calculated based on different protein substructures, such as a protein's all atoms, backbone atoms, side-chain atoms, or side-chain centroid. To know whether the Cα atomic positions are adequate to show the relationship between residue environment and sequence conservation or not, here we compared Cα atoms with other substructures in their contributions to the sequence conservation. Our results show that Cα positions are substantially equivalent to the other substructures in calculations of various measures of residue environment. As a result, the overlapping contributions between Cα atoms and the other substructures are high, yielding similar structure-conservation relationship. Take the WCN as an example, the average overlapping contribution to sequence conservation is 87% between Cα and all-atom substructures. These results indicate that only Cα atoms of a protein structure could reflect sequence conservation at the residue level.


Assuntos
Sequência de Aminoácidos/genética , Sequência Conservada/genética , Conformação Proteica , Proteínas/química , Modelos Moleculares , Proteínas/genética
3.
Nucleic Acids Res ; 43(W1): W338-42, 2015 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-25943546

RESUMO

Protein complexes are involved in many biological processes. Examining coupling between subunits of a complex would be useful to understand the molecular basis of protein function. Here, our updated (PS)(2) web server predicts the three-dimensional structures of protein complexes based on comparative modeling; furthermore, this server examines the coupling between subunits of the predicted complex by combining structural and evolutionary considerations. The predicted complex structure could be indicated and visualized by Java-based 3D graphics viewers and the structural and evolutionary profiles are shown and compared chain-by-chain. For each subunit, considerations with or without the packing contribution of other subunits cause the differences in similarities between structural and evolutionary profiles, and these differences imply which form, complex or monomeric, is preferred in the biological condition for the subunit. We believe that the (PS)(2) server would be a useful tool for biologists who are interested not only in the structures of protein complexes but also in the coupling between subunits of the complexes. The (PS)(2) is freely available at http://ps2v3.life.nctu.edu.tw/.


Assuntos
Complexos Multiproteicos/química , Software , Internet , Modelos Moleculares , Conformação Proteica , Análise de Sequência de Proteína
4.
J Chem Inf Model ; 56(12): 2287-2291, 2016 12 27.
Artigo em Inglês | MEDLINE | ID: mdl-27976886

RESUMO

The structure of a protein determines its biological function(s) and its interactions with other factors; the binding regions tend to be conserved in sequence and structure, and the interacting residues involved are usually in close 3D space. The Protein Data Bank currently contains more than 110 000 protein structures, approximately one-third of which contain metal ions. Identifying and characterizing metal ion-binding sites is thus essential for investigating a protein's function(s) and interactions. However, experimental approaches are time-consuming and costly. The web server reported here was built to predict metal ion-binding residues and to generate the predicted metal ion-bound 3D structure. Binding templates have been constructed for regions that bind 12 types of metal ion-binding residues have been used to construct binding templates. The templates include residues within 3.5 Šof the metal ion, and the fragment transformation method was used for structural comparison between query proteins and templates without any data training. Through the adjustment of scoring functions, which are based on the similarity of structure and binding residues. Twelve kinds of metal ions (Ca2+, Cu2+, Fe3+, Mg2+, Mn2+, Zn2+, Cd2+, Fe2+, Ni2+, Hg2+, Co2+, and Cu+) binding residues prediction are supported. MIB also provides the metal ions docking after prediction. The MIB server is available at http://bioinfo.cmu.edu.tw/MIB/ .


Assuntos
Metais/metabolismo , Simulação de Acoplamento Molecular , Proteínas/metabolismo , Sítios de Ligação , Cátions/metabolismo , Bases de Dados de Proteínas , Internet , Conformação Proteica , Proteínas/química , Software
5.
BMC Bioinformatics ; 16 Suppl 1: S7, 2015.
Artigo em Inglês | MEDLINE | ID: mdl-25708145

RESUMO

BACKGROUND: Characterizing the interface residues will help shed light on protein-protein interactions, which are involved in many important biological processes. Many studies focus on characterizing sequence or structure features of protein interfaces, but there are few studies characterizing the dynamics of interfaces. Therefore, we would like to know whether there is any specific dynamics pattern in the protein-protein interaction interfaces. Thermal fluctuation is an important dynamical property for a residue, and could be quickly estimated by local packing density without large computation since studies have showen closely relationship between these two properties. Therefore, we divided surface of an unbound subunit (free protein subunits before they are involved in forming the protein complexes) into several separate regions, and compared their average thermal fluctuations of different regions in order to characterize the dynamics pattern in unbound protein-protein interaction interfaces. RESULTS: We used weighted contact numbers (WCN), a parameter-free method to quantify packing density, to estimate the thermal fluctuations of residues in the interfaces. By analyzing the WCN distributions of interfaces in unbound subunits from 1394 non-homologous protein complexes, we show that the residues in the central regions of interfaces have higher packing density (i.e. more rigid); on the other hand, residues surrounding the central regions have smaller packing density (i.e. more flexible). The distinct distributions of packing density, suggesting distinct thermal fluctuation, reveals specific dynamics pattern in the interface of unbound protein subunits. CONCLUSIONS: We found general trend that the unbound protein-protein interaction interfaces consist of rigid residues in the central regions, which are surrounded by flexible residues. This finding suggests that the dynamics might be one of the important features for the formation of protein complexes.


Assuntos
Biologia Computacional/métodos , Mapeamento de Interação de Proteínas/métodos , Subunidades Proteicas/química , Subunidades Proteicas/metabolismo , Modelos Moleculares , Ligação Proteica , Conformação Proteica
6.
Proteins ; 83(8): 1407-13, 2015 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-25846748

RESUMO

The conservation profile of a protein is a curve of the conservation levels of amino acids along the sequence. Biologists are usually more interested in individual points on the curve (namely, the conserved amino acids) than the overall shape of the curve. Here, we show that the conservation curves of proteins bear the imprints of molecules that are evolutionarily coupled to the proteins. Our method is based on recent studies that a sequence conservation profile is quantitatively linked to its structural packing profile. We find that the conservation profiles of nucleic acid (NA) binding proteins are better correlated with the packing profiles of the protein-NA complexes than those of the proteins alone. This indicates that a nucleic acid binding protein evolves to accommodate the nucleic acid in such a way that the residues involved in binding have their conservation levels closely coupled with the specific nucleotides.


Assuntos
Sequência de Aminoácidos , Sequência Conservada , Proteínas de Ligação a DNA/química , Evolução Molecular , Homologia de Sequência de Aminoácidos , DNA/química , DNA/metabolismo , Proteínas de Ligação a DNA/metabolismo , Bases de Dados de Proteínas , Modelos Moleculares , Ligação Proteica , Conformação Proteica
7.
Mol Biol Evol ; 31(1): 135-9, 2014 Jan.
Artigo em Inglês | MEDLINE | ID: mdl-24109601

RESUMO

Protein sequences evolve under selection pressures imposed by functional and biophysical requirements, resulting in site-dependent rates of amino acid substitution. Relative solvent accessibility (RSA) and local packing density (LPD) have emerged as the best candidates to quantify structural constraint. Recent research assumes that RSA is the main determinant of sequence divergence. However, it is not yet clear which is the best predictor of substitution rates. To address this issue, we compared RSA and LPD with site-specific rates of evolution for a diverse data set of enzymes. In contrast with recent studies, we found that LPD measures correlate better than RSA with evolutionary rate. Moreover, the independent contribution of RSA is minor. Taking into account that LPD is related to backbone flexibility, we put forward the possibility that the rate of evolution of a site is determined by the ease with which the backbone deforms to accommodate mutations.


Assuntos
Enzimas/química , Evolução Molecular , Relação Estrutura-Atividade , Substituição de Aminoácidos , Mutação , Conformação Proteica , Solventes
8.
BMC Evol Biol ; 14: 78, 2014 Apr 09.
Artigo em Inglês | MEDLINE | ID: mdl-24716445

RESUMO

BACKGROUND: Protein sites evolve at different rates due to functional and biophysical constraints. It is usually considered that the main structural determinant of a site's rate of evolution is its Relative Solvent Accessibility (RSA). However, a recent comparative study has shown that the main structural determinant is the site's Local Packing Density (LPD). LPD is related with dynamical flexibility, which has also been shown to correlate with sequence variability. Our purpose is to investigate the mechanism that connects a site's LPD with its rate of evolution. RESULTS: We consider two models: an empirical Flexibility Model and a mechanistic Stress Model. The Flexibility Model postulates a linear increase of site-specific rate of evolution with dynamical flexibility. The Stress Model, introduced here, models mutations as random perturbations of the protein's potential energy landscape, for which we use simple Elastic Network Models (ENMs). To account for natural selection we assume a single active conformation and use basic statistical physics to derive a linear relationship between site-specific evolutionary rates and the local stress of the mutant's active conformation.We compare both models on a large and diverse dataset of enzymes. In a protein-by-protein study we found that the Stress Model outperforms the Flexibility Model for most proteins. Pooling all proteins together we show that the Stress Model is strongly supported by the total weight of evidence. Moreover, it accounts for the observed nonlinear dependence of sequence variability on flexibility. Finally, when mutational stress is controlled for, there is very little remaining correlation between sequence variability and dynamical flexibility. CONCLUSIONS: We developed a mechanistic Stress Model of evolution according to which the rate of evolution of a site is predicted to depend linearly on the local mutational stress of the active conformation. Such local stress is proportional to LPD, so that this model explains the relationship between LPD and evolutionary rate. Moreover, the model also accounts for the nonlinear dependence between evolutionary rate and dynamical flexibility.


Assuntos
Evolução Molecular , Proteínas/genética , Estresse Mecânico , Evolução Biológica , Modelos Genéticos , Maleabilidade
9.
Nucleic Acids Res ; 40(Web Server issue): W228-31, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22693223

RESUMO

Knotted proteins have recently received lots of attention due to their interesting topological novelty as well as its puzzling folding mechanisms. We previously published a pKNOT server, which provides a structural database of knotted proteins, analysis tools for detecting and analyzing knotted regions from structures as well as a Java-based 3D graphics viewer for visualizing knotted structures. However, there lacks a convenient platform performing similar tasks directly from 'protein sequences'. In the current version of the web server, referred to as pKNOT v.2, we implement a homology modeling tool such that the server can now accept protein sequences in addition to 3D structures or Protein Data Bank (PDB) IDs and return knot analysis. In addition, we have updated the database of knotted proteins from the current PDB with a combination of automatic and manual procedure. We believe that the updated pKNOT server with its extended functionalities will provide better service to biologists interested in the research of knotted proteins. The pKNOT v.2 is available from http://pknot.life.nctu.edu.tw/.


Assuntos
Software , Homologia Estrutural de Proteína , Internet , Conformação Proteica , Análise de Sequência de Proteína
10.
Nucleic Acids Res ; 40(Web Server issue): W232-7, 2012 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-22693212

RESUMO

Circular permutation (CP) is a protein structural rearrangement phenomenon, through which nature allows structural homologs to have different locations of termini and thus varied activities, stabilities and functional properties. It can be applied in many fields of protein research and bioengineering. The limitation of applying CP lies in its technical complexity, high cost and uncertainty of the viability of the resulting protein variants. Not every position in a protein can be used to create a viable circular permutant, but there is still a lack of practical computational tools for evaluating the positional feasibility of CP before costly experiments are carried out. We have previously designed a comprehensive method for predicting viable CP cleavage sites in proteins. In this work, we implement that method into an efficient and user-friendly web server named CPred (CP site predictor), which is supposed to be helpful to promote fundamental researches and biotechnological applications of CP. The CPred is accessible at http://sarst.life.nthu.edu.tw/CPred.


Assuntos
Conformação Proteica , Software , Inteligência Artificial , Internet , Probabilidade , Interface Usuário-Computador
11.
Proteins ; 81(7): 1192-9, 2013 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-23408640

RESUMO

We have recently showed that the weighted contact number profiles (or the packing density profiles) of proteins are well correlated with those of the corresponding sequence conservation profiles. The results suggest that a protein structure may contain sufficient information about sequence conservation comparable to that derived from multiple homologous sequences. However, there are ambiguities concerning how to compute the packing density of the subunit of a protein complex. For the subunits of a complex, there are different ways to compute its packing density--one including the packing contributions of the other subunits and the other one excluding their contributions. Here we selected two sets of enzyme complexes. Set A contains complexes with the active sites comprising residues from multiple subunits, while set B contains those with the active sites residing on single subunits. In Set A, if the packing density profile of a subunit is computed considering the contributions of the other subunits of the complex, it will agree better with the sequence conservation profile. But in Set B the situations are reversed. The results may be due to the stronger functional and structural constraints on the evolution processes on the complexes of Set A than those of Set B to maintain the enzymatic functions of the complexes. The comparison of the packing density and the sequence conservation profiles may provide a simple yet potentially useful way to understanding the structural and evolutionary couplings between the subunits of protein complexes.


Assuntos
Sequência de Aminoácidos , Sequência Conservada , Complexos Multiproteicos/química , Proteínas/química , Sítios de Ligação , Domínio Catalítico , Bases de Dados de Proteínas , Evolução Molecular , Alinhamento de Sequência , Relação Estrutura-Atividade
12.
J Membr Biol ; 246(12): 959-66, 2013 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-24121627

RESUMO

H⁺-translocating pyrophosphatase (H⁺-PPase, EC 3.6.1.1) plays an important role in acidifying vacuoles by transporting protons across membranes at the expense of pyrophosphate (PP(i)) hydrolysis. Vigna radiata H⁺-PPase (VrH⁺-PPase) contains 16 transmembrane helices (TMs). The hydrophobicity of TM3 is relatively lower than that of most other TMs, and the amino acids in this TM are highly conserved in plants. Furthermore, TM5 and -6, which are the core TMs involving in H⁺-PPase functions, are near TM3. It is thus proposed that TM3 is associated with H⁺-PPase activity. To address this possibility, site-directed mutagenesis was applied in this investigation to determine the role of TM3 in VrH⁺-PPase. Upon alanine/serine substitution, T138 and S142, whose side chains face toward the center TMs, were found to be involved in efficient proton transport. G149/S153 and G160/A164 pairs at the crucial termini of the two GxxxG-like motifs are indispensable in maintaining enzymatic activities and conformational stability. Moreover, stability in the vicinity surrounding G149 is pivotal for efficient expression. S153, M161 and A164 are critical for the K⁺-mediated stimulation of H⁺-PPase. Taken together, our results demonstrate that TM3 plays essential roles in PP(i) hydrolysis, proton transport, expression, and K⁺ stimulation of H⁺-PPase.


Assuntos
Pirofosfatase Inorgânica/química , Pirofosfatase Inorgânica/metabolismo , Proteínas de Plantas , Domínios e Motivos de Interação entre Proteínas , Sequência de Aminoácidos , Substituição de Aminoácidos , Ativação Enzimática , Expressão Gênica , Hidrólise , Pirofosfatase Inorgânica/genética , Íons/metabolismo , Leucina/química , Modelos Moleculares , Dados de Sequência Molecular , Mutação , Estrutura Secundária de Proteína , Alinhamento de Sequência
13.
Proteins ; 80(6): 1647-57, 2012 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-22454236

RESUMO

The knowledge of conserved sequences in proteins is valuable in identifying functionally or structurally important residues. Generating the conservation profile of a sequence requires aligning families of homologous sequences and having knowledge of their evolutionary relationships. Here, we report that the conservation profile at the residue level can be quantitatively derived from a single protein structure with only backbone information. We found that the reciprocal packing density profiles of protein structures closely resemble their sequence conservation profiles. For a set of 554 nonhomologous enzymes, 74% (408/554) of the proteins have a correlation coefficient > 0.5 between these two profiles. Our results indicate that the three-dimensional structure, instead of being a mere scaffold for positioning amino acid residues, exerts such strong evolutionary constraints on the residues of the protein that its profile of sequence conservation essentially reflects that of its structural characteristics.


Assuntos
Sequência Conservada , Enzimas/química , Enzimas/genética , Evolução Molecular , Domínio Catalítico , Bases de Dados de Proteínas , Conformação Proteica , Estrutura Terciária de Proteína , Subunidades Proteicas , Alinhamento de Sequência , Homologia de Sequência de Aminoácidos
14.
BMC Struct Biol ; 11: 43, 2011 Oct 21.
Artigo em Inglês | MEDLINE | ID: mdl-22013910

RESUMO

BACKGROUND: Multiple acyl-coenzyme A dehydrogenase deficiency (MADD) is an autosomal recessive disease caused by the defects in the mitochondrial electron transfer system and the metabolism of fatty acids. Recently, mutations in electron transfer flavoprotein dehydrogenase (ETFDH) gene, encoding electron transfer flavoprotein:ubiquinone oxidoreductase (ETF:QO) have been reported to be the major causes of riboflavin-responsive MADD. To date, no studies have been performed to explore the functional impact of these mutations or their mechanism of disrupting enzyme activity. RESULTS: High resolution melting (HRM) analysis and sequencing of the entire ETFDH gene revealed a novel mutation (p.Phe128Ser) and the hotspot mutation (p.Ala84Thr) from a patient with MADD. According to the predicted 3D structure of ETF:QO, the two mutations are located within the flavin adenine dinucleotide (FAD) binding domain; however, the two residues do not have direct interactions with the FAD ligand. Using molecular dynamics (MD) simulations and normal mode analysis (NMA), we found that the p.Ala84Thr and p.Phe128Ser mutations are most likely to alter the protein structure near the FAD binding site as well as disrupt the stability of the FAD binding required for the activation of ETF:QO. Intriguingly, NMA revealed that several reported disease-causing mutations in the ETF:QO protein show highly correlated motions with the FAD-binding site. CONCLUSIONS: Based on the present findings, we conclude that the changes made to the amino acids in ETF:QO are likely to influence the FAD-binding stability.


Assuntos
Flavoproteínas Transferidoras de Elétrons/química , Flavina-Adenina Dinucleotídeo/química , Proteínas Ferro-Enxofre/química , Oxirredutases atuantes sobre Doadores de Grupo CH-NH/química , Sítios de Ligação , Criança , Biologia Computacional , Flavoproteínas Transferidoras de Elétrons/genética , Flavoproteínas Transferidoras de Elétrons/metabolismo , Ativação Enzimática , Feminino , Flavina-Adenina Dinucleotídeo/metabolismo , Humanos , Proteínas Ferro-Enxofre/genética , Proteínas Ferro-Enxofre/metabolismo , Simulação de Dinâmica Molecular , Deficiência Múltipla de Acil Coenzima A Desidrogenase/etiologia , Deficiência Múltipla de Acil Coenzima A Desidrogenase/genética , Mutação , Oxirredutases atuantes sobre Doadores de Grupo CH-NH/genética , Oxirredutases atuantes sobre Doadores de Grupo CH-NH/metabolismo , Estrutura Terciária de Proteína
15.
BMC Bioinformatics ; 10: 366, 2009 Oct 31.
Artigo em Inglês | MEDLINE | ID: mdl-19878598

RESUMO

BACKGROUND: Template selection and target-template alignment are critical steps for template-based modeling (TBM) methods. To identify the template for the twilight zone of 15~25% sequence similarity between targets and templates is still difficulty for template-based protein structure prediction. This study presents the (PS)2-v2 server, based on our original server with numerous enhancements and modifications, to improve reliability and applicability. RESULTS: To detect homologous proteins with remote similarity, the (PS)2-v2 server utilizes the S2A2 matrix, which is a 60 x 60 substitution matrix using the secondary structure propensities of 20 amino acids, and the position-specific sequence profile (PSSM) generated by PSI-BLAST. In addition, our server uses multiple templates and multiple models to build and assess models. Our method was evaluated on the Lindahl benchmark for fold recognition and ProSup benchmark for sequence alignment. Evaluation results indicated that our method outperforms sequence-profile approaches, and had comparable performance to that of structure-based methods on these benchmarks. Finally, we tested our method using the 154 TBM targets of the CASP8 (Critical Assessment of Techniques for Protein Structure Prediction) dataset. Experimental results show that (PS)2-v2 is ranked 6th among 72 severs and is faster than the top-rank five serves, which utilize ab initio methods. CONCLUSION: Experimental results demonstrate that (PS)2-v2 with the S2A2 matrix is useful for template selections and target-template alignments by blending the amino acid and structural propensities. The multiple-template and multiple-model strategies are able to significantly improve the accuracies for target-template alignments in the twilight zone. We believe that this server is useful in structure prediction and modeling, especially in detecting homologous templates with sequence similarity in the twilight zone.


Assuntos
Biologia Computacional/métodos , Conformação Proteica , Proteínas/química , Software , Sítios de Ligação , Dobramento de Proteína
16.
Nucleic Acids Res ; 35(Web Server issue): W420-4, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17526524

RESUMO

Knotted proteins are more commonly observed in recent years due to the enormously growing number of structures in the Protein Data Bank (PDB). Studies show that the knot regions contribute to both ligand binding and enzyme activity in proteins such as the chromophore-binding domain of phytochrome, ketol-acid reductoisomerase or SpoU methyltransferase. However, there are still many misidentified knots published in the literature due to the absence of a convenient web tool available to the general biologists. Here, we present the first web server to detect the knots in proteins as well as provide information on knotted proteins in PDB-the protein KNOT (pKNOT) web server. In pKNOT, users can either input PDB ID or upload protein coordinates in the PDB format. The pKNOT web server will detect the knots in the protein using the Taylor's smoothing algorithm. All the detected knots can be visually inspected using a Java-based 3D graphics viewer. We believe that the pKNOT web server will be useful to both biologists in general and structural biologists in particular.


Assuntos
Algoritmos , Biologia Computacional/métodos , Conformação Proteica , Dobramento de Proteína , Proteínas/química , Simulação por Computador , Bases de Dados de Proteínas , Internet , Modelos Moleculares , Modelos Estatísticos , Fitocromo/química , Software , Interface Usuário-Computador
17.
Nucleic Acids Res ; 35(Web Server issue): W588-94, 2007 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-17517770

RESUMO

Due to the importance of protein phosphorylation in cellular control, many researches are undertaken to predict the kinase-specific phosphorylation sites. Referred to our previous work, KinasePhos 1.0, incorporated profile hidden Markov model (HMM) with flanking residues of the kinase-specific phosphorylation sites. Herein, a new web server, KinasePhos 2.0, incorporates support vector machines (SVM) with the protein sequence profile and protein coupling pattern, which is a novel feature used for identifying phosphorylation sites. The coupling pattern [XdZ] denotes the amino acid coupling-pattern of amino acid types X and Z that are separated by d amino acids. The differences or quotients of coupling strength C(XdZ) between the positive set of phosphorylation sites and the background set of whole protein sequences from Swiss-Prot are computed to determine the number of coupling patterns for training SVM models. After the evaluation based on k-fold cross-validation and Jackknife cross-validation, the average predictive accuracy of phosphorylated serine, threonine, tyrosine and histidine are 90, 93, 88 and 93%, respectively. KinasePhos 2.0 performs better than other tools previously developed. The proposed web server is freely available at http://KinasePhos2.mbc.nctu.edu.tw/.


Assuntos
Biologia Computacional/métodos , Fosfoproteínas/química , Proteínas Quinases/metabolismo , Análise de Sequência de Proteína/métodos , Software , Domínio Catalítico , Simulação por Computador , Internet , Cadeias de Markov , Redes Neurais de Computação , Fosfoproteínas/metabolismo , Fosforilação , Probabilidade , Sensibilidade e Especificidade , Interface Usuário-Computador
18.
Nat Commun ; 10(1): 3131, 2019 07 16.
Artigo em Inglês | MEDLINE | ID: mdl-31311925

RESUMO

Alterations in membrane proteins (MPs) and their regulated pathways have been established as cancer hallmarks and extensively targeted in clinical applications. However, the analysis of MP-interacting proteins and downstream pathways across human malignancies remains challenging. Here, we present a systematically integrated method to generate a resource of cancer membrane protein-regulated networks (CaMPNets), containing 63,746 high-confidence protein-protein interactions (PPIs) for 1962 MPs, using expression profiles from 5922 tumors with overall survival outcomes across 15 human cancers. Comprehensive analysis of CaMPNets links MP partner communities and regulated pathways to provide MP-based gene sets for identifying prognostic biomarkers and druggable targets. For example, we identify CHRNA9 with 12 PPIs (e.g., ERBB2) can be a therapeutic target and find its anti-metastasis agent, bupropion, for treatment in nicotine-induced breast cancer. This resource is a study to systematically integrate MP interactions, genomics, and clinical outcomes for helping illuminate cancer-wide atlas and prognostic landscapes in tumor homo/heterogeneity.


Assuntos
Biomarcadores Tumorais/genética , Redes Reguladoras de Genes , Neoplasias/genética , Receptores Nicotínicos/genética , Animais , Antineoplásicos/farmacologia , Antineoplásicos/uso terapêutico , Biomarcadores Tumorais/antagonistas & inibidores , Biomarcadores Tumorais/metabolismo , Bupropiona/farmacologia , Bupropiona/uso terapêutico , Linhagem Celular Tumoral , Conjuntos de Dados como Assunto , Feminino , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica/efeitos dos fármacos , Humanos , Estimativa de Kaplan-Meier , Camundongos , Neoplasias/tratamento farmacológico , Neoplasias/mortalidade , Antagonistas Nicotínicos/farmacologia , Antagonistas Nicotínicos/uso terapêutico , Prognóstico , Mapeamento de Interação de Proteínas/métodos , Mapas de Interação de Proteínas/efeitos dos fármacos , Mapas de Interação de Proteínas/genética , Receptores Nicotínicos/metabolismo , Ensaios Antitumorais Modelo de Xenoenxerto
19.
BMC Bioinformatics ; 9 Suppl 12: S6, 2008 Dec 12.
Artigo em Inglês | MEDLINE | ID: mdl-19091029

RESUMO

BACKGROUND: RNA-protein interaction plays an essential role in several biological processes, such as protein synthesis, gene expression, posttranscriptional regulation and viral infectivity. Identification of RNA-binding sites in proteins provides valuable insights for biologists. However, experimental determination of RNA-protein interaction remains time-consuming and labor-intensive. Thus, computational approaches for prediction of RNA-binding sites in proteins have become highly desirable. Extensive studies of RNA-binding site prediction have led to the development of several methods. However, they could yield low sensitivities in trade-off for high specificities. RESULTS: We propose a method, RNAProB, which incorporates a new smoothed position-specific scoring matrix (PSSM) encoding scheme with a support vector machine model to predict RNA-binding sites in proteins. Besides the incorporation of evolutionary information from standard PSSM profiles, the proposed smoothed PSSM encoding scheme also considers the correlation and dependency from the neighboring residues for each amino acid in a protein. Experimental results show that smoothed PSSM encoding significantly enhances the prediction performance, especially for sensitivity. Using five-fold cross-validation, our method performs better than the state-of-the-art systems by 4.90%-6.83%, 0.88%-5.33%, and 0.10-0.23 in terms of overall accuracy, specificity, and Matthew's correlation coefficient, respectively. Most notably, compared to other approaches, RNAProB significantly improves sensitivity by 7.0%-26.9% over the benchmark data sets. To prevent data over fitting, a three-way data split procedure is incorporated to estimate the prediction performance. Moreover, physicochemical properties and amino acid preferences of RNA-binding proteins are examined and analyzed. CONCLUSION: Our results demonstrate that smoothed PSSM encoding scheme significantly enhances the performance of RNA-binding site prediction in proteins. This also supports our assumption that smoothed PSSM encoding can better resolve the ambiguity of discriminating between interacting and non-interacting residues by modelling the dependency from surrounding residues. The proposed method can be used in other research areas, such as DNA-binding site prediction, protein-protein interaction, and prediction of posttranslational modification sites.


Assuntos
Biologia Computacional/métodos , Proteínas/química , RNA/química , Algoritmos , Aminoácidos/química , Inteligência Artificial , Sítios de Ligação , DNA/química , Evolução Molecular , Redes Neurais de Computação , Ligação Proteica/genética , Reprodutibilidade dos Testes , Software
20.
Proteins ; 72(2): 625-34, 2008 Aug.
Artigo em Inglês | MEDLINE | ID: mdl-18247347

RESUMO

Recently, we have developed a method (Shih et al., Proteins: Structure, Function, and Bioinformatics 2007;68: 34-38) to compute correlation of fluctuations of proteins. This method, referred to as the protein fixed-point (PFP) model, is based on the positional vectors of atoms issuing from the fixed point, which is the point of the least fluctuations in proteins. One corollary from this model is that atoms lying on the same shell centered at the fixed point will have the same thermal fluctuations. In practice, this model provides a convenient way to compute the average dynamical properties of proteins directly from the geometrical shapes of proteins without the need of any mechanical models, and hence no trajectory integration or sophisticated matrix operations are needed. As a result, it is more efficient than molecular dynamics simulation or normal mode analysis. Though in the previous study the PFP model has been successfully applied to a number of proteins of various folds, it is not clear to what extent this model will be applied. In this article, we have carried out the comprehensive analysis of the PFP model for a dataset comprising 972 high-resolution X-ray structures with pairwise sequence identity or=0.5. Our result shows that the fixed-point model is indeed quite general and will be a useful tool for high throughput analysis of dynamical properties of proteins.


Assuntos
Proteínas/química , Modelos Moleculares , Conformação Proteica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA