Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 7 de 7
Filtrar
Mais filtros










Base de dados
Intervalo de ano de publicação
1.
Proteomics ; 8(4): 750-78, 2008 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-18297652

RESUMO

Describing the connectivity of chemical and/or biological systems using networks is a straight gate for the introduction of mathematical tools in proteomics. Networks, in some cases even very large ones, are simple objects that are composed at least by nodes and edges. The nodes represent the parts of the system and the edges geometric and/or functional relationships between parts. In proteomics, amino acids, proteins, electrophoresis spots, polypeptidic fragments, or more complex objects can play the role of nodes. All of these networks can be numerically described using the so-called Connectivity Indices (CIs). The transformation of graphs (a picture) into CIs (numbers) facilitates the manipulation of information and the search for structure-function relationships in Proteomics. In this work, we review and comment on the challenges and new trends in the definition and applications of CIs in Proteomics. Emphasis is placed on 1-D-CIs for DNA and protein sequences, 2-D-CIs for RNA secondary structures, 3-D-topographic indices (TPGIs) for protein function annotation without alignment, 2-D-CIs and 3-D-TPGIs for the study of drug-protein or drug-RNA quantitative structure-binding relationships, and pseudo 3-D-CIs for protein surface molecular recognition. We also focus on CIs to describe Protein Interaction Networks or RNA co-expression networks. 2-D-CIs for patient blood proteome 2-DE maps or mass spectra are also covered.


Assuntos
Biologia Computacional/métodos , Proteômica/métodos , Sequência de Aminoácidos , Sequência de Bases , Proteínas Sanguíneas/química , DNA/química , Humanos , Ligantes , Masculino , Redes e Vias Metabólicas , Modelos Moleculares , Filogenia , Antígeno Prostático Específico/química , Ligação Proteica , Relação Quantitativa Estrutura-Atividade , RNA/química
2.
J Comput Chem ; 28(6): 1042-8, 2007 Apr 30.
Artigo em Inglês | MEDLINE | ID: mdl-17269125

RESUMO

Three-dimensional (3D) protein structures now frequently lack functional annotations because of the increase in the rate at which chemical structures are solved with respect to experimental knowledge of biological activity. As a result, predicting structure-function relationships for proteins is an active research field in computational chemistry and has implications in medicinal chemistry, biochemistry and proteomics. In previous studies stochastic spectral moments were used to predict protein stability or function (González-Díaz, H. et al. Bioorg Med Chem 2005, 13, 323; Biopolymers 2005, 77, 296). Nevertheless, these moments take into consideration only electrostatic interactions and ignore other important factors such as van der Waals interactions. The present study introduces a new class of 3D structure molecular descriptors for folded proteins named the stochastic van der Waals spectral moments ((o)beta(k)). Among many possible applications, recognition of kinases was selected due to the fact that previous computational chemistry studies in this area have not been reported, despite the widespread distribution of kinases. The best linear model found was Kact = -9.44 degrees beta(0)(c) +10.94 degrees beta(5)(c) -2.40 degrees beta(0)(i) + 2.45 degrees beta(5)(m) + 0.73, where core (c), inner (i) and middle (m) refer to specific spatial protein regions. The model with a high Matthew's regression coefficient (0.79) correctly classified 206 out of 230 proteins (89.6%) including both training and predicting series. An area under the ROC curve of 0.94 differentiates our model from a random classifier. A subsequent principal components analysis of 152 heterogeneous proteins demonstrated that beta(k) codifies information different to other descriptors used in protein computational chemistry studies. Finally, the model recognizes 110 out of 125 kinases (88.0%) in a virtual screening experiment and this can be considered as an additional validation study (these proteins were not used in training or predicting series).


Assuntos
Biologia Computacional/métodos , Proteínas Quinases/química , Relação Quantitativa Estrutura-Atividade , Algoritmos , Entropia , Cadeias de Markov , Análise de Componente Principal , Conformação Proteica , Dobramento de Proteína , Proteínas Quinases/metabolismo , Proteínas/química , Curva ROC , Eletricidade Estática
3.
Bioorg Med Chem ; 15(2): 962-8, 2007 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-17081758

RESUMO

In this work we report a QSAR model that discriminates between chemically heterogeneous classes of anticoccidial and non-anticoccidial compounds. For this purpose we used the Markovian Chemicals in silico Design (MARCH-INSIDE) approach J. Mol. Mod.2002, 8, 237-245; J. Mol. Mod.2003, 9, 395-407]. Linear discriminant analysis allowed us to fit the discriminant function. This function correctly classifies 86.67% of anticoccidial compounds and 96.23% of inactive compounds in the training series. Overall classification is 94.12%. We validated the model by means of an external predicting series, with 86.96% of global predictability. Remarkably, the present model is based on topological as well as configuration-dependent molecular descriptors. Therefore, the model performs timely calculations and allows discrimination between Z/E and chiral isomers. Finally, to exemplify the use of the model in practice we report the prediction and experimental assay of trans-2-(2-nitrovinyl)furan. It is notable that lesion control was 72.86% at mg/kg of body weight with respect to 60% at 125 mg/kg for amprolium (control drug). The back-projection map for this compound predicts a high level of importance for the double bond and for the nitro group in the trans position. We conclude that the MARCH-INSIDE approach enables the accurate fast track identification of anticoccidial hits. Moreover, trans-2-(2-nitrovinyl)furan seems to be a promising drug for the treatment of coccidiosis.


Assuntos
Coccidiostáticos/síntese química , Coccidiostáticos/farmacologia , Furanos/síntese química , Furanos/farmacologia , Compostos de Vinila/síntese química , Compostos de Vinila/farmacologia , Animais , Peso Corporal/efeitos dos fármacos , Galinhas , Eimeria tenella/efeitos dos fármacos , Feminino , Modelos Moleculares , Relação Quantitativa Estrutura-Atividade , Reprodutibilidade dos Testes
4.
Bull Math Biol ; 68(7): 1555-72, 2006 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-16865609

RESUMO

MARCH-INSIDE methodology and a statistical classification method--linear discriminant analysis (LDA)--is proposed as an alternative method to the Draize eye irritation test. This methodology has been successfully applied to a set of 46 neutral organic chemicals, which have been defined as ocular irritant or nonirritant. The model allow to categorize correctly 37 out of 46 compounds, showing an accuracy of 80.46%. Specifically, this model demonstrates the existence of a good categorization average of 91.67 and 76.47% for irritant and nonirritant compounds, respectively. Validation of the model was carried out using two cross-validation tools: Leave-one-out (LOO) and leave-group-out (LGO), showing a global predictability of the model of 71.7 and 70%, respectively. The average of coincidence of the predictions between leave-one-out/leave-group-out studies and train set were 91.3% (42 out of 46 cases)/89.1% (41 out of 46 cases) proving the robustness of the model obtained. Ocular irritancy distribution diagram is carried out in order to determine the intervals of the property where the probability of finding an irritant compound is maximal relating to the choice of find a false nonirritant one. It seems that, until today, the present model may be the first predictive linear discriminant equation able to discriminate between eye irritant and nonirritant chemicals.


Assuntos
Traumatismos Oculares/induzido quimicamente , Irritantes/classificação , Cadeias de Markov , Modelos Biológicos , Compostos Orgânicos/classificação , Algoritmos , Alternativas aos Testes com Animais/métodos , Animais , Reações Falso-Positivas , Humanos , Irritantes/química , Irritantes/toxicidade , Compostos Orgânicos/toxicidade , Probabilidade , Relação Quantitativa Estrutura-Atividade , Curva ROC , Testes de Toxicidade Aguda
5.
J Inorg Biochem ; 100(7): 1290-7, 2006 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-16684570

RESUMO

Genomics projects have elucidated several genes that encode protein sequences. Subsequently, the advent of the proteomics age has enabled the synthesis and 3D structure determination for these protein sequences. Some of these proteins incorporate metal atoms but it is often not known whether they are metal-binding proteins and the nature of the biological activity is not understood. Consequently, the development of methods to predict metal-mediated biological activity of proteins from the 3D structure of metal-unbound proteins is a goal of major importance. More specifically, the amino terminal Cu(II)- and Ni(II)-binding (ATCUN) motif is a small metal-binding site found in the N-terminus of many naturally occurring proteins. The ATCUN motif participates in DNA cleavage and has anti-tumor activity. In this study, we calculated average 3D electrostatic potentials (xi(k)) for 265 different proteins including 133 potential ATCUN anti-tumor proteins. We also calculated xi(k) values for the total protein or for the following specific protein regions: the core, inner, middle, and outer orbits. A linear discriminant analysis model was subsequently developed to assign proteins into two groups called ATCUN DNA-cleavage proteins and non-active proteins. The best model found was: ATCUN=1.15.xi(1)(inner)+2.18.xi(5)(middle)+27.57.xi(0)(outer)-27.57.xi(0)(total)+0.09. The model correctly classified 182 out of 197 (91.4%) and 61 out of 66 (92.4%) proteins in training and external predicting series', respectively. Finally, desirability analysis was used to predict the values for the electrostatic potential in one single region and the combined values in two regions that are desirable for ATCUN-like proteins. To the best of our knowledge, the present work is the first study in which desirability analysis has been used in protein quantitative-structure-activity-relationship (QSAR).


Assuntos
DNA/metabolismo , Proteínas/metabolismo , Motivos de Aminoácidos , Hidrólise , Cadeias de Markov , Relação Quantitativa Estrutura-Atividade , Eletricidade Estática
6.
FEBS Lett ; 580(3): 723-30, 2006 Feb 06.
Artigo em Inglês | MEDLINE | ID: mdl-16413021

RESUMO

The development of 2D graph-theoretic representations for DNA sequences was very important for qualitative and quantitative comparison of sequences. Calculation of numeric features for these representations is useful for DNA-QSAR studies. Most of all graph-theoretic representations identify each one of the four bases with a unitary walk in one axe direction in the 2D space. In the case of proteins, twenty amino acids instead of four bases have to be considered. This fact has limited the introduction of useful 2D Cartesian representations and the corresponding sequences descriptors to encode protein sequence information. In this study, we overcome this problem grouping amino acids into four groups: acid, basic, polar and non-polar amino acids. The identification of each group with one of the four axis directions determines a novel 2D representation and numeric descriptors for proteins sequences. Afterwards, a Markov model has been used to calculate new numeric descriptors of the protein sequence. These descriptors are called herein the sequence 2D coupling numbers (zeta(k)). In this work, we calculated the zeta(k) values for 108 sequences of different polygalacturonases (PGs) and for 100 sequences of other proteins. A Linear Discriminant Analysis model derived here (PG=5.36.zeta1-3.98.zeta3-42.21) successfully discriminates between PGs and other proteins. The model correctly classified 100% of a subset of 81 PGs and 75 non-PG proteins sequences used to train the model. The model also correctly classified 51 out of 52 (98.07%) of proteins sequences used as external validation series. The uses of different group of amino acids and/or axes orientation give different results, so it is suggested to be explored for other databases. Finally, to illustrates the use of the model we report the isolation and prediction of the PG action for a novel sequence (AY908988) isolated by our group from Psidium guajava L. This prediction coincides very well with sequence alignment results found by the BLAST methodology. These findings illustrate the possibilities of the sequence descriptors derived for this novel 2D sequence representation in proteins sequence QSAR studies.


Assuntos
Algoritmos , Proteínas de Plantas/genética , Psidium/genética , Análise de Sequência de DNA , Software , Sequência de Aminoácidos , Processamento de Imagem Assistida por Computador , Dados de Sequência Molecular , Psidium/enzimologia
7.
Bioorg Med Chem Lett ; 16(3): 547-53, 2006 Feb.
Artigo em Inglês | MEDLINE | ID: mdl-16275068

RESUMO

The general belief is that quantitative structure-activity relationship (QSAR) techniques work only for small molecules and, protein sequences or, more recently, DNA sequences. However, with non-branched graph for proteins and DNA sequences the QSAR often have to be based on powerful non-linear techniques such as support vector machines. In our opinion, linear QSAR models based on RNA could be useful to assign biological activity when alignment techniques fail due to low sequence homology. The idea bases the high level of branching for the RNA graph. This work introduces the so-called Markov electrostatic potentials (k)xi(M) as a new class of RNA 2D-structure descriptors. Subsequently, we validate these molecular descriptors solving a QSAR classification problem for mycobacterial promoter sequences (mps), which constitute a very low sequence homology problem. The model developed (mps=-4.664.(0)xi(M)+0. 991.(1)xi(M)-2.432) was intended to predict whether a naturally occurring sequence is an mps or not on the basis of the calculated (k)xi(M) value for the corresponding RNA secondary structure. The RNA-QSAR approach recognises 115/135mps (85.2%) and 100% of control sequences. Average predictability and robustness were greater than 95%. A previous non-linear model predicts mps with a slightly higher accuracy (97%) but uses a very large parameter space for DNA sequences. Conversely, the (k)xi(M)-based RNA-QSAR encodes more structural information and needs only two variables.


Assuntos
DNA Bacteriano/química , Mycobacterium/genética , Regiões Promotoras Genéticas , Homologia de Sequência , Proteínas de Bactérias/fisiologia , Sequência de Bases , DNA Bacteriano/genética , Análise Discriminante , Desenho de Fármacos , Cadeias de Markov , Modelos Biológicos , Dados de Sequência Molecular , Relação Quantitativa Estrutura-Atividade , RNA/análise , RNA/química , Eletricidade Estática
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA