Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 56
Filter
1.
GigaByte ; 2024: gigabyte114, 2024.
Article in English | MEDLINE | ID: mdl-38525218

ABSTRACT

Molecular Property Diagnostic Suite (MPDS) was conceived and developed as an open-source disease-specific web portal based on Galaxy. MPDSCOVID-19 was developed for COVID-19 as a one-stop solution for drug discovery research. Galaxy platforms enable the creation of customized workflows connecting various modules in the web server. The architecture of MPDSCOVID-19 effectively employs Galaxy v22.04 features, which are ported on CentOS 7.8 and Python 3.7. MPDSCOVID-19 provides significant updates and the addition of several new tools updated after six years. Tools developed by our group in Perl/Python and open-source tools are collated and integrated into MPDSCOVID-19 using XML scripts. Our MPDS suite aims to facilitate transparent and open innovation. This approach significantly helps bring inclusiveness in the community while promoting free access and participation in software development. Availability & Implementation: The MPDSCOVID-19 portal can be accessed at https://mpds.neist.res.in:8085/.

2.
PLoS One ; 12(8): e0180505, 2017.
Article in English | MEDLINE | ID: mdl-28767653

ABSTRACT

Shigellosis or bacillary dysentery is an important cause of diarrhea, with the majority of the cases occurring in developing countries. Considering the high disease burden, increasing antibiotic resistance, serotype-specific immunity and the post-infectious sequelae associated with shigellosis, there is a pressing need of an effective vaccine against multiple serotypes of the pathogen. In the present study, we used bio-informatics approach to identify antigens shared among multiple serotypes of Shigella spp. This approach led to the identification of many immunogenic peptides. The five most promising peptides based on MHC binding efficiency were a putative lipoprotein (EL PGI I), a putative heat shock protein (EL PGI II), Spa32 (EL PGI III), IcsB (EL PGI IV) and a hypothetical protein (EL PGI V). These peptides were synthesized and the immunogenicity was evaluated in BALB/c mice by ELISA and cytokine assays. The putative heat shock protein (HSP) and the hypothetical protein elicited good humoral response, whereas putative lipoprotein, Spa32 and IcsB elicited good T-cell response as revealed by increased IFN-γ and TNF-α cytokine levels. The patient sera from confirmed cases of shigellosis were also evaluated for the presence of peptide specific antibodies with significant IgG and IgA antibodies against the HSP and the hypothetical protein, bestowing them as potential future vaccine candidates. The antigens reported in this study are novel and have not been tested as vaccine candidates against Shigella. This study offers time and cost-effective way of identifying unprecedented immunogenic antigens to be used as potential vaccine candidates. Moreover, this approach should easily be extendable to find new potential vaccine candidates for other pathogenic bacteria.


Subject(s)
Dysentery, Bacillary/microbiology , Shigella Vaccines/immunology , Shigella/immunology , Animals , Antibodies, Bacterial/blood , Cytokines/analysis , Databases, Protein , Dysentery, Bacillary/immunology , Enzyme-Linked Immunosorbent Assay , Epitopes/immunology , Heat-Shock Proteins/immunology , Immunoglobulin A/blood , Immunoglobulin G/blood , Interferon-gamma/analysis , Mice , Mice, Inbred BALB C , Peptides/immunology , Serogroup , Shigella/classification , Tumor Necrosis Factor-alpha/analysis
3.
Sci Rep ; 7: 46541, 2017 04 19.
Article in English | MEDLINE | ID: mdl-28422156

ABSTRACT

Laterosporulin10 (LS10) is a defensin like peptide from Brevibacillus sp. strain SKDU10 that inhibited microbial pathogens. However, in this study, anticancer activity of LS10 was examined against different cancer cell lines and compared with normal cells. LS10 displayed cytotoxicity against cancer cells like MCF-7, HEK293T, HT1080, HeLa and H1299 at below 10 µM concentration, but not against prostate epithelium cells RWPE-1. Additionally, no hemolysis was observed at significantly higher concentration compared to IC50 values observed for different cancer cell lines. Release of lactate dehydrogenase from cancer cell lines at 15 µM concentration upon 120 min treatment indicated the lytic ability of LS10. Accordingly, electron microscopy experiments also confirmed the necrotic effect of LS10 at 15 µM concentration against cancer cells. Furthermore, flow cytometry analysis of treated cancer cell lines revealed that LS10 induce apoptosis even at 2.5 µM concentration. Nevertheless, RWPE-1 cells remained viable even at 20 µM concentration. These results provide evidence that LS10 is an anticancer bacteriocin, which causes apoptotic and necrotic death of cancer cells at lower and higher concentrations, respectively. Taken all results together, the present study signifies that LS10 is an anticancer peptide that could be further developed for therapeutic applications.


Subject(s)
Antineoplastic Agents/pharmacology , Bacteriocins/pharmacology , Defensins/pharmacology , Neoplasms/drug therapy , Antineoplastic Agents/chemistry , Bacteriocins/chemistry , Defensins/chemistry , Drug Screening Assays, Antitumor , HEK293 Cells , HeLa Cells , Humans , MCF-7 Cells , Neoplasms/metabolism , Neoplasms/pathology
4.
Anticancer Agents Med Chem ; 14(7): 928-35, 2014.
Article in English | MEDLINE | ID: mdl-24661111

ABSTRACT

BACKGROUND: Aberrant activity of epidermal growth factor receptor (EGFR) family proteins has been found to be associated with a number of human cancers including that of lung and breast. Consequently, the search for EGFR family inhibitors, a well established target of pharmacological and therapeutic value has been ongoing. Therefore, over the years several small molecules, which compete for ATP in the kinase domain have been synthesised and some of them have proved to be effective in attenuating EGFR mediated proliferation. Thus, there exists in literature a vast amount of experimental data on EGFR tyrosine kinase inhibitors. In this paper, we describe a comprehensive database EGFRIndb that contains details of the small molecular inhibitors of EGFR family. DESCRIPTION: EGFRIndb is a literature curated database of small synthetic molecular inhibitors of EGFR. It consists of 4581 compounds showing in vitro inhibitory activities (IC50, IC80, GI50, GI90, EC50, Ki, Kd and percentage inhibition) either against EGFR or its different isoforms i.e. Erbb2 (v-erb-b2 avian erythroblastic leukaemia viral oncogene homolog 2) and Erbb4 (v-erb-b2 avian erythroblastic leukaemia viral oncogene homolog 4) or various mutants. For each compound, database provides information on structure, experimentally determined inhibitory activity of compound against kinase as well as various cell lines, properties (physical, elemental and topological) and drug likeness. Additionally, it provides information on irreversible as well as dual inhibitors that have gained importance in recent years due to the emergence of clinical resistance to known drugs. As compound activity against similar kinases is a measure of its selectivity and specificity, the database also provides this information. It also provides simple search, advanced search, browse facility as well as a tool for structure based searching. CONCLUSION: EGFRIndb gathers biological and chemical information on EGFR inhibitors from the literature. It is hoped that it will serve as a useful resource in drug discovery and provide data for docking, virtual screening and Quantitative structure-activity relationship (QSAR) model development to the cancer researchers.


Subject(s)
Antineoplastic Agents/chemistry , Databases, Chemical , ErbB Receptors/antagonists & inhibitors , Protein Kinase Inhibitors/chemistry
5.
Curr Med Chem ; 21(21): 2367-91, 2014.
Article in English | MEDLINE | ID: mdl-24533809

ABSTRACT

Cancer is one of the leading causes of mortality worldwide, with more than 10 million new cases each year. Despite the presence of several anticancer agents, cancer treatment is still not very effective. Main reasons behind this high mortality rate are the lack of screening tests for early diagnosis, and non-availability of tumor specific drug delivery system. Most of the current anticancer drugs are unable to differentiate between cancerous and normal cells, leading to systemic toxicity, and adverse side effects. In order to tackle this problem, a considerable progress has been made over the years to identify peptides, which specifically bind to the tumor cells, and tumor vasculature (tumor homing peptides). With the advances in phage display technology, and combinatorial libraries like one-bead one-compound library, several hundreds of tumor homing peptides, and their derivatives, which have potential to detect tumor in vivo, and deliver anticancer agents specifically to the tumor site, have been discovered. Currently, many tumor homing peptide-based therapies for cancer treatment and diagnosis are being tested in various phases of clinical trials. In this review, we have discussed the progress made so far in the identification of tumor homing peptides, and their applications in cancer therapeutics, diagnosis, and theranostics. In addition, a brief discussion on tumor homing peptide resource, and in silico designing of tumor homing peptides has also been provided.


Subject(s)
Molecular Probes/metabolism , Neoplasms/diagnosis , Neoplasms/therapy , Peptides/metabolism , Animals , Clinical Trials as Topic , Drug Design , Humans , Neoplasms/metabolism
6.
Sci Rep ; 3: 2984, 2013 Oct 18.
Article in English | MEDLINE | ID: mdl-24136089

ABSTRACT

Use of therapeutic peptides in cancer therapy has been receiving considerable attention in the recent years. Present study describes the development of computational models for predicting and discovering novel anticancer peptides. Preliminary analysis revealed that Cys, Gly, Ile, Lys, and Trp are dominated at various positions in anticancer peptides. Support vector machine models were developed using amino acid composition and binary profiles as input features on main dataset that contains experimentally validated anticancer peptides and random peptides derived from SwissProt database. In addition, models were developed on alternate dataset that contains antimicrobial peptides instead of random peptides. Binary profiles-based model achieved maximum accuracy 91.44% with MCC 0.83. We have developed a webserver, which would be helpful in: (i) predicting minimum mutations required for improving anticancer potency; (ii) virtual screening of peptides for discovering novel anticancer peptides, and (iii) scanning natural proteins for identification of anticancer peptides (http://crdd.osdd.net/raghava/anticp/).


Subject(s)
Antineoplastic Agents/chemistry , Computer Simulation , Drug Design , Peptides/chemistry , Antimicrobial Cationic Peptides/chemistry , Databases, Factual , Humans , Position-Specific Scoring Matrices , ROC Curve , Reproducibility of Results , Sequence Analysis, Protein , Support Vector Machine , Web Browser
7.
Clin Dev Immunol ; 2013: 263952, 2013.
Article in English | MEDLINE | ID: mdl-24489573

ABSTRACT

The secretion of Interleukin-4 (IL4) is the characteristic of T-helper 2 responses. IL4 is a cytokine produced by CD4+ T cells in response to helminthes and other extracellular parasites. It has a critical role in guiding antibody class switching, hematopoiesis and inflammation, and the development of appropriate effector T-cell responses. In this study, it is the first time an attempt has been made to understand whether it is possible to predict IL4 inducing peptides. The data set used in this study comprises 904 experimentally validated IL4 inducing and 742 noninducing MHC class II binders. Our analysis revealed that certain types of residues are preferred at certain positions in IL4 inducing peptides. It was also observed that IL4 inducing and noninducing epitopes differ in compositional and motif pattern. Based on our analysis we developed classification models where the hybrid method of amino acid pairs and motif information performed the best with maximum accuracy of 75.76% and MCC of 0.51. These results indicate that it is possible to predict IL4 inducing peptides with reasonable precession. These models would be useful in designing the peptides that may induce desired Th2 response.


Subject(s)
Interleukin-4/chemistry , Peptide Fragments/chemistry , Alleles , Amino Acid Motifs , Amino Acid Sequence , Binding Sites , Epitope Mapping , Epitopes/chemistry , Epitopes/immunology , Epitopes/metabolism , Histocompatibility Antigens Class II/chemistry , Histocompatibility Antigens Class II/genetics , Histocompatibility Antigens Class II/immunology , Histocompatibility Antigens Class II/metabolism , Interleukin-4/immunology , Interleukin-4/metabolism , Models, Immunological , Peptide Fragments/immunology , Peptide Fragments/metabolism , Position-Specific Scoring Matrices , Protein Binding , Reproducibility of Results , Support Vector Machine , T-Lymphocytes/immunology , T-Lymphocytes/metabolism
8.
Database (Oxford) ; 2012: bas015, 2012.
Article in English | MEDLINE | ID: mdl-22403286

ABSTRACT

Delivering drug molecules into the cell is one of the major challenges in the process of drug development. In past, cell penetrating peptides have been successfully used for delivering a wide variety of therapeutic molecules into various types of cells for the treatment of multiple diseases. These peptides have unique ability to gain access to the interior of almost any type of cell. Due to the huge therapeutic applications of CPPs, we have built a comprehensive database 'CPPsite', of cell penetrating peptides, where information is compiled from the literature and patents. CPPsite is a manually curated database of experimentally validated 843 CPPs. Each entry provides information of a peptide that includes ID, PubMed ID, peptide name, peptide sequence, chirality, origin, nature of peptide, sub-cellular localization, uptake efficiency, uptake mechanism, hydrophobicity, amino acid frequency and composition, etc. A wide range of user-friendly tools have been incorporated in this database like searching, browsing, analyzing, mapping tools. In addition, we have derived various types of information from these peptide sequences that include secondary/tertiary structure, amino acid composition and physicochemical properties of peptides. This database will be very useful for developing models for predicting effective cell penetrating peptides. Database URL: http://crdd.osdd.net/raghava/cppsite/.


Subject(s)
Cell-Penetrating Peptides , Databases, Protein , Database Management Systems , Internet , User-Computer Interface
9.
Amino Acids ; 42(5): 1703-13, 2012 May.
Article in English | MEDLINE | ID: mdl-21400228

ABSTRACT

Since endo-symbiotic events occur, all genes of mitochondrial aminoacyl tRNA synthetase (AARS) were lost or transferred from ancestral mitochondrial genome into the nucleus. The canonical pattern is that both cytosolic and mitochondrial AARSs coexist in the nuclear genome. In the present scenario all mitochondrial AARSs are nucleus-encoded, synthesized on cytosolic ribosomes and post-translationally imported from the cytosol into the mitochondria in eukaryotic cell. The site-based discrimination between similar types of enzymes is very challenging because they have almost same physico-chemical properties. It is very important to predict the sub-cellular location of AARSs, to understand the mitochondrial protein synthesis. We have analyzed and optimized the distinguishable patterns between cytosolic and mitochondrial AARSs. Firstly, support vector machines (SVM)-based modules have been developed using amino acid and dipeptide compositions and achieved Mathews correlation coefficient (MCC) of 0.82 and 0.73, respectively. Secondly, we have developed SVM modules using position-specific scoring matrix and achieved the maximum MCC of 0.78. Thirdly, we developed SVM modules using N-terminal, intermediate residues, C-terminal and split amino acid composition (SAAC) and achieved MCC of 0.82, 0.70, 0.39 and 0.86, respectively. Finally, a SVM module was developed using selected attributes of split amino acid composition (SA-SAAC) approach and achieved MCC of 0.92 with an accuracy of 96.00%. All modules were trained and tested on a non-redundant data set and evaluated using fivefold cross-validation technique. On the independent data sets, SA-SAAC based prediction model achieved MCC of 0.95 with an accuracy of 97.77%. The web-server 'MARSpred' based on above study is available at http://www.imtech.res.in/raghava/marspred/.


Subject(s)
Amino Acid Sequence , Amino Acids/chemistry , Amino Acyl-tRNA Synthetases/chemistry , Mitochondria/enzymology , Mitochondrial Proteins/chemistry , Algorithms , Amino Acyl-tRNA Synthetases/biosynthesis , Cell Nucleus/enzymology , Computational Biology , Cytosol/enzymology , Eukaryotic Cells/enzymology , Internet , Mitochondrial Proteins/biosynthesis , Position-Specific Scoring Matrices , Ribosomes/enzymology , Software , Support Vector Machine
11.
Nucleic Acids Res ; 39(Database issue): D975-9, 2011 Jan.
Article in English | MEDLINE | ID: mdl-21045064

ABSTRACT

The Cervical Cancer gene DataBase (CCDB, http://crdd.osdd.net/raghava/ccdb) is a manually curated catalog of experimentally validated genes that are thought, or are known to be involved in the different stages of cervical carcinogenesis. In spite of the large women population that is presently affected from this malignancy still at present, no database exists that catalogs information on genes associated with cervical cancer. Therefore, we have compiled 537 genes in CCDB that are linked with cervical cancer causation processes such as methylation, gene amplification, mutation, polymorphism and change in expression level, as evident from published literature. Each record contains details related to gene like architecture (exon-intron structure), location, function, sequences (mRNA/CDS/protein), ontology, interacting partners, homology to other eukaryotic genomes, structure and links to other public databases, thus augmenting CCDB with external data. Also, manually curated literature references have been provided to support the inclusion of the gene in the database and establish its association with cervix cancer. In addition, CCDB provides information on microRNA altered in cervical cancer as well as search facility for querying, several browse options and an online tool for sequence similarity search, thereby providing researchers with easy access to the latest information on genes involved in cervix cancer.


Subject(s)
Databases, Genetic , Genes, Neoplasm , Uterine Cervical Neoplasms/genetics , Female , Humans , User-Computer Interface
12.
Amino Acids ; 39(1): 101-10, 2010 Jun.
Article in English | MEDLINE | ID: mdl-19908123

ABSTRACT

The rate of human death due to malaria is increasing day-by-day. Thus the malaria causing parasite Plasmodium falciparum (PF) remains the cause of concern. With the wealth of data now available, it is imperative to understand protein localization in order to gain deeper insight into their functional roles. In this manuscript, an attempt has been made to develop prediction method for the localization of mitochondrial proteins. In this study, we describe a method for predicting mitochondrial proteins of malaria parasite using machine-learning technique. All models were trained and tested on 175 proteins (40 mitochondrial and 135 non-mitochondrial proteins) and evaluated using five-fold cross validation. We developed a Support Vector Machine (SVM) model for predicting mitochondrial proteins of P. falciparum, using amino acids and dipeptides composition and achieved maximum MCC 0.38 and 0.51, respectively. In this study, split amino acid composition (SAAC) is used where composition of N-termini, C-termini, and rest of protein is computed separately. The performance of SVM model improved significantly from MCC 0.38 to 0.73 when SAAC instead of simple amino acid composition was used as input. In addition, SVM model has been developed using composition of PSSM profile with MCC 0.75 and accuracy 91.38%. We achieved maximum MCC 0.81 with accuracy 92% using a hybrid model, which combines PSSM profile and SAAC. When evaluated on an independent dataset our method performs better than existing methods. A web server PFMpred has been developed for predicting mitochondrial proteins of malaria parasites ( http://www.imtech.res.in/raghava/pfmpred/).


Subject(s)
Amino Acids/analysis , Artificial Intelligence , Mitochondrial Proteins/analysis , Mitochondrial Proteins/chemistry , Plasmodium falciparum/chemistry , Algorithms , Amino Acids/chemistry , Databases, Protein , Dipeptides/chemistry , Mitochondrial Proteins/metabolism , Models, Molecular , Protein Transport
13.
Nucleic Acids Res ; 38(Database issue): D847-53, 2010 Jan.
Article in English | MEDLINE | ID: mdl-19820110

ABSTRACT

The continuing threat of infectious disease and future pandemics, coupled to the continuous increase of drug-resistant pathogens, makes the discovery of new and better vaccines imperative. For effective vaccine development, antigen discovery and validation is a prerequisite. The compilation of information concerning pathogens, virulence factors and antigenic epitopes has resulted in many useful databases. However, most such immunological databases focus almost exclusively on antigens where epitopes are known and ignore those for which epitope information was unavailable. We have compiled more than 500 antigens into the AntigenDB database, making use of the literature and other immunological resources. These antigens come from 44 important pathogenic species. In AntigenDB, a database entry contains information regarding the sequence, structure, origin, etc. of an antigen with additional information such as B and T-cell epitopes, MHC binding, function, gene-expression and post translational modifications, where available. AntigenDB also provides links to major internal and external databases. We shall update AntigenDB on a rolling basis, regularly adding antigens from other organisms and extra data analysis tools. AntigenDB is available freely at http://www.imtech.res.in/raghava/antigendb and its mirror site http://www.bic.uams.edu/raghava/antigendb.


Subject(s)
Antigens/chemistry , Computational Biology/methods , Databases, Genetic , Immune System/metabolism , Communicable Diseases/immunology , Communicable Diseases/metabolism , Computational Biology/trends , Databases, Protein , Epitopes/chemistry , Humans , Immunogenetics/methods , Information Storage and Retrieval/methods , Internet , Lymphocytes/immunology , Lymphocytes/metabolism , Peptide Mapping , Protein Processing, Post-Translational , Protein Structure, Tertiary , Software
14.
Protein Eng Des Sel ; 22(7): 441-4, 2009 Jul.
Article in English | MEDLINE | ID: mdl-19491216

ABSTRACT

Chemokines are low molecular mass cytokine-like proteins that orchestrate myriads of immune functions like leukocyte trafficking, T cell differentiation, angiogenesis, hematopeosis and mast cell degranulation. Chemokines also play a role as HIV-1 inhibitor and act as potent natural adjuvant in antitumor immunotherapy. Receptors for these molecules are all seven-pass transmembrane G-protein-coupled receptors that are intimately involved with chemokines in a wide array of physiological and pathological conditions. These receptors also have a major role as co-receptors for HIV-1 entry into target cells. Therefore, chemokine receptors have proven to be excellent targets for small molecule in pharmaceutical industry. The immense importance of chemokines and their receptors motivated us to develop a support vector machine-based method ChemoPred to predict this important class of proteins and further classify them into subfamilies. ChemoPred is capable of predicting chemokines and chemokine receptors with an accuracy of 95.08% and 92.19%, respectively. The overall accuracy of classification of chemokines into three subfamilies was 96.00% and that of chemokine receptors into three families was 92.87%. The server ChemoPred is freely available at www.imtech.res.in/raghava/chemopred.


Subject(s)
Chemokines/classification , Computational Biology/methods , Receptors, Chemokine/classification , Artificial Intelligence , Databases, Protein , Internet , Software
15.
Amino Acids ; 35(3): 599-605, 2008 Oct.
Article in English | MEDLINE | ID: mdl-18425404

ABSTRACT

The association of structurally disordered proteins with a number of diseases has engendered enormous interest and therefore demands a prediction method that would facilitate their expeditious study at molecular level. The present study describes the development of a computational method for predicting disordered proteins using sequence and profile compositions as input features for the training of SVM models. First, we developed the amino acid and dipeptide compositions based SVM modules which yielded sensitivities of 75.6 and 73.2% along with Matthew's Correlation Coefficient (MCC) values of 0.75 and 0.60, respectively. In addition, the use of predicted secondary structure content (coil, sheet and helices) in the form of composition values attained a sensitivity of 76.8% and MCC value of 0.77. Finally, the training of SVM models using evolutionary information hidden in the multiple sequence alignment profile improved the prediction performance by achieving a sensitivity value of 78% and MCC of 0.78. Furthermore, when evaluated on an independent dataset of partially disordered proteins, the same SVM module provided a correct prediction rate of 86.6%. Based on the above study, a web server ("DPROT") was developed for the prediction of disordered proteins, which is available at http://www.imtech.res.in/raghava/dprot/.


Subject(s)
Computational Biology/methods , Evolution, Molecular , Proteins/chemistry , Sequence Analysis, Protein/methods , Databases, Protein , Protein Structure, Secondary
16.
Trends Biotechnol ; 26(4): 190-200, 2008 Apr.
Article in English | MEDLINE | ID: mdl-18291542

ABSTRACT

Genome sequences from many organisms, including humans, have been completed, and high-throughput analyses have produced burgeoning volumes of 'omics' data. Bioinformatics is crucial for the management and analysis of such data and is increasingly used to accelerate progress in a wide variety of large-scale and object-specific functional analyses. Refined algorithms enable biotechnologists to follow 'computer-aided strategies' based on experiments driven by high-confidence predictions. In order to address compound problems, current efforts in immuno-informatics and reverse vaccinology are aimed at developing and tuning integrative approaches and user-friendly, automated bioinformatics environments. This will herald a move to 'computer-aided biotechnology': smart projects in which time-consuming and expensive large-scale experimental approaches are progressively replaced by prediction-driven investigations.


Subject(s)
Computer-Aided Design/trends , Database Management Systems , Immunotherapy, Active/methods , User-Computer Interface , Allergy and Immunology/trends , Animals , Artificial Intelligence , Biomedical Engineering/trends , Biotechnology/trends , Database Management Systems/trends , Databases, Genetic , Humans , Immunity/drug effects , Immunity/physiology
17.
Protein Eng Des Sel ; 21(4): 279-82, 2008 Apr.
Article in English | MEDLINE | ID: mdl-18287174

ABSTRACT

Cytokines are messengers of immune system. They are small secreted proteins that mediate and regulate the immune system, inflammation and hematopoiesis. Recent studies have revealed important roles played by the cytokines in adjuvants as therapeutic targets and in cancer therapy. In this paper, an attempt has been made to predict this important class of proteins and classify further them into families and subfamilies. A PSI-BLAST+Support Vector Machine-based hybrid approach is adopted to develop the prediction methods. CytoPred is capable of predicting cytokines with an accuracy of 98.29%. The overall accuracy of classification of cytokines into four families and further classification into seven subfamilies is 99.77 and 97.24%, respectively. It has been shown by comparison that CytoPred performs better than the already existing CTKPred. A user-friendly server CytoPred has been developed and available at http://www.imtech.res.in/raghava/cytopred.


Subject(s)
Cytokines/classification , Internet , Models, Biological , Software , Cytokines/metabolism , Sensitivity and Specificity
18.
Proteins ; 71(1): 189-94, 2008 Apr.
Article in English | MEDLINE | ID: mdl-17932917

ABSTRACT

RNA-binding proteins (RBPs) play key roles in post-transcriptional control of gene expression, which, along with transcriptional regulation, is a major way to regulate patterns of gene expression during development. Thus, the identification and prediction of RNA binding sites is an important step in comprehensive understanding of how RBPs control organism development. Combining evolutionary information and support vector machine (SVM), we have developed an improved method for predicting RNA binding sites or RNA interacting residues in a protein sequence. The prediction models developed in this study have been trained and tested on 86 RNA binding protein chains and evaluated using fivefold cross validation technique. First, a SVM model was developed that achieved a maximum Matthew's correlation coefficient (MCC) of 0.31. The performance of this SVM model further improved the MCC from 0.31 to 0.45, when multiple sequence alignment in the form of PSSM profiles was used as input to the SVM, which is far better than the maximum MCC achieved by previous methods (0.41) on the same dataset. In addition, SVM models were also developed on an alternative dataset that contained 107 RBP chains. Utilizing PSSM as input information to the SVM, the training/testing on this alternate dataset achieved a maximum MCC of 0.32. Conclusively, the prediction performance of SVM models developed in this study is better than the existing methods on the same datasets. A web server 'Pprint' was also developed for predicting RNA binding residues in a protein sequence which is freely available at http://www.imtech.res.in/raghava/pprint/.


Subject(s)
Artificial Intelligence , RNA-Binding Proteins/chemistry , RNA/chemistry , Binding Sites , Models, Molecular
19.
Protein Pept Lett ; 14(7): 626-31, 2007.
Article in English | MEDLINE | ID: mdl-17897087

ABSTRACT

Among secondary structure elements, beta-turns are ubiquitous and major feature of bioactive peptides. We analyzed 77 biologically active peptides with length varying from 9 to 20 residues. Out of 77 peptides, 58 peptides were found to contain at least one beta-turn. Further, at the residue level, 34.9% of total peptide residues were found to be in beta-turns, higher than the number of helical (32.3%) and beta-sheet residues (6.9%). So, we utilized the predicted beta-turns information to develop an improved method for predicting the three-dimensional (3D) structure of small peptides. In principle, we built four different structural models for each peptide. The first 'model I' was built by assigning all the peptide residues an extended conformation (phi = Psi = 180 degrees ). Second 'model II' was built using the information of regular secondary structures (helices, beta-strands and coil) predicted from PSIPRED. In third 'model III', secondary structure information including beta-turn types predicted from BetaTurns method was used. The fourth 'model IV' had main-chain phi, Psi angles of model III and side chain angles assigned using standard Dunbrack backbone dependent rotamer library. These models were further refined using AMBER package and the resultant C(alpha) rmsd values were calculated. It was found that adding the beta-turns to the regular secondary structures greatly reduces the rmsd values both before and after the energy minimization. Hence, the results indicate that regular and irregular secondary structures, particularly beta-turns information can provide valuable and vital information in the tertiary structure prediction of small bioactive peptides. Based on the above study, a web server PEPstr (http://www.imtech.res.in/raghava/pepstr/) was developed for predicting the tertiary structure of small bioactive peptides.


Subject(s)
Peptides/chemistry , Ligands , Monte Carlo Method , Peptides/pharmacology , Protein Binding , Protein Structure, Tertiary
20.
BMC Bioinformatics ; 8: 263, 2007 Jul 23.
Article in English | MEDLINE | ID: mdl-17645800

ABSTRACT

BACKGROUND: Antibacterial peptides are important components of the innate immune system, used by the host to protect itself from different types of pathogenic bacteria. Over the last few decades, the search for new drugs and drug targets has prompted an interest in these antibacterial peptides. We analyzed 486 antibacterial peptides, obtained from antimicrobial peptide database APD, in order to understand the preference of amino acid residues at specific positions in these peptides. RESULTS: It was observed that certain types of residues are preferred over others in antibacterial peptides, particularly at the N and C terminus. These observations encouraged us to develop a method for predicting antibacterial peptides in proteins from their amino acid sequence. First, the N-terminal residues were used for predicting antibacterial peptides using Artificial Neural Network (ANN), Quantitative Matrices (QM) and Support Vector Machine (SVM), which resulted in an accuracy of 83.63%, 84.78% and 87.85%, respectively. Then, the C-terminal residues were used for developing prediction methods, which resulted in an accuracy of 77.34%, 82.03% and 85.16% using ANN, QM and SVM, respectively. Finally, ANN, QM and SVM models were developed using N and C terminal residues, which achieved an accuracy of 88.17%, 90.37% and 92.11%, respectively. All the models developed in this study were evaluated using five-fold cross validation technique. These models were also tested on an independent or blind dataset. CONCLUSION: Among antibacterial peptides, there is preference for certain residues at N and C termini, which helps to demarcate them from non-antibacterial peptides. Both the termini play a crucial role in imparting the antibacterial property to these peptides. Among the methods developed, SVM shows the best performance in predicting antibacterial peptides followed by QM and ANN, in that order. AntiBP (Antibacterial peptides) will help in discovering efficacious antibacterial peptides, which we hope will prove to be a boon to combat the dreadful antibiotic resistant bacteria. A user friendly web server has also been developed to help the biological community, which is accessible at http://www.imtech.res.in/raghava/antibp/.


Subject(s)
Antimicrobial Cationic Peptides/chemistry , Models, Chemical , Sequence Analysis, Protein/methods , Amino Acid Sequence , Computer Simulation , Molecular Sequence Data , Protein Structure, Tertiary , Sequence Homology, Amino Acid
SELECTION OF CITATIONS
SEARCH DETAIL
...