Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 192
Filtrar
1.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36516298

RESUMO

This paper describes a method Pprint2, which is an improved version of Pprint developed for predicting RNA-interacting residues in a protein. Training and independent/validation datasets used in this study comprises of 545 and 161 non-redundant RNA-binding proteins, respectively. All models were trained on training dataset and evaluated on the validation dataset. The preliminary analysis reveals that positively charged amino acids such as H, R and K, are more prominent in the RNA-interacting residues. Initially, machine learning based models have been developed using binary profile and obtain maximum area under curve (AUC) 0.68 on validation dataset. The performance of this model improved significantly from AUC 0.68 to 0.76, when evolutionary profile is used instead of binary profile. The performance of our evolutionary profile-based model improved further from AUC 0.76 to 0.82, when convolutional neural network has been used for developing model. Our final model based on convolutional neural network using evolutionary information achieved AUC 0.82 with Matthews correlation coefficient of 0.49 on the validation dataset. Our best model outperforms existing methods when evaluated on the independent/validation dataset. A user-friendly standalone software and web-based server named 'Pprint2' has been developed for predicting RNA-interacting residues (https://webs.iiitd.edu.in/raghava/pprint2 and https://github.com/raghavagps/pprint2).


Assuntos
Aminoácidos , RNA , Sítios de Ligação , RNA/metabolismo , Software , Proteínas de Ligação a RNA/metabolismo
2.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36575815

RESUMO

In the current era, one of the major challenges is to manage the treatment of drug/antibiotic-resistant strains of bacteria. Phage therapy, a century-old technique, may serve as an alternative to antibiotics in treating bacterial infections caused by drug-resistant strains of bacteria. In this review, a systematic attempt has been made to summarize phage-based therapy in depth. This review has been divided into the following two sections: general information and computer-aided phage therapy (CAPT). In the case of general information, we cover the history of phage therapy, the mechanism of action, the status of phage-based products (approved and clinical trials) and the challenges. This review emphasizes CAPT, where we have covered primary phage-associated resources, phage prediction methods and pipelines. This review covers a wide range of databases and resources, including viral genomes and proteins, phage receptors, host genomes of phages, phage-host interactions and lytic proteins. In the post-genomic era, identifying the most suitable phage for lysing a drug-resistant strain of bacterium is crucial for developing alternate treatments for drug-resistant bacteria and this remains a challenging problem. Thus, we compile all phage-associated prediction methods that include the prediction of phages for a bacterial strain, the host for a phage and the identification of interacting phage-host pairs. Most of these methods have been developed using machine learning and deep learning techniques. This review also discussed recent advances in the field of CAPT, where we briefly describe computational tools available for predicting phage virions, the life cycle of phages and prophage identification. Finally, we describe phage-based therapy's advantages, challenges and opportunities.


Assuntos
Infecções Bacterianas , Bacteriófagos , Terapia por Fagos , Humanos , Terapia por Fagos/métodos , Prófagos , Genômica , Bactérias/genética , Antibacterianos
3.
Brief Bioinform ; 24(1)2023 01 19.
Artigo em Inglês | MEDLINE | ID: mdl-36524996

RESUMO

There are a number of antigens that induce autoimmune response against ß-cells, leading to type 1 diabetes mellitus (T1DM). Recently, several antigen-specific immunotherapies have been developed to treat T1DM. Thus, identification of T1DM associated peptides with antigenic regions or epitopes is important for peptide based-therapeutics (e.g. immunotherapeutic). In this study, for the first time, an attempt has been made to develop a method for predicting, designing, and scanning of T1DM associated peptides with high precision. We analysed 815 T1DM associated peptides and observed that these peptides are not associated with a specific class of HLA alleles. Thus, HLA binder prediction methods are not suitable for predicting T1DM associated peptides. First, we developed a similarity/alignment based method using Basic Local Alignment Search Tool and achieved a high probability of correct hits with poor coverage. Second, we developed an alignment-free method using machine learning techniques and got a maximum AUROC of 0.89 using dipeptide composition. Finally, we developed a hybrid method that combines the strength of both alignment free and alignment-based methods and achieves maximum area under the receiver operating characteristic of 0.95 with Matthew's correlation coefficient of 0.81 on an independent dataset. We developed a web server 'DMPPred' and stand-alone server for predicting, designing and scanning T1DM associated peptides (https://webs.iiitd.edu.in/raghava/dmppred/).


Assuntos
Diabetes Mellitus Tipo 1 , Humanos , Diabetes Mellitus Tipo 1/genética , Simulação por Computador , Peptídeos/química , Epitopos/química , Software
4.
Proteomics ; : e2400004, 2024 May 27.
Artigo em Inglês | MEDLINE | ID: mdl-38803012

RESUMO

Peptide hormones serve as genome-encoded signal transduction molecules that play essential roles in multicellular organisms, and their dysregulation can lead to various health problems. In this study, we propose a method for predicting hormonal peptides with high accuracy. The dataset used for training, testing, and evaluating our models consisted of 1174 hormonal and 1174 non-hormonal peptide sequences. Initially, we developed similarity-based methods utilizing BLAST and MERCI software. Although these similarity-based methods provided a high probability of correct prediction, they had limitations, such as no hits or prediction of limited sequences. To overcome these limitations, we further developed machine and deep learning-based models. Our logistic regression-based model achieved a maximum AUROC of 0.93 with an accuracy of 86% on an independent/validation dataset. To harness the power of similarity-based and machine learning-based models, we developed an ensemble method that achieved an AUROC of 0.96 with an accuracy of 89.79% and a Matthews correlation coefficient (MCC) of 0.8 on the validation set. To facilitate researchers in predicting and designing hormone peptides, we developed a web-based server called HOPPred. This server offers a unique feature that allows the identification of hormone-associated motifs within hormone peptides. The server can be accessed at: https://webs.iiitd.edu.in/raghava/hoppred/.

5.
Proteomics ; 24(6): e2300231, 2024 Mar.
Artigo em Inglês | MEDLINE | ID: mdl-37525341

RESUMO

Non-invasive diagnostics and therapies are crucial to prevent patients from undergoing painful procedures. Exosomal proteins can serve as important biomarkers for such advancements. In this study, we attempted to build a model to predict exosomal proteins. All models are trained, tested, and evaluated on a non-redundant dataset comprising 2831 exosomal and 2831 non-exosomal proteins, where no two proteins have more than 40% similarity. Initially, the standard similarity-based method Basic Local Alignment Search Tool (BLAST) was used to predict exosomal proteins, which failed due to low-level similarity in the dataset. To overcome this challenge, machine learning (ML) based models were developed using compositional and evolutionary features of proteins achieving an area under the receiver operating characteristics (AUROC) of 0.73. Our analysis also indicated that exosomal proteins have a variety of sequence-based motifs which can be used to predict exosomal proteins. Hence, we developed a hybrid method combining motif-based and ML-based approaches for predicting exosomal proteins, achieving a maximum AUROC of 0.85 and MCC of 0.56 on an independent dataset. This hybrid model performs better than presently available methods when assessed on an independent dataset. A web server and a standalone software ExoProPred (https://webs.iiitd.edu.in/raghava/exopropred/) have been created to help scientists predict and discover exosomal proteins and find functional motifs present in them.


Assuntos
Algoritmo Florestas Aleatórias , Análise de Sequência de Proteína , Humanos , Sequência de Aminoácidos , Análise de Sequência de Proteína/métodos , Proteínas/metabolismo , Software
6.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35580839

RESUMO

Human leukocyte antigens (HLA) regulate various innate and adaptive immune responses and play a crucial immunomodulatory role. Recent studies revealed that non-classical HLA-(HLA-E & HLA-G) based immunotherapies have many advantages over traditional HLA-based immunotherapy, particularly against cancer and COVID-19 infection. In the last two decades, several methods have been developed to predict the binders of classical HLA alleles. In contrast, limited attempts have been made to develop methods for predicting non-classical HLA binding peptides, due to the scarcity of sufficient experimental data. Of note, in order to facilitate the scientific community, we have developed an artificial intelligence-based method for predicting binders of class-Ib HLA alleles. All the models were trained and tested on experimentally validated data obtained from the recent release of IEDB. The machine learning models achieved more than 0.98 AUC for HLA-G alleles on validation dataset. Similarly, our models achieved the highest AUC of 0.96 and 0.94 on the validation dataset for HLA-E*01:01 and HLA-E*01:03, respectively. We have summarized the models developed in the past for non-classical HLA and validated the performance with the models developed in this study. Moreover, to facilitate the community, we have utilized our tool for predicting the potential non-classical HLA binding peptides in the spike protein of different variants of virus causing COVID-19, including Omicron (B.1.1.529). One of the major challenges in the field of immunotherapy is to identify the promiscuous binders or antigenic regions that can bind to a large number of HLA alleles. To predict the promiscuous binders for the non-classical HLA alleles, we developed a web server HLAncPred (https://webs.iiitd.edu.in/raghava/hlancpred) and standalone package.


Assuntos
Inteligência Artificial , COVID-19 , Sítios de Ligação , COVID-19/genética , Antígenos HLA-G/metabolismo , Humanos , Peptídeos/química , Ligação Proteica , Glicoproteína da Espícula de Coronavírus/metabolismo
7.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35943134

RESUMO

DNA-protein interaction is one of the most crucial interactions in the biological system, which decides the fate of many processes such as transcription, regulation and splicing of genes. In this study, we trained our models on a training dataset of 646 DNA-binding proteins having 15 636 DNA interacting and 298 503 non-interacting residues. Our trained models were evaluated on an independent dataset of 46 DNA-binding proteins having 965 DNA interacting and 9911 non-interacting residues. All proteins in the independent dataset have less than 30% of sequence similarity with proteins in the training dataset. A wide range of traditional machine learning and deep learning (1D-CNN) techniques-based models have been developed using binary, physicochemical properties and Position-Specific Scoring Matrix (PSSM)/evolutionary profiles. In the case of machine learning technique, eXtreme Gradient Boosting-based model achieved a maximum area under the receiver operating characteristics (AUROC) curve of 0.77 on the independent dataset using PSSM profile. Deep learning-based model achieved the highest AUROC of 0.79 on the independent dataset using a combination of all three profiles. We evaluated the performance of existing methods on the independent dataset and observed that our proposed method outperformed all the existing methods. In order to facilitate scientific community, we developed standalone software and web server, which are accessible from https://webs.iiitd.edu.in/raghava/dbpred.


Assuntos
Aprendizado Profundo , DNA/química , DNA/genética , Proteínas de Ligação a DNA , Bases de Dados de Proteínas , Matrizes de Pontuação de Posição Específica
8.
Brief Bioinform ; 23(5)2022 09 20.
Artigo em Inglês | MEDLINE | ID: mdl-35595541

RESUMO

Proteins/peptides have shown to be promising therapeutic agents for a variety of diseases. However, toxicity is one of the obstacles in protein/peptide-based therapy. The current study describes a web-based tool, ToxinPred2, developed for predicting the toxicity of proteins. This is an update of ToxinPred developed mainly for predicting toxicity of peptides and small proteins. The method has been trained, tested and evaluated on three datasets curated from the recent release of the SwissProt. To provide unbiased evaluation, we performed internal validation on 80% of the data and external validation on the remaining 20% of data. We have implemented the following techniques for predicting protein toxicity; (i) Basic Local Alignment Search Tool-based similarity, (ii) Motif-EmeRging and with Classes-Identification-based motif search and (iii) Prediction models. Similarity and motif-based techniques achieved a high probability of correct prediction with poor sensitivity/coverage, whereas models based on machine-learning techniques achieved balance sensitivity and specificity with reasonably high accuracy. Finally, we developed a hybrid method that combined all three approaches and achieved a maximum area under receiver operating characteristic curve around 0.99 with Matthews correlation coefficient 0.91 on the validation dataset. In addition, we developed models on alternate and realistic datasets. The best machine learning models have been implemented in the web server named 'ToxinPred2', which is available at https://webs.iiitd.edu.in/raghava/toxinpred2/ and a standalone version at https://github.com/raghavagps/toxinpred2. This is a general method developed for predicting the toxicity of proteins regardless of their source of origin.


Assuntos
Proteínas , Software , Bases de Dados de Proteínas , Aprendizado de Máquina , Peptídeos , Proteínas/toxicidade
9.
Brief Bioinform ; 22(3)2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-32770192

RESUMO

Increasing use of therapeutic peptides for treating cancer has received considerable attention of the scientific community in the recent years. The present study describes the in silico model developed for predicting and designing anticancer peptides (ACPs). ACPs residue composition analysis show the preference of A, F, K, L and W. Positional preference analysis revealed that residues A, F and K are favored at N-terminus and residues L and K are preferred at C-terminus. Motif analysis revealed the presence of motifs like LAKLA, AKLAK, FAKL and LAKL in ACPs. Machine learning models were developed using various input features and implementing different machine learning classifiers on two datasets main and alternate dataset. In the case of main dataset, dipeptide composition based ETree classifier model achieved maximum Matthews correlation coefficient (MCC) of 0.51 and 0.83 area under receiver operating characteristics (AUROC) on the training dataset. In the case of alternate dataset, amino acid composition based ETree classifier performed best and achieved the highest MCC of 0.80 and AUROC of 0.97 on the training dataset. Five-fold cross-validation technique was implemented for model training and testing, and their performance was also evaluated on the validation dataset. Best models were implemented in the webserver AntiCP 2.0, which is freely available at https://webs.iiitd.edu.in/raghava/anticp2/. The webserver is compatible with multiple screens such as iPhone, iPad, laptop and android phones. The standalone version of the software is available at GitHub; docker-based container also developed.


Assuntos
Antineoplásicos/química , Bases de Dados de Proteínas , Aprendizado de Máquina , Modelos Moleculares , Peptídeos/química , Peptídeos/genética , Análise de Sequência de Proteína , Software , Antineoplásicos/uso terapêutico , Humanos , Neoplasias/tratamento farmacológico , Peptídeos/uso terapêutico , Valor Preditivo dos Testes
10.
Brief Bioinform ; 22(2): 936-945, 2021 03 22.
Artigo em Inglês | MEDLINE | ID: mdl-33034338

RESUMO

Interleukin 6 (IL-6) is a pro-inflammatory cytokine that stimulates acute phase responses, hematopoiesis and specific immune reactions. Recently, it was found that the IL-6 plays a vital role in the progression of COVID-19, which is responsible for the high mortality rate. In order to facilitate the scientific community to fight against COVID-19, we have developed a method for predicting IL-6 inducing peptides/epitopes. The models were trained and tested on experimentally validated 365 IL-6 inducing and 2991 non-inducing peptides extracted from the immune epitope database. Initially, 9149 features of each peptide were computed using Pfeature, which were reduced to 186 features using the SVC-L1 technique. These features were ranked based on their classification ability, and the top 10 features were used for developing prediction models. A wide range of machine learning techniques has been deployed to develop models. Random Forest-based model achieves a maximum AUROC of 0.84 and 0.83 on training and independent validation dataset, respectively. We have also identified IL-6 inducing peptides in different proteins of SARS-CoV-2, using our best models to design vaccine against COVID-19. A web server named as IL-6Pred and a standalone package has been developed for predicting, designing and screening of IL-6 inducing peptides (https://webs.iiitd.edu.in/raghava/il6pred/).


Assuntos
COVID-19/fisiopatologia , Simulação por Computador , Interleucina-6/biossíntese , Peptídeos/metabolismo , COVID-19/virologia , Bases de Dados de Proteínas , Conjuntos de Dados como Assunto , Humanos , Interleucina-6/fisiologia , Aprendizado de Máquina , SARS-CoV-2/isolamento & purificação
11.
Brief Bioinform ; 22(3)2021 05 20.
Artigo em Inglês | MEDLINE | ID: mdl-32510549

RESUMO

Dengue virus (DENV) researchers often face challenges with the highly time-consuming process of collecting and curating information on known inhibitors during the standard drug discovery process. To this end, however, required collective information is not yet available on a single platform. Hence, we have developed the DenvInD database for experimentally validated DENV inhibitors against its known targets presently hosted at https://webs.iiitd.edu.in/raghava/denvind/. This database provides comprehensive information, i.e. PubChem IDs, SMILES, IC50, EC50, CC50, and wherever available Ki values of the 484 compounds in vitro validated as inhibitors against respective drug targets of DENV. Also, the DenvInD database has been linked to the user-friendly web-based interface and accessibility features, such as simple search, advanced search and data browsing. All the required data curation was conducted manually from the reported scientific literature and PubChem. The collected information was then organized into the DenvInD database using sequence query language under user interface by hypertext markup language. DenvInD is the first useful repository of its kind which would augment the DENV drug discovery research by providing essential information on known DENV inhibitors for molecular docking, computational screening, pharmacophore modeling and quantitative structure-activity relationship modeling.


Assuntos
Antivirais/química , Bases de Dados de Compostos Químicos , Vírus da Dengue , Dengue/tratamento farmacológico , Descoberta de Drogas , Simulação de Acoplamento Molecular , Humanos , Relação Quantitativa Estrutura-Atividade
12.
Brief Bioinform ; 22(4)2021 07 20.
Artigo em Inglês | MEDLINE | ID: mdl-33201237

RESUMO

AlgPred 2.0 is a web server developed for predicting allergenic proteins and allergenic regions in a protein. It is an updated version of AlgPred developed in 2006. The dataset used for training, testing and validation consists of 10 075 allergens and 10 075 non-allergens. In addition, 10 451 experimentally validated immunoglobulin E (IgE) epitopes were used to identify antigenic regions in a protein. All models were trained on 80% of data called training dataset, and the performance of models was evaluated using 5-fold cross-validation technique. The performance of the final model trained on the training dataset was evaluated on 20% of data called validation dataset; no two proteins in any two sets have more than 40% similarity. First, a Basic Local Alignment Search Tool (BLAST) search has been performed against the dataset, and allergens were predicted based on the level of similarity with known allergens. Second, IgE epitopes obtained from the IEDB database were searched in the dataset to predict allergens based on their presence in a protein. Third, motif-based approaches like multiple EM for motif elicitation/motif alignment and search tool have been used to predict allergens. Fourth, allergen prediction models have been developed using a wide range of machine learning techniques. Finally, the ensemble approach has been used for predicting allergenic protein by combining prediction scores of different approaches. Our best model achieved maximum performance in terms of area under receiver operating characteristic curve 0.98 with Matthew's correlation coefficient 0.85 on the validation dataset. A web server AlgPred 2.0 has been developed that allows the prediction of allergens, mapping of IgE epitope, motif search and BLAST search (https://webs.iiitd.edu.in/raghava/algpred2/).


Assuntos
Alérgenos/química , Bases de Dados de Proteínas , Mapeamento de Epitopos , Epitopos/química , Hipersensibilidade , Imunoglobulina E/química , Análise de Sequência de Proteína , Software , Alérgenos/imunologia , Epitopos/imunologia , Humanos , Imunoglobulina E/imunologia , Valor Preditivo dos Testes
13.
Proteomics ; 22(3): e2000311, 2022 02.
Artigo em Inglês | MEDLINE | ID: mdl-34637591

RESUMO

Numerous cancer-specific prognostic models have been developed in the past, wherein one model is applicable for only one type of cancer. In this study, an attempt has been made to identify universal or multi-cancer prognostic biomarkers and develop models for predicting survival risk across different types of cancer patients. In order to accomplish this, we gauged the prognostic role of mRNA expression of 165 apoptosis-related genes across 33 cancers in the context of patient survival. Firstly, we identified specific prognostic biomarker genes for 30 cancers. The cancer-specific prognostic models achieved a minimum Hazard Ratio, HRSKCM = 1.99 and maximum HRTHCA = 41.59. Secondly, a comprehensive analysis was performed to identify universal biomarkers across many cancers. Our best prognostic model consisted of 11 genes (TOP2A, ISG20, CD44, LEF1, CASP2, PSEN1, PTK2, SATB1, SLC20A1, EREG, and CD2) and stratified risk groups across 27 cancers (HROV = 1.53-HRUVM = 11.74). The model was validated on eight independent cancer cohorts and exhibited a comparable performance. Further, we clustered cancer-types on the basis of shared survival related apoptosis genes. This approach proved helpful in development of cross-cancer prognostic models. To show its efficacy, a prognostic model consisting of 15 genes was thereby developed for LGG-KIRC pair (HRKIRC = 3.27, HRLGG = 4.23). Additionally, we predicted potential therapeutic candidates for LGG-KIRC high risk patients.


Assuntos
Apoptose/genética , Neoplasias , Biomarcadores Tumorais/genética , Humanos , Neoplasias/genética , Prognóstico , Fatores de Risco , Análise de Sobrevida , Transcriptoma
14.
Genomics ; 112(5): 3696-3702, 2020 09.
Artigo em Inglês | MEDLINE | ID: mdl-32360910

RESUMO

CancerEnD is an integrated resource developed for annotating 8524 unique expressed enhancers, associated genes, somatic mutations and copy number variations of 8063 cancer samples from 18 cancer types of TCGA. Somatic mutation data was taken from the COSMIC repository. To delineate the relationship of change in copy number of enhancer elements with the prognosis of cancer patients, survival analysis was done using the survival package in R. We identified 1762 overall survival associated enhancers, which can be used for prognostic purposes of cancer patients in a tissue-specific manner. CancerEnD (https://webs.iiitd.edu.in/raghava/cancerend/) is developed on a user-friendly responsive template, that enables searching, browsing and downloading of the annotated enhancer elements in terms of gene expression, copy number variation and survival association. We hope it provides a promising avenue for researchers to facilitate the understanding of enhancer deregulation in tumorigenesis, and to identify new biomarkers for therapy and disease-diagnosis.


Assuntos
Biomarcadores Tumorais/genética , Bases de Dados Genéticas , Elementos Facilitadores Genéticos , Neoplasias/genética , Variações do Número de Cópias de DNA , Humanos , Neoplasias/patologia , Prognóstico , Análise de Sobrevida
15.
BMC Bioinformatics ; 19(Suppl 13): 426, 2019 Feb 04.
Artigo em Inglês | MEDLINE | ID: mdl-30717654

RESUMO

BACKGROUND: Molecular docking studies on protein-peptide interactions are a challenging and time-consuming task because peptides are generally more flexible than proteins and tend to adopt numerous conformations. There are several benchmarking studies on protein-protein, protein-ligand and nucleic acid-ligand docking interactions. However, a series of docking methods is not rigorously validated for protein-peptide complexes in the literature. Considering the importance and wide application of peptide docking, we describe benchmarking of 6 docking methods on 133 protein-peptide complexes having peptide length between 9 to 15 residues. The performance of docking methods was evaluated using CAPRI parameters like FNAT, I-RMSD, L-RMSD. RESULT: Firstly, we performed blind docking and evaluate the performance of the top docking pose of each method. It was observed that FRODOCK performed better than other methods with average L-RMSD of 12.46 Å. The performance of all methods improved significantly for their best docking pose and achieved highest average L-RMSD of 3.72 Å in case of FRODOCK. Similarly, we performed re-docking and evaluated the performance of the top and best docking pose of each method. We achieved the best performance in case of ZDOCK with average L-RMSD 8.60 Å and 2.88 Å for the top and best docking pose respectively. Methods were also evaluated on 40 protein-peptide complexes used in the previous benchmarking study, where peptide have length up to 5 residues. In case of best docking pose, we achieved the highest average L-RMSD of 4.45 Å and 2.09 Å for the blind docking using FRODOCK and re-docking using AutoDock Vina respectively. CONCLUSION: The study shows that FRODOCK performed best in case of blind docking and ZDOCK in case of re-docking. There is a need to improve the ranking of docking pose generated by different methods, as the present ranking scheme is not satisfactory. To facilitate the scientific community for calculating CAPRI parameters between native and docked complexes, we developed a web-based service named PPDbench ( http://webs.iiitd.edu.in/raghava/ppdbench/ ).


Assuntos
Benchmarking , Simulação de Acoplamento Molecular/métodos , Peptídeos/química , Proteínas/química , Algoritmos , Sítios de Ligação , Bases de Dados de Proteínas , Ligação Proteica , Estrutura Secundária de Proteína , Reprodutibilidade dos Testes
16.
BMC Genomics ; 19(Suppl 9): 266, 2019 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-30999857

RESUMO

InCoB, one of the largest annual bioinformatics conferences in the Asia-Pacific region since its launch in 2002, returned to New Delhi, India after 12 years, with a conference attendance of 314 delegates. The 2018 conference had sessions on Big Data and Algorithms, Next Generation Sequencing and Omics Science, Structure, Function and Interactions, Disease and Drug Discovery and Plant and Agricultural Bioinformatics. The conference also featured an industry track as well as panel discussions on Women in Bioinformatics and Democratization vs. Quality control in academic publishing. Asia Pacific Bioinformatics Interaction & Networking Society (APbians) was launched as an APBionet Special Interest Group. Of the 52 oral presentations made, 22 were accepted in supplemental issues of BMC Bioinformatics, BMC Genomics or BMC Medical Genomics and are briefly reviewed here. Next year's InCoB will be held in Jakarta, Indonesia from September 10-12, 2019.


Assuntos
Algoritmos , Biologia Computacional/métodos , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Software , Congressos como Assunto , Humanos
17.
BMC Genomics ; 19(Suppl 9): 985, 2019 Apr 18.
Artigo em Inglês | MEDLINE | ID: mdl-30999860

RESUMO

BACKGROUND: Fragile sites are the chromosomal regions that are susceptible to breakage, and their frequency varies among the human population. Based on the frequency of fragile site induction, they are categorized as common and rare fragile sites. Common fragile sites are sensitive to replication stress and often rearranged in cancer. Rare fragile sites are the archetypal trinucleotide repeats. Fragile sites are known to be involved in chromosomal rearrangements in tumors. Human miRNA genes are also present at fragile sites. A better understanding of genes and miRNAs lying in the fragile site regions and their association with disease progression is required. RESULT: HumCFS is a manually curated database of human chromosomal fragile sites. HumCFS provides useful information on fragile sites such as coordinates on the chromosome, cytoband, their chemical inducers and frequency of fragile site (rare or common), genes and miRNAs lying in fragile sites. Protein coding genes in the fragile sites were identified by mapping the coordinates of fragile sites with human genome Ensembl (GRCh38/hg38). Genes present in fragile sites were further mapped to DisGenNET database, to understand their possible link with human diseases. Human miRNAs from miRBase was also mapped on fragile site coordinates. In brief, HumCFS provides useful information about 125 human chromosomal fragile sites and their association with 4921 human protein-coding genes and 917 human miRNA's. CONCLUSION: User-friendly web-interface of HumCFS and hyper-linking with other resources will help researchers to search for genes, miRNAs efficiently and to intersect the relationship among them. For easy data retrieval and analysis, we have integrated standard web-based tools, such as JBrowse, BLAST etc. Also, the user can download the data in various file formats such as text files, gff3 files and Bed-format files which can be used on UCSC browser. Database URL: http://webs.iiitd.edu.in/raghava/humcfs/.


Assuntos
Sítios Frágeis do Cromossomo , Cromossomos Humanos , Bases de Dados Genéticas/estatística & dados numéricos , Predisposição Genética para Doença , Genoma Humano , Humanos
18.
Brief Bioinform ; 18(3): 467-478, 2017 05 01.
Artigo em Inglês | MEDLINE | ID: mdl-27016393

RESUMO

The conventional approach for designing vaccine against a particular disease involves stimulation of the immune system using the whole pathogen responsible for the disease. In the post-genomic era, a major challenge is to identify antigenic regions or epitopes that can stimulate different arms of the immune system. In the past two decades, numerous methods and databases have been developed for designing vaccine or immunotherapy against various pathogen-causing diseases. This review describes various computational resources important for designing subunit vaccines or epitope-based immunotherapy. First, different immunological databases are described that maintain epitopes, antigens and vaccine targets. This is followed by in silico tools used for predicting linear and conformational B-cell epitopes required for activating humoral immunity. Finally, information on T-cell epitope prediction methods is provided that includes indirect methods like prediction of Major Histocompatibility Complex and transporter-associated protein binders. Different studies for validating the predicted epitopes are also examined critically. This review enlists novel in silico resources and tools available for predicting humoral and cell-mediated immune potential. These predicted epitopes could be used for designing epitope-based vaccines or immunotherapy as they may activate the adaptive immunity. Authors emphasized the need to develop tools for the prediction of adjuvants to activate innate and adaptive immune system simultaneously. In addition, attention has also been given to novel prediction methods to predict general therapeutic properties of peptides like half-life, cytotoxicity and immune toxicity.


Assuntos
Biologia Computacional , Epitopos de Linfócito B , Epitopos de Linfócito T , Humanos , Peptídeos , Vacinas de Subunidades Antigênicas
19.
Int J Mol Sci ; 20(14)2019 Jul 22.
Artigo em Inglês | MEDLINE | ID: mdl-31336658

RESUMO

Understanding the gene regulatory network governing cancer initiation and progression is necessary, although it remains largely unexplored. Enhancer elements represent the center of this regulatory circuit. The study aims to identify the gene expression change driven by copy number variation in enhancer elements of pancreatic adenocarcinoma (PAAD). The pancreatic tissue specific enhancer and target gene data were taken from EnhancerAtlas. The gene expression and copy number data were taken from The Cancer Genome Atlas (TCGA). Differentially expressed genes (DEGs) and copy number variations (CNVs) were identified between matched tumor-normal samples of PAAD. Significant CNVs were matched onto enhancer coordinates by using genomic intersection functionality from BEDTools. By combining the gene expression and CNV data, we identified 169 genes whose expression shows a positive correlation with the CNV of enhancers. We further identified 16 genes which are regulated by a super enhancer and 15 genes which have high prognostic potential (Z-score > 1.96). Cox proportional hazard analysis of these genes indicates that these are better predictors of survival. Taken together, our integrative analytical approach identifies enhancer CNV-driven gene expression change in PAAD, which could lead to better understanding of PAAD pathogenesis and to the design of enhancer-based cancer treatment strategies.


Assuntos
Adenocarcinoma/genética , Biologia Computacional , Variações do Número de Cópias de DNA , Elementos Facilitadores Genéticos , Regulação Neoplásica da Expressão Gênica , Neoplasias Pancreáticas/genética , Adenocarcinoma/mortalidade , Adenocarcinoma/patologia , Biomarcadores Tumorais , Biologia Computacional/métodos , Perfilação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Neoplasias Pancreáticas/mortalidade , Neoplasias Pancreáticas/patologia , Prognóstico , Modelos de Riscos Proporcionais , Transcriptoma
20.
J Transl Med ; 16(1): 181, 2018 07 03.
Artigo em Inglês | MEDLINE | ID: mdl-29970096

RESUMO

BACKGROUND: Evidences in literature strongly advocate the potential of immunomodulatory peptides for use as vaccine adjuvants. All the mechanisms of vaccine adjuvants ensuing immunostimulatory effects directly or indirectly stimulate antigen presenting cells (APCs). While numerous methods have been developed in the past for predicting B cell and T-cell epitopes; no method is available for predicting the peptides that can modulate the APCs. METHODS: We named the peptides that can activate APCs as A-cell epitopes and developed methods for their prediction in this study. A dataset of experimentally validated A-cell epitopes was collected and compiled from various resources. To predict A-cell epitopes, we developed support vector machine-based machine learning models using different sequence-based features. RESULTS: A hybrid model developed on a combination of sequence-based features (dipeptide composition and motif occurrence), achieved the highest accuracy of 95.71% with Matthews correlation coefficient (MCC) value of 0.91 on the training dataset. We also evaluated the hybrid models on an independent dataset and achieved a comparable accuracy of 95.00% with MCC 0.90. CONCLUSION: The models developed in this study were implemented in a web-based platform VaxinPAD to predict and design immunomodulatory peptides or A-cell epitopes. This web server available at http://webs.iiitd.edu.in/raghava/vaxinpad/ will facilitate researchers in designing peptide-based vaccine adjuvants.


Assuntos
Adjuvantes Imunológicos/farmacologia , Células Apresentadoras de Antígenos/efeitos dos fármacos , Simulação por Computador , Desenho de Fármacos , Vacinas de Subunidades Antigênicas/farmacologia , Motivos de Aminoácidos , Sequência de Aminoácidos , Bases de Dados de Proteínas , Epitopos/metabolismo , Humanos , Fatores Imunológicos/farmacologia , Internet , Modelos Teóricos , Máquina de Vetores de Suporte , Interface Usuário-Computador , Vacinas de Subunidades Antigênicas/química
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA