RESUMO
Fever is common among individuals seeking healthcare after traveling to tropical regions. Despite the association with potentially severe disease, the etiology is often not determined. Plasma protein patterns can be informative to understand the host response to infection and can potentially indicate the pathogen causing the disease. In this study, we measured 49 proteins in the plasma of 124 patients with fever after travel to tropical or subtropical regions. The patients had confirmed diagnoses of either malaria, dengue fever, influenza, bacterial respiratory tract infection, or bacterial gastroenteritis, representing the most common etiologies. We used multivariate and machine learning methods to identify combinations of proteins that contributed to distinguishing infected patients from healthy controls, and each other. Malaria displayed the most unique protein signature, indicating a strong immunoregulatory response with high levels of IL10, sTNFRI and II, and sCD25 but low levels of sCD40L. In contrast, bacterial gastroenteritis had high levels of sCD40L, APRIL, and IFN-γ, while dengue was the only infection with elevated IFN-α2. These results suggest that characterization of the inflammatory profile of individuals with fever can help to identify disease-specific host responses, which in turn can be used to guide future research on diagnostic strategies and therapeutic interventions.
Assuntos
Infecções Bacterianas , Dengue , Gastroenterite , Malária , Infecções Respiratórias , Humanos , Dengue/diagnóstico , Infecções Respiratórias/complicações , Gastroenterite/complicações , Viagem , Febre/complicaçõesRESUMO
Tuberculosis (TB) is a deadly infectious disease that affects millions of people globally. TB proteomics signature discovery has been a rapidly growing area of research that aims to identify protein biomarkers for the early detection, diagnosis, and treatment monitoring of TB. In this review, we have highlighted recent advances in this field and how it is moving from the study of single proteins to high-throughput profiling and from only using proteomics to include additional types of data in multi-omics studies. We have further covered the different sample types and experimental technologies used in TB proteomics signature discovery, focusing on studies of HIV-negative adults. The published signatures were defined as either coming from hypothesis-based protein targeting or from unbiased discovery approaches. The methodological approaches influenced the type of proteins identified and were associated with the circulating protein abundance. However, both approaches largely identified proteins involved in similar biological pathways, including acute-phase responses and T-helper type 1 and type 17 responses. By analysing the frequency of proteins in the different signatures, we could also highlight potential robust biomarker candidates. Finally, we discuss the potential value of integration of multi-omics data and the importance of control cohorts and signature validation.
Assuntos
Mycobacterium tuberculosis , Tuberculose , Humanos , Adulto , Tuberculose/diagnóstico , Biomarcadores/metabolismo , ProteômicaRESUMO
BACKGROUND: StrongestPath is a Cytoscape 3 application that enables the analysis of interactions between two proteins or groups of proteins in a collection of protein-protein interaction (PPI) network or signaling network databases. When there are different levels of confidence over the interactions, the application is able to process them and identify the cascade of interactions with the highest total confidence score. Given a set of proteins, StrongestPath can extract a set of possible interactions between the input proteins, and expand the network by adding new proteins that have the most interactions with highest total confidence to the current network of proteins. The application can also identify any activating or inhibitory regulatory paths between two distinct sets of transcription factors and target genes. This application can be used on the built-in human and mouse PPI or signaling databases, or any user-provided database for some organism. RESULTS: Our results on 12 signaling pathways from the NetPath database demonstrate that the application can be used for indicating proteins which may play significant roles in a pathway by finding the strongest path(s) in the PPI or signaling network. CONCLUSION: Easy access to multiple public large databases, generating output in a short time, addressing some key challenges in one platform, and providing a user-friendly graphical interface make StrongestPath an extremely useful application.
Assuntos
Mapas de Interação de Proteínas , Proteínas , Animais , Camundongos , Proteínas/genética , Proteínas/metabolismoRESUMO
BACKGROUND: Acute lymphoblastic leukemia (ALL) is the most common type of cancer diagnosed in children and Glucocorticoids (GCs) form an essential component of the standard chemotherapy in most treatment regimens. The category of infant ALL patients carrying a translocation involving the mixed lineage leukemia (MLL) gene (gene KMT2A) is characterized by resistance to GCs and poor clinical outcome. Although some studies examined GC-resistance in infant ALL patients, the understanding of this phenomenon remains limited and impede the efforts to improve prognosis. METHODS: This study integrates differential co-expression (DC) and protein-protein interaction (PPI) networks to find active protein modules associated with GC-resistance in MLL-rearranged infant ALL patients. A network was constructed by linking differentially co-expressed gene pairs between GC-resistance and GC-sensitive samples and later integrated with PPI networks by keeping the links that are also present in the PPI network. The resulting network was decomposed into two sub-networks, specific to each phenotype. Finally, both sub-networks were clustered into modules using weighted gene co-expression network analysis (WGCNA) and further analyzed with functional enrichment analysis. RESULTS: Through the integration of DC analysis and PPI network, four protein modules were found active under the GC-resistance phenotype but not under the GC-sensitive. Functional enrichment analysis revealed that these modules are related to proteasome, electron transport chain, tRNA-aminoacyl biosynthesis, and peroxisome signaling pathways. These findings are in accordance with previous findings related to GC-resistance in other hematological malignancies such as pediatric ALL. CONCLUSIONS: Differential co-expression analysis is a promising approach to incorporate the dynamic context of gene expression profiles into the well-documented protein interaction networks. The approach allows the detection of relevant protein modules that are highly enriched with DC gene pairs. Functional enrichment analysis of detected protein modules generates new biological hypotheses and may help in explaining the GC-resistance in MLL-rearranged infant ALL patients.
Assuntos
Glucocorticoides/uso terapêutico , Leucemia-Linfoma Linfoblástico de Células Precursoras/tratamento farmacológico , Leucemia-Linfoma Linfoblástico de Células Precursoras/metabolismo , Bases de Dados de Proteínas , Resistencia a Medicamentos Antineoplásicos/genética , Resistencia a Medicamentos Antineoplásicos/fisiologia , Perfilação da Expressão Gênica , Redes Reguladoras de Genes/genética , Redes Reguladoras de Genes/fisiologia , Histona-Lisina N-Metiltransferase/genética , Histona-Lisina N-Metiltransferase/metabolismo , Humanos , Proteína de Leucina Linfoide-Mieloide/genética , Proteína de Leucina Linfoide-Mieloide/metabolismo , Transdução de Sinais/genética , Transdução de Sinais/fisiologiaRESUMO
By the development of information theory in 1948 by Claude Shannon to address the problems in the field of data storage and data communication over (noisy) communication channel, it has been successfully applied in many other research areas such as bioinformatics and systems biology. In this manuscript, we attempt to review some of the existing literatures in systems biology, which are using the information theory measures in their calculations. As we have reviewed most of the existing information-theoretic methods in gene regulatory and metabolic networks in the first part of the review, so in the second part of our study, the application of information theory in other types of biological networks including protein-protein interaction and signaling networks will be surveyed.
Assuntos
Mapas de Interação de Proteínas , Transdução de Sinais , Algoritmos , Animais , Entropia , Humanos , Teoria da Informação , Biologia de SistemasRESUMO
"A Mathematical Theory of Communication", was published in 1948 by Claude Shannon to establish a framework that is now known as information theory. In recent decades, information theory has gained much attention in the area of systems biology. The aim of this paper is to provide a systematic review of those contributions that have applied information theory in inferring or understanding of biological systems. Based on the type of system components and the interactions between them, we classify the biological systems into 4 main classes: gene regulatory, metabolic, protein-protein interaction and signaling networks. In the first part of this review, we attempt to introduce most of the existing studies on two types of biological networks, including gene regulatory and metabolic networks, which are founded on the concepts of information theory.
Assuntos
Redes Reguladoras de Genes , Redes e Vias Metabólicas , Algoritmos , Animais , Entropia , Humanos , Teoria da Informação , Biologia de Sistemas , IncertezaRESUMO
Background: The cervicovaginal epithelial barrier is crucial for defending the female reproductive tract against sexually transmitted infections. Hormones, specifically estradiol and progesterone, along with their respective receptor expressions, play an important role in modulating this barrier. However, the influence of estradiol and progesterone on gene and protein expression in the ectocervical mucosa of naturally cycling women is not well understood. Methods: Mucosal and blood samples were collected from Kenyan female sex workers at high risk of sexually transmitted infections. All samples were obtained at two time points, separated by two weeks, aiming for the follicular and luteal phases of the menstrual cycle. Ectocervical tissue biopsies were analyzed by RNA-sequencing and in situ immunofluorescence staining, cervicovaginal lavage samples (CVL) were evaluated using protein profiling, and plasma samples were analyzed for hormone levels. Results: Unsupervised clustering of RNA-sequencing data was performed using Weighted gene co-expression network analysis (WGCNA). In the follicular phase, estradiol levels positively correlated with a gene module representing epithelial structure and function, and negatively correlated with a gene module representing cell cycle regulation. These correlations were confirmed using regression analysis including adjustment for bacterial vaginosis status. Using WGCNA, no gene module correlated with progesterone levels in the follicular phase. In the luteal phase, no gene module correlated with either estradiol or progesterone levels. Protein profiling on CVL revealed that higher levels of estradiol during the follicular phase correlated with increased expression of epithelial barrier integrity markers, including DSG1. This contrasted to the limited correlations of protein expression with estradiol levels in the luteal phase. In situ imaging analysis confirmed that higher estradiol levels during the follicular phase correlated with increased DSG1 expression. Conclusion: We demonstrate that estradiol levels positively correlate with specific markers of ectocervical epithelial structure and function, particularly DSG1, during the follicular phase of the menstrual cycle. Neither progesterone levels during the follicular phase nor estradiol and progesterone levels during the luteal phase correlated with any specific sets of gene markers. These findings align with the expression of estradiol and progesterone receptors in the ectocervical epithelium during these menstrual phases.
Assuntos
Colo do Útero , Desmogleína 1 , Estradiol , Fase Folicular , Humanos , Feminino , Estradiol/sangue , Fase Folicular/metabolismo , Colo do Útero/metabolismo , Adulto , Desmogleína 1/metabolismo , Desmogleína 1/genética , Progesterona/sangue , Progesterona/metabolismo , Adulto Jovem , Fase Luteal/metabolismo , Profissionais do Sexo , Epitélio/metabolismoRESUMO
Glycolipids constitute a major part of the cell envelope of Mycobacterium tuberculosis (Mtb). They are potent immunomodulatory molecules recognized by several immune receptors like pattern recognition receptors such as TLR2, DC-SIGN and Dectin-2 on antigen-presenting cells and by T cell receptors on T lymphocytes. The Mtb glycolipids lipoarabinomannan (LAM) and its biosynthetic relatives, phosphatidylinositol mannosides (PIMs) and lipomannan (LM), as well as other Mtb glycolipids, such as phenolic glycolipids and sulfoglycolipids have the ability to modulate the immune response, stimulating or inhibiting a pro-inflammatory response. We explore here the downmodulating effect of Mtb glycolipids. A great proportion of the studies used in vitro approaches although in vivo infection with Mtb might also lead to a dampening of myeloid cell and T cell responses to Mtb glycolipids. This dampened response has been explored ex vivo with immune cells from peripheral blood from Mtb-infected individuals and in mouse models of infection. In addition to the dampening of the immune response caused by Mtb glycolipids, we discuss the hyporesponse to Mtb glycolipids caused by prolonged Mtb infection and/or exposure to Mtb antigens. Hyporesponse to LAM has been observed in myeloid cells from individuals with active and latent tuberculosis (TB). For some myeloid subsets, this effect is stronger in latent versus active TB. Since the immune response in individuals with latent TB represents a more protective profile compared to the one in patients with active TB, this suggests that downmodulation of myeloid cell functions by Mtb glycolipids may be beneficial for the host and protect against active TB disease. The mechanisms of this downmodulation, including tolerance through epigenetic modifications, are only partly explored.
Assuntos
Mycobacterium tuberculosis , Tuberculose , Animais , Camundongos , Glicolipídeos , Membrana Celular , Parede CelularRESUMO
Annually, approximately 10 million people are diagnosed with active tuberculosis (TB), and 1.4 million die of the disease. If left untreated, each person with active TB will infect 10-15 new individuals. The lack of non-sputum-based diagnostic tests leads to delayed diagnoses of active pulmonary TB cases, contributing to continued disease transmission. In this exploratory study, we aimed to identify biomarkers associated with active TB. We assessed the plasma levels of 92 proteins associated with inflammation in individuals with active TB (n = 20), latent TB (n = 14), or healthy controls (n = 10). Using co-expression network analysis, we identified one module of proteins with strong association with active TB. We removed proteins from the module that had low abundance or were associated with non-TB diseases in published transcriptomic datasets, resulting in a 12-protein plasma signature that was highly enriched in individuals with pulmonary and extrapulmonary TB and was further associated with disease severity.
RESUMO
Regardless of all efforts on community discovery algorithms, it is still an open and challenging subject in network science. Recognizing communities in a multilayer network, where there are several layers (types) of connections, is even more complicated. Here, we concentrated on a specific type of communities called seed-centric local communities in the multilayer environment and developed a novel method based on the information cascade concept, called PLCDM. Our simulations on three datasets (real and artificial) signify that the suggested method outstrips two known earlier seed-centric local methods. Additionally, we compared it with other global multilayer and single-layer methods. Eventually, we applied our method on a biological two-layer network of Colon Adenocarcinoma (COAD), reconstructed from transcriptomic and post-transcriptomic datasets, and assessed the output modules. The functional enrichment consequences infer that the modules of interest hold biomolecules involved in the pathways associated with the carcinogenesis.
Assuntos
Adenocarcinoma/genética , Algoritmos , Neoplasias do Colo/genética , Mapas de Interação de Proteínas/genética , Transcriptoma/genética , Adenocarcinoma/metabolismo , Carcinogênese/genética , Neoplasias do Colo/metabolismo , HumanosRESUMO
Complexity of cascading interrelations between molecular cell components at different levels from genome to metabolome ordains a massive difficulty in comprehending biological happenings. However, considering these complications in the systematic modelings will result in realistic and reliable outputs. The multilayer networks approach is a relatively innovative concept that could be applied for multiple omics datasets as an integrative methodology to overcome heterogeneity difficulties. Herein, we employed the multilayer framework to rehabilitate colon adenocarcinoma network by observing co-expression correlations, regulatory relations, and physical binding interactions. Hub nodes in this three-layer network were selected using a heterogeneous random walk with random jump procedure. We exploited local composite modules around the hub nodes having high overlay with cancer-specific pathways, and investigated their genes showing a different expressional pattern in the tumor progression. These genes were examined for survival effects on the patient's lifespan, and those with significant impacts were selected as potential candidate biomarkers. Results suggest that identified genes indicate noteworthy importance in the carcinogenesis of the colon.
Assuntos
Adenocarcinoma/genética , Carcinogênese/genética , Neoplasias do Colo/genética , Perfilação da Expressão Gênica/métodos , Regulação Neoplásica da Expressão Gênica/genética , Transcriptoma , Fatores Estimuladores Upstream/genética , Adenocarcinoma/patologia , Neoplasias do Colo/patologia , Progressão da Doença , Fator de Transcrição E2F1/genética , HumanosRESUMO
The task of drug-target interaction prediction holds significant importance in pharmacology and therapeutic drug design. In this paper, we present FRnet-DTI, an auto-encoder based feature manipulation and a convolutional neural network based classifier for drug target interaction prediction. Two convolutional neural networks are proposed: FRnet-Encode and FRnet-Predict. Here, one model is used for feature manipulation and the other one for classification. Using the first method FRnet-Encode, we generate 4096 features for each of the instances in each of the datasets and use the second method, FRnet-Predict, to identify interaction probability employing those features. We have tested our method on four gold standard datasets extensively used by other researchers. Experimental results shows that our method significantly improves over the state-of-the-art method on three out of four drug-target interaction gold standard datasets on both area under curve for Receiver Operating Characteristic (auROC) and area under Precision Recall curve (auPR) metric. We also introduce twenty new potential drug-target pairs for interaction based on high prediction scores. The source codes and implementation details of our methods are available from https://github.com/farshidrayhanuiu/FRnet-DTI/ and also readily available to use as an web application from http://farshidrayhan.pythonanywhere.com/FRnet-DTI/.
RESUMO
Pseudomonas aeruginosa and Staphylococcus aureus are two evolutionary distant bacterial species that are frequently isolated from persistent infections such as chronic infectious wounds and severe lung infections in cystic fibrosis patients. To the best of our knowledge no comprehensive genome scale co-expression study has been already conducted on these two species and in most cases only the expression of very few genes has been the subject of investigation. In this study, in order to investigate the level of expressional conservation between these two species, using heterogeneous gene expression datasets the weighted gene co-expression network analysis (WGCNA) approach was applied to study both single and cross species genome scale co-expression patterns of these two species. Single species co-expression network analysis revealed that in P. aeruginosa, genes involved in quorum sensing (QS), iron uptake, nitrate respiration and type III secretion systems and in S. aureus, genes associated with the regulation of carbon metabolism, fatty acid-phospholipids metabolism and proteolysis represent considerable co-expression across a variety of experimental conditions. Moreover, the comparison of gene co-expression networks between P. aeruginosa and S. aureus was led to the identification of four co-expressed gene modules in both species totally consisting of 318 genes. Several genes related to two component signal transduction systems, small colony variants (SCVs) morphotype and protein complexes were found in the detected modules. We believe that targeting the key players among the identified co-expressed orthologous genes will be a potential intervention strategy to control refractory co-infections caused by these two bacterial species.
Assuntos
Regulação Bacteriana da Expressão Gênica , Genes Bacterianos , Pseudomonas aeruginosa/genética , Staphylococcus aureus/genética , Virulência/genética , Humanos , Infecções por Pseudomonas/microbiologia , Pseudomonas aeruginosa/patogenicidade , Percepção de Quorum , Infecções Estafilocócicas/microbiologia , Staphylococcus aureus/patogenicidadeRESUMO
The Hippo signaling pathway (HSP) has been identified as an essential and complex signaling pathway for tumor suppression that coordinates proliferation, differentiation, cell death, cell growth and stemness. In the present study, we conducted a genome-scale co-expression analysis to reconstruct the HSP in colorectal cancer (CRC). Five key modules were detected through network clustering, and a detailed discussion of two modules containing respectively 18 and 13 over and down-regulated members of HSP was provided. Our results suggest new potential regulatory factors in the HSP. The detected modules also suggest novel genes contributing to CRC. Moreover, differential expression analysis confirmed the differential expression pattern of HSP members and new suggested regulatory factors between tumor and normal samples. These findings can further reveal the importance of HSP in CRC.
Assuntos
Neoplasias Colorretais , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Modelos Biológicos , Proteínas de Neoplasias , Proteínas Serina-Treonina Quinases , Transdução de Sinais , Neoplasias Colorretais/genética , Neoplasias Colorretais/metabolismo , Neoplasias Colorretais/patologia , Estudo de Associação Genômica Ampla , Via de Sinalização Hippo , Humanos , Proteínas de Neoplasias/genética , Proteínas de Neoplasias/metabolismo , Proteínas Serina-Treonina Quinases/genética , Proteínas Serina-Treonina Quinases/metabolismoRESUMO
Prediction of new drug-target interactions is critically important as it can lead the researchers to find new uses for old drugs and to disclose their therapeutic profiles or side effects. However, experimental prediction of drug-target interactions is expensive and time-consuming. As a result, computational methods for predictioning new drug-target interactions have gained a tremendous interest in recent times. Here we present iDTI-ESBoost, a prediction model for identification of drug-target interactions using evolutionary and structural features. Our proposed method uses a novel data balancing and boosting technique to predict drug-target interaction. On four benchmark datasets taken from a gold standard data, iDTI-ESBoost outperforms the state-of-the-art methods in terms of area under receiver operating characteristic (auROC) curve. iDTI-ESBoost also outperforms the latest and the best-performing method found in the literature in terms of area under precision recall (auPR) curve. This is significant as auPR curves are argued as suitable metric for comparison for imbalanced datasets similar to the one studied here. Our reported results show the effectiveness of the classifier, balancing methods and the novel features incorporated in iDTI-ESBoost. iDTI-ESBoost is a novel prediction method that has for the first time exploited the structural features along with the evolutionary features to predict drug-protein interactions. We believe the excellent performance of iDTI-ESBoost both in terms of auROC and auPR would motivate the researchers and practitioners to use it to predict drug-target interactions. To facilitate that, iDTI-ESBoost is implemented and made publicly available at: http://farshidrayhan.pythonanywhere.com/iDTI-ESBoost/ .
Assuntos
Sistemas de Liberação de Medicamentos/métodos , Descoberta de Drogas/métodos , Algoritmos , Biologia Computacional/métodos , Simulação por Computador , Interações Medicamentosas , Previsões/métodos , Proteínas/química , Curva ROCRESUMO
PURPOSE: Despite vast improvements that have been made in the treatment of children with acute lymphoblastic leukemia (ALL), the majority of infant ALL patients (~80 %, < 1 year of age) that carry a chromosomal translocation involving the mixed lineage leukemia (MLL) gene shows a poor response to chemotherapeutic drugs, especially glucocorticoids (GCs), which are essential components of all current treatment regimens. Although addressed in several studies, the mechanism(s) underlying this phenomenon have remained largely unknown. A major drawback of most previous studies is their primary focus on individual genes, thereby neglecting the putative significance of inter-gene correlations. Here, we aimed at studying GC resistance in MLL-rearranged infant ALL patients by inferring an associated module of genes using co-expression network analysis. The implications of newly identified candidate genes with associations to other well-known relevant genes from the same module, or with associations to known transcription factor or microRNA interactions, were substantiated using literature data. METHODS: A weighted gene co-expression network was constructed to identify gene modules associated with GC resistance in MLL-rearranged infant ALL patients. Significant gene ontology (GO) terms and signaling pathways enriched in relevant modules were used to provide guidance towards which module(s) consisted of promising candidates suitable for further analysis. RESULTS: Through gene co-expression network analysis a novel set of genes (module) related to GC-resistance was identified. The presence in this module of the S100 and ANXA genes, both well-known biomarkers for GC resistance in MLL-rearranged infant ALL, supports its validity. Subsequent gene set net correlation analyses of the novel module provided further support for its validity by showing that the S100 and ANXA genes act as 'hub' genes with potentially major regulatory roles in GC sensitivity, but having lost this role in the GC resistant phenotype. The detected module implicates new genes as being candidates for further analysis through associations with known GC resistance-related genes. CONCLUSIONS: From our data we conclude that available systems biology approaches can be employed to detect new candidate genes that may provide further insights into drug resistance of MLL-rearranged infant ALL cases. Such approaches complement conventional gene-wise approaches by taking putative functional interactions between genes into account.
Assuntos
Antineoplásicos/uso terapêutico , Resistencia a Medicamentos Antineoplásicos/genética , Perfilação da Expressão Gênica/métodos , Glucocorticoides/uso terapêutico , Leucemia-Linfoma Linfoblástico de Células Precursoras/tratamento farmacológico , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Redes Reguladoras de Genes , Histona-Lisina N-Metiltransferase/genética , Humanos , Lactente , Proteína de Leucina Linfoide-Mieloide/genéticaRESUMO
The labor-intensive and expensive experimental process of drug-target interaction prediction has motivated many researchers to focus on in silico prediction, which leads to the helpful information in supporting the experimental interaction data. Therefore, they have proposed several computational approaches for discovering new drug-target interactions. Several learning-based methods have been increasingly developed which can be categorized into two main groups: similarity-based and feature-based. In this paper, we firstly use the bi-gram features extracted from the Position Specific Scoring Matrix (PSSM) of proteins in predicting drug-target interactions. Our results demonstrate the high-confidence prediction ability of the Bigram-PSSM model in terms of several performance indicators specifically for enzymes and ion channels. Moreover, we investigate the impact of negative selection strategy on the performance of the prediction, which is not widely taken into account in the other relevant studies. This is important, as the number of non-interacting drug-target pairs are usually extremely large in comparison with the number of interacting ones in existing drug-target interaction data. An interesting observation is that different levels of performance reduction have been attained for four datasets when we change the sampling method from the random sampling to the balanced sampling.
Assuntos
Bases de Dados Factuais/normas , Interações Medicamentosas , Preparações Farmacêuticas/metabolismo , Matrizes de Pontuação de Posição Específica , Proteínas/metabolismo , Previsões , Humanos , Preparações Farmacêuticas/química , Proteínas/química , Distribuição AleatóriaRESUMO
Detecting functional motifs in biological networks is one of the challenging problems in systems biology. Given a multiset of colors as query and a list-colored graph (an undirected graph with a set of colors assigned to each of its vertices), the problem is reduced to finding connected subgraphs, which best cover the multiset of query. To solve this NP-complete problem, we propose a new color-based centrality measure for list-colored graphs. Based on this newly-defined measure of centrality, a novel polynomial time algorithm is developed to discover functional motifs in list-colored graphs, using a greedy strategy. This algorithm, called CeFunMO, has superior running time and acceptable accuracy in comparison with other well-known algorithms, such as RANGI and GraMoFoNe.
Assuntos
Biologia Computacional/métodos , Mapeamento de Interação de Proteínas/métodos , Software , Algoritmos , Animais , Humanos , LevedurasRESUMO
INTRODUCTION: Identification of the interaction between drugs and target proteins is a crucial task in genomic drug discovery. The in silico prediction is an appropriate alternative for the laborious and costly experimental process of drug-target interaction prediction. Developing a variety of computational methods opens a new direction in analyzing and detecting new drug-target pairs. AREAS COVERED: In this review, we will focus on chemogenomic methods which have established a learning framework for predicting drug-target interactions. Learning-based methods are classified into supervised and semi-supervised, and the supervised learning methods are studied as two separate parts including similarity-based methods and feature-based methods. EXPERT OPINION: In spite of many improvements for pharmacology applications by learning-based methods, there are many over simplification settings in construction of predictive models that may lead to over-optimistic results on drug-target interaction prediction.
Assuntos
Descoberta de Drogas/métodos , Genômica/métodos , Terapia de Alvo Molecular , Simulação por Computador , Humanos , Modelos TeóricosRESUMO
With the growing understanding of complex diseases, the focus of drug discovery has shifted away from the well-accepted "one target, one drug" model, to a new "multi-target, multi-drug" model, aimed at systemically modulating multiple targets. Identification of the interaction between drugs and target proteins plays an important role in genomic drug discovery, in order to discover new drugs or novel targets for existing drugs. Due to the laborious and costly experimental process of drug-target interaction prediction, in silico prediction could be an efficient way of providing useful information in supporting experimental interaction data. An important notion that has emerged in post-genomic drug discovery is that the large-scale integration of genomic, proteomic, signaling and metabolomic data can allow us to construct complex networks of the cell that would provide us with a new framework for understanding the molecular basis of physiological or pathophysiological states. An emerging paradigm of polypharmacology in the post-genomic era is that drug, target and disease spaces can be correlated to study the effect of drugs on different spaces and their interrelationships can be exploited for designing drugs or cocktails which can effectively target one or more disease states. The future goal, therefore, is to create a computational platform that integrates genome-scale metabolic pathway, protein-protein interaction networks, gene transcriptional analysis in order to build a comprehensive network for multi-target multi-drug discovery.