RESUMO
For breast cancer, clinically important subtypes are well characterized at the molecular level in terms of gene expression profiles. In addition, signaling pathways in breast cancer have been extensively studied as therapeutic targets due to their roles in tumor growth and metastasis. However, it is challenging to put signaling pathways and gene expression profiles together to characterize biological mechanisms of breast cancer subtypes since many signaling events result from post-translational modifications, rather than gene expression differences. We designed a logic-based computational framework to explain the differences in gene expression profiles among breast cancer subtypes using Pathway Logic and transcriptional network information. Pathway Logic is a rewriting-logic-based formal system for modeling biological pathways including post-translational modifications. Our method demonstrated its utility by constructing subtype-specific path from key receptors (TNFR, TGFBR1 and EGFR) to key transcription factor (TF) regulators (RELA, ATF2, SMAD3 and ELK1) and identifying potential association between pathways via TFs in basal-specific paths, which could provide a novel insight on aggressive breast cancer subtypes. Codes and results are available at http://epigenomics.snu.ac.kr/PL/.
Assuntos
Neoplasias da Mama/genética , Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , Redes Reguladoras de Genes , Modelos Genéticos , Algoritmos , Conjuntos de Dados como Assunto , Fator de Crescimento Epidérmico/genética , Fator de Crescimento Epidérmico/metabolismo , Feminino , Regulação Neoplásica da Expressão Gênica , Humanos , Processamento de Proteína Pós-Traducional/genética , Transdução de Sinais/genética , Fatores de Transcrição/metabolismo , Fator de Crescimento Transformador beta1/genética , Fator de Crescimento Transformador beta1/metabolismo , Fator de Necrose Tumoral alfa/genética , Fator de Necrose Tumoral alfa/metabolismoRESUMO
BACKGROUND: As more complete genome sequences become available, bioinformatics challenges arise in how to exploit genome sequences to make phenotypic predictions. One type of phenotypic prediction is to determine sets of compounds that will support the growth of a bacterium from the metabolic network inferred from the genome sequence of that organism. RESULTS: We present a method for computationally determining alternative growth media for an organism based on its metabolic network and transporter complement. Our method predicted 787 alternative anaerobic minimal nutrient sets for Escherichia coli K-12 MG1655 from the EcoCyc database. The program automatically partitioned the nutrients within these sets into 21 equivalence classes, most of which correspond to compounds serving as sources of carbon, nitrogen, phosphorous, and sulfur, or combinations of these essential elements. The nutrient sets were predicted with 72.5% accuracy as evaluated by comparison with 91 growth experiments. Novel aspects of our approach include (a) exhaustive consideration of all combinations of nutrients rather than assuming that all element sources can substitute for one another(an assumption that can be invalid in general) (b) leveraging the notion of a machinery-duplicating constraint, namely, that all intermediate metabolites used in active reactions must be produced in increasing concentrations to prevent successive dilution from cell division, (c) the use of Satisfiability Modulo Theory solvers rather than Linear Programming solvers, because our approach cannot be formulated as linear programming, (d) the use of Binary Decision Diagrams to produce an efficient implementation. CONCLUSIONS: Our method for generating minimal nutrient sets from the metabolic network and transporters of an organism combines linear constraint solving with binary decision diagrams to efficiently produce solution sets to provided growth problems.
Assuntos
Algoritmos , Meios de Cultura , Redes e Vias Metabólicas , Biologia Computacional/métodos , Escherichia coli K12/genética , Escherichia coli K12/crescimento & desenvolvimento , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/metabolismo , Genômica , Proteínas de Membrana Transportadoras/metabolismo , Modelos BiológicosRESUMO
PURPOSE: New strategies for developing inhibitors of Mycobacterium tuberculosis (Mtb) are required in order to identify the next generation of tuberculosis (TB) drugs. Our approach leverages the integration of intensive data mining and curation and computational approaches, including cheminformatics combined with bioinformatics, to suggest biological targets and their small molecule modulators. METHODS: We now describe an approach that uses the TBCyc pathway and genome database, the Collaborative Drug Discovery database of molecules with activity against Mtb and their associated targets, a 3D pharmacophore approach and Bayesian models of TB activity in order to select pathways and metabolites and ultimately prioritize molecules that may be acting as substrate mimics and exhibit activity against TB. RESULTS: In this study we combined the TB cheminformatics and pathways databases that enabled us to computationally search >80,000 vendor available molecules and ultimately test 23 compounds in vitro that resulted in two compounds (N-(2-furylmethyl)-N'-[(5-nitro-3-thienyl)carbonyl]thiourea and N-[(5-nitro-3-thienyl)carbonyl]-N'-(2-thienylmethyl)thiourea) proposed as mimics of D-fructose 1,6 bisphosphate, (MIC of 20 and 40 µg/ml, respectively). CONCLUSION: This is a simple yet novel approach that has the potential to identify inhibitors of bacterial growth as illustrated by compounds identified in this study that have activity against Mtb.
Assuntos
Antituberculosos/química , Antituberculosos/farmacologia , Biologia Computacional/métodos , Descoberta de Drogas/métodos , Mycobacterium tuberculosis/efeitos dos fármacos , Tuberculose/tratamento farmacológico , Animais , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Teorema de Bayes , Mineração de Dados , Bases de Dados Factuais , Humanos , Redes e Vias Metabólicas/efeitos dos fármacos , Modelos Moleculares , Terapia de Alvo Molecular/métodos , Mycobacterium tuberculosis/enzimologia , Mycobacterium tuberculosis/genética , Mycobacterium tuberculosis/metabolismo , Tuberculose/microbiologiaRESUMO
The study of biological systems is complex and of great importance. There exist numerous approaches to signal transduction processes, including symbolic modeling of cellular adaptation. The use of formal methods for computational systems biology eases the analysis of cellular models and the establishment of the causes and consequences of certain cellular situations associated to diseases. In this paper, we define an application of logic modeling with rewriting logic and soft set theory. Our approach to decision making with soft sets offers a novel strategy that complements standard strategies. We implement a metalevel strategy to control and guide the rewriting process of the Maude rewriting engine. In particular, we adapt mathematical methods to capture imprecision, vagueness, and uncertainty in the available data. Using this new strategy, we propose an extension in the biological symbolic models of Pathway Logic. Our ultimate aim is to automatically determine the rules that are most appropriate and adjusted to reality in dynamic systems using decision making with incomplete soft sets.
RESUMO
In biological systems, pathways define complex interaction networks where multiple molecular elements are involved in a series of controlled reactions producing responses to specific biomolecular signals. These biosystems are dynamic and there is a need for mathematical and computational methods able to analyze the symbolic elements and the interactions between them and produce adequate readouts of such systems. In this work, we use rewriting logic to analyze the cellular signaling of epidermal growth factor (EGF) and its cell surface receptor (EGFR) in order to induce cellular proliferation. Signaling is initiated by binding the ligand protein EGF to the membrane-bound receptor EGFR so as to trigger a reactions path which have several linked elements through the cell from the membrane till the nucleus. We present two different types of search for analyzing the EGF/proliferation system with the help of Pathway Logic tool, which provides a knowledge-based development environment to carry out the modeling of the signaling. The first one is a standard (forward) search. The second one is a novel approach based on narrowing, which allows us to trace backwards the causes of a given final state. The analysis allows the identification of critical elements that have to be activated to provoke proliferation.
Assuntos
Fator de Crescimento Epidérmico/metabolismo , Lógica , Modelos Biológicos , Transdução de Sinais , Proliferação de Células , Receptores ErbB/metabolismo , HumanosRESUMO
Integrated computational approaches for Mycobacterium tuberculosis (Mtb) are useful to identify new molecules that could lead to future tuberculosis (TB) drugs. Our approach uses information derived from the TBCyc pathway and genome database, the Collaborative Drug Discovery TB database combined with 3D pharmacophores and dual event Bayesian models of whole-cell activity and lack of cytotoxicity. We have prioritized a large number of molecules that may act as mimics of substrates and metabolites in the TB metabolome. We computationally searched over 200,000 commercial molecules using 66 pharmacophores based on substrates and metabolites from Mtb and further filtering with Bayesian models. We ultimately tested 110 compounds in vitro that resulted in two compounds of interest, BAS 04912643 and BAS 00623753 (MIC of 2.5 and 5 µg/mL, respectively). These molecules were used as a starting point for hit-to-lead optimization. The most promising class proved to be the quinoxaline di-N-oxides, evidenced by transcriptional profiling to induce mRNA level perturbations most closely resembling known protonophores. One of these, SRI58 exhibited an MIC = 1.25 µg/mL versus Mtb and a CC50 in Vero cells of >40 µg/mL, while featuring fair Caco-2 A-B permeability (2.3 x 10-6 cm/s), kinetic solubility (125 µM at pH 7.4 in PBS) and mouse metabolic stability (63.6% remaining after 1 h incubation with mouse liver microsomes). Despite demonstration of how a combined bioinformatics/cheminformatics approach afforded a small molecule with promising in vitro profiles, we found that SRI58 did not exhibit quantifiable blood levels in mice.
Assuntos
Antituberculosos/farmacologia , Biologia Computacional/métodos , Metaboloma/efeitos dos fármacos , Mycobacterium tuberculosis/metabolismo , Bibliotecas de Moléculas Pequenas/farmacologia , Animais , Antituberculosos/química , Teorema de Bayes , Células CACO-2 , Chlorocebus aethiops , Avaliação Pré-Clínica de Medicamentos/métodos , Humanos , Camundongos , Testes de Sensibilidade Microbiana , Mycobacterium tuberculosis/efeitos dos fármacos , Bibliotecas de Moléculas Pequenas/química , Tuberculose/tratamento farmacológico , Tuberculose/microbiologia , Células VeroRESUMO
BACKGROUND: Chagas disease is a neglected tropical disease (NTD) caused by the eukaryotic parasite Trypanosoma cruzi. The current clinical and preclinical pipeline for T. cruzi is extremely sparse and lacks drug target diversity. METHODOLOGY/PRINCIPAL FINDINGS: In the present study we developed a computational approach that utilized data from several public whole-cell, phenotypic high throughput screens that have been completed for T. cruzi by the Broad Institute, including a single screen of over 300,000 molecules in the search for chemical probes as part of the NIH Molecular Libraries program. We have also compiled and curated relevant biological and chemical compound screening data including (i) compounds and biological activity data from the literature, (ii) high throughput screening datasets, and (iii) predicted metabolites of T. cruzi metabolic pathways. This information was used to help us identify compounds and their potential targets. We have constructed a Pathway Genome Data Base for T. cruzi. In addition, we have developed Bayesian machine learning models that were used to virtually screen libraries of compounds. Ninety-seven compounds were selected for in vitro testing, and 11 of these were found to have EC50 < 10 µM. We progressed five compounds to an in vivo mouse efficacy model of Chagas disease and validated that the machine learning model could identify in vitro active compounds not in the training set, as well as known positive controls. The antimalarial pyronaridine possessed 85.2% efficacy in the acute Chagas mouse model. We have also proposed potential targets (for future verification) for this compound based on structural similarity to known compounds with targets in T. cruzi. CONCLUSIONS/ SIGNIFICANCE: We have demonstrated how combining chemoinformatics and bioinformatics for T. cruzi drug discovery can bring interesting in vivo active molecules to light that may have been overlooked. The approach we have taken is broadly applicable to other NTDs.
Assuntos
Doença de Chagas/parasitologia , Descoberta de Drogas/métodos , Genoma de Protozoário/genética , Aprendizado de Máquina , Tripanossomicidas/farmacologia , Trypanosoma cruzi/genética , Animais , Teorema de Bayes , Linhagem Celular , Doença de Chagas/tratamento farmacológico , Biologia Computacional , Modelos Animais de Doenças , Feminino , Ensaios de Triagem em Larga Escala , Humanos , Redes e Vias Metabólicas , Camundongos , Camundongos Endogâmicos BALB C , Tripanossomicidas/isolamento & purificação , Trypanosoma cruzi/efeitos dos fármacosRESUMO
Classical antibiotic discovery efforts have relied mainly on molecular library screening coupled with target-based lead optimization. The conventional approaches are unable to tackle the emergence of antibiotic resistance and are failing to provide understanding of multiple mechanisms behind drug actions and the off-target effects. These insufficiencies have prompted researchers to focus on a multidisciplinary approach of systems biology-based antibiotic discovery. Systems biology is capable of providing a big-picture view for therapeutic targets through interconnected networks of biochemical reactions derived from both experimental and computational techniques. In this chapter, we have compiled software tools and databases that are typically used for target identification through in silico analyses. We have also identified enzyme- and broad-spectrum metabolite-based drug targets that have emerged through in silico systems microbiology.
Assuntos
Anti-Infecciosos/farmacologia , Avaliação Pré-Clínica de Medicamentos/métodos , Biologia de Sistemas/métodos , Animais , Bases de Dados Factuais , Humanos , Microbiologia , SoftwareRESUMO
We are witnessing the growing menace of both increasing cases of drug-sensitive and drug-resistant Mycobacterium tuberculosis strains and the challenge to produce the first new tuberculosis (TB) drug in well over 40 years. The TB community, having invested in extensive high-throughput screening efforts, is faced with the question of how to optimally leverage these data to move from a hit to a lead to a clinical candidate and potentially, a new drug. Complementing this approach, yet conducted on a much smaller scale, cheminformatic techniques have been leveraged and are examined in this review. We suggest that these computational approaches should be optimally integrated within a workflow with experimental approaches to accelerate TB drug discovery.
Assuntos
Antituberculosos/química , Antituberculosos/farmacologia , Descoberta de Drogas , Bases de Dados Factuais , Ensaios de Triagem em Larga Escala , Humanos , Mycobacterium tuberculosis/efeitos dos fármacosRESUMO
BACKGROUND: Cancer is a heterogeneous disease resulting from the accumulation of genetic defects that negatively impact control of cell division, motility, adhesion and apoptosis. Deregulation in signaling along the EgfR-MAPK pathway is common in breast cancer, though the manner in which deregulation occurs varies between both individuals and cancer subtypes. RESULTS: We were interested in identifying subnetworks within the EgfR-MAPK pathway that are similarly deregulated across subsets of breast cancers. To that end, we mapped genomic, transcriptional and proteomic profiles for 30 breast cancer cell lines onto a curated Pathway Logic symbolic systems model of EgfR-MAPK signaling. This model was composed of 539 molecular states and 396 rules governing signaling between active states. We analyzed these models and identified several subtype-specific subnetworks, including one that suggested Pak1 is particularly important in regulating the MAPK cascade when it is over-expressed. We hypothesized that Pak1 over-expressing cell lines would have increased sensitivity to Mek inhibitors. We tested this experimentally by measuring quantitative responses of 20 breast cancer cell lines to three Mek inhibitors. We found that Pak1 over-expressing luminal breast cancer cell lines are significantly more sensitive to Mek inhibition compared to those that express Pak1 at low levels. This indicates that Pak1 over-expression may be a useful clinical marker to identify patient populations that may be sensitive to Mek inhibitors. CONCLUSIONS: All together, our results support the utility of symbolic system biology models for identification of therapeutic approaches that will be effective against breast cancer subsets.