RESUMO
Transcriptional regulation enables cells to respond to environmental changes. Of the estimated 304 candidate transcription factors (TFs) in Escherichia coli K-12 MG1655, 185 have been experimentally identified, but ChIP methods have been used to fully characterize only a few dozen. Identifying these remaining TFs is key to improving our knowledge of the E. coli transcriptional regulatory network (TRN). Here, we developed an integrated workflow for the computational prediction and comprehensive experimental validation of TFs using a suite of genome-wide experiments. We applied this workflow to (i) identify 16 candidate TFs from over a hundred uncharacterized genes; (ii) capture a total of 255 DNA binding peaks for ten candidate TFs resulting in six high-confidence binding motifs; (iii) reconstruct the regulons of these ten TFs by determining gene expression changes upon deletion of each TF and (iv) identify the regulatory roles of three TFs (YiaJ, YdcI, and YeiE) as regulators of l-ascorbate utilization, proton transfer and acetate metabolism, and iron homeostasis under iron-limited conditions, respectively. Together, these results demonstrate how this workflow can be used to discover, characterize, and elucidate regulatory functions of uncharacterized TFs in parallel.
Assuntos
Escherichia coli K12/genética , Proteínas de Escherichia coli/genética , Perfilação da Expressão Gênica , Fatores de Transcrição/genética , Escherichia coli K12/metabolismo , Proteínas de Escherichia coli/metabolismo , Regulação Bacteriana da Expressão Gênica , Redes Reguladoras de Genes , Análise de Sequência com Séries de Oligonucleotídeos/métodos , Fatores de Transcrição/metabolismoRESUMO
Many frequently prescribed drugs are non-genotoxic carcinogens (NGC) in rodent liver. Their mode of action and health risks for humans remain to be elucidated. Here, we investigated the impact of two model NGC, the anti-epileptic drug phenobarbital (PB) and the contraceptive cyproterone acetate (CPA), on intrahepatic epithelial-mesenchymal crosstalk and on growth of first stages of hepatocarcinogenesis. Unaltered hepatocytes (HC) and preneoplastic HC (HCPREN) were isolated from rat liver for primary culture. DNA replication of HC and HCPREN was increased by in vitro treatment with 10 µM CPA, but not 1 mM PB. Next, mesenchymal cells (MC) obtained from liver of rats treated with either PB (50 mg/kg bw/day) or CPA (100 mg/kg bw/day), were cultured. Supernatants from both types of MC raised DNA synthesis of HC and HCPREN. This indicates that PB induces replication of HC and HCPREN only indirectly, via growth factors secreted by MC. CPA, however, acts on HC and HCPREN directly as well as indirectly via mesenchymal factors. Transcriptomics and bio-informatics revealed that PB and CPA induce extensive changes in the expression profile of MC affecting many growth factors and pathways. MC from PB-treated rats produced and secreted enhanced levels of HBEGF and GDF15, factors found to suppress apoptosis and/or induce DNA synthesis in cultured HC and HCPREN. MC from CPA-treated animals showed enhanced expression and secretion of HGF, which strongly raised DNA replication of HC and HCPREN. In conclusion, our findings reveal profound effects of two prototypical NGC on the hepatic mesenchyme. The resulting release of factors, which suppress apoptosis and/or enhance cell replication preferentially in cancer prestages, appears to be crucial for tumor promotion by NGC in the liver.
Assuntos
Carcinógenos/toxicidade , Acetato de Ciproterona/toxicidade , Hepatócitos/efeitos dos fármacos , Peptídeos e Proteínas de Sinalização Intercelular/metabolismo , Mesoderma/citologia , Fenobarbital/toxicidade , Animais , Apoptose , Testes de Carcinogenicidade , Células Cultivadas , Replicação do DNA , Feminino , Fígado/citologia , Fígado/efeitos dos fármacos , Masculino , Cultura Primária de Células , Ratos , Ratos WistarRESUMO
UNLABELLED: JSBML, the official pure Java programming library for the Systems Biology Markup Language (SBML) format, has evolved with the advent of different modeling formalisms in systems biology and their ability to be exchanged and represented via extensions of SBML. JSBML has matured into a major, active open-source project with contributions from a growing, international team of developers who not only maintain compatibility with SBML, but also drive steady improvements to the Java interface and promote ease-of-use with end users. AVAILABILITY AND IMPLEMENTATION: Source code, binaries and documentation for JSBML can be freely obtained under the terms of the LGPL 2.1 from the website http://sbml.org/Software/JSBML. More information about JSBML can be found in the user guide at http://sbml.org/Software/JSBML/docs/. CONTACT: jsbml-development@googlegroups.com or andraeger@eng.ucsd.edu SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Assuntos
Modelos Biológicos , Software , Biologia de Sistemas , Simulação por Computador , Linguagens de ProgramaçãoRESUMO
Many environmental pollutants and drugs, including steroid hormones, hypolipidemics and antiepileptics, are non-genotoxic carcinogens (NGC) in rodent liver. The mechanism of action and the risk for human health are still insufficiently known. Here, we study the effects of phenobarbital (PB), a widely used model NGC, on hepatic epithelial-mesenchymal crosstalk and the impact on hepatic apoptosis. Mesenchymal cells (MC) and hepatocytes (HC) were isolated from control and PB-treated rat livers. PB induced extensive changes in gene expression in MC and much less in HC as shown by transcriptomics with oligoarrays. In MC only, transcript levels of numerous proinflammatory cytokines were elevated. Correspondingly, ELISA on the supernatant of MC from PB-treated rats revealed enhanced release of various cytokines. In cultured HC, this supernatant caused (i) nuclear translocation and activation of nuclear factor-κB (shown by immunoblots of nuclear extracts and reporter gene assays), (ii) elevated expression of proinflammatory genes and (iii) protection from the proapoptotic action of transforming growth factor beta 1 (TGFß1). PB treatment in vivo or in vitro elevated the production and release of tumor necrosis factor alpha from MC, which was identified as mainly responsible for the inhibition of apoptosis in HC. In conclusion, our findings reveal profound proinflammatory effects of PB on hepatic mesenchyme and mesenchymal-epithelial interactions. The resulting release of cytokines acts antiapoptotic in HC, an effect crucial for tumor promotion and carcinogenesis by NGC.
Assuntos
Apoptose/efeitos dos fármacos , Carcinógenos/toxicidade , Fenobarbital/toxicidade , Animais , Células Cultivadas , Hepatócitos/efeitos dos fármacos , Inflamação/genética , Inflamação/metabolismo , Neoplasias Hepáticas/induzido quimicamente , Masculino , NF-kappa B/metabolismo , Ratos Wistar , Receptores de Glicina/genética , Receptores de Glicina/metabolismo , TranscriptomaRESUMO
The process of hepatocarcinogenesis in the diethylnitrosamine (DEN) initiation/phenobarbital (PB) promotion mouse model involves the selective clonal outgrowth of cells harboring oncogene mutations in Ctnnb1, while spontaneous or DEN-only-induced tumors are often Ha-ras- or B-raf-mutated. The molecular mechanisms and pathways underlying these different tumor sub-types are not well characterized. Their identification may help identify markers for xenobiotic promoted versus spontaneously occurring liver tumors. Here, we have characterized mouse liver tumors harboring either Ctnnb1 or Ha-ras mutations via integrated molecular profiling at the transcriptional, translational and post-translational levels. In addition, metabolites of the intermediary metabolism were quantified by high resolution (1)H magic angle nuclear magnetic resonance. We have identified tumor genotype-specific differences in mRNA and miRNA expression, protein levels, post-translational modifications, and metabolite levels that facilitate the molecular and biochemical stratification of tumor phenotypes. Bioinformatic integration of these data at the pathway level led to novel insights into tumor genotype-specific aberrant cell signaling and in particular to a better understanding of alterations in pathways of the cell intermediary metabolism, which are driven by the constitutive activation of the ß-Catenin and Ha-ras oncoproteins in tumors of the two genotypes.
Assuntos
Biomarcadores Tumorais/genética , Perfilação da Expressão Gênica , Genes ras/genética , Neoplasias Hepáticas Experimentais/genética , Neoplasias Hepáticas Experimentais/metabolismo , Metabolômica , Mutação/genética , beta Catenina/genética , Animais , Biomarcadores Tumorais/metabolismo , Western Blotting , Redes e Vias Metabólicas , Camundongos , MicroRNAs/genética , Análise de Sequência com Séries de Oligonucleotídeos , Processamento de Proteína Pós-Traducional , RNA Mensageiro/genética , Reação em Cadeia da Polimerase em Tempo Real , Reação em Cadeia da Polimerase Via Transcriptase ReversaRESUMO
SUMMARY: Microarrays are commonly used to detect changes in gene expression between different biological samples. For this purpose, many analysis tools have been developed that offer visualization, statistical analysis and more sophisticated analysis methods. Most of these tools are designed specifically for messenger RNA microarrays. However, today, more and more different microarray platforms are available. Changes in DNA methylation, microRNA expression or even protein phosphorylation states can be detected with specialized arrays. For these microarray technologies, the number of available tools is small compared with mRNA analysis tools. Especially, a joint analysis of different microarray platforms that have been used on the same set of biological samples is hardly supported by most microarray analysis tools. Here, we present InCroMAP, a tool for the analysis and visualization of high-level microarray data from individual or multiple different platforms. Currently, InCroMAP supports mRNA, microRNA, DNA methylation and protein modification datasets. Several methods are offered that allow for an integrated analysis of data from those platforms. The available features of InCroMAP range from visualization of DNA methylation data over annotation of microRNA targets and integrated gene set enrichment analysis to a joint visualization of data from all platforms in the context of metabolic or signalling pathways. AVAILABILITY: InCroMAP is freely available as Java™ application at www.cogsys.cs.uni-tuebingen.de/software/InCroMAP, including a comprehensive user's guide and example files.
Assuntos
Análise de Sequência com Séries de Oligonucleotídeos/métodos , Software , Metilação de DNA , Expressão Gênica , Perfilação da Expressão Gênica , MicroRNAs/metabolismo , Análise Serial de Proteínas , Proteínas/metabolismo , RNA Mensageiro/metabolismoRESUMO
We present a new tool for hepatocarcinogenicity evaluation of drug candidates in rodents. ToxDBScan is a web tool offering quick and easy similarity screening of new drug candidates against two large-scale public databases, which contain expression profiles for substances with known carcinogenic profiles: TG-GATEs and DrugMatrix. ToxDBScan uses a set similarity score that computes the putative similarity based on similar expression of genes to identify chemicals with similar genotoxic and hepatocarcinogenic potential. We propose using a discretized representation of expression profiles, which use only information on up- or down-regulation of genes as relevant features. Therefore, only the deregulated genes are required as input. ToxDBScan provides an extensive report on similar compounds, which includes additional information on compounds, differential genes and pathway enrichments. We evaluated ToxDBScan with expression data from 15 chemicals with known hepatocarcinogenic potential and observed a sensitivity of 88 Based on the identified chemicals, we achieved perfect classification of the independent test set. ToxDBScan is publicly available from the ZBIT Bioinformatics Toolbox.
Assuntos
Carcinógenos/toxicidade , Neoplasias Hepáticas/induzido quimicamente , Animais , Biologia Computacional/métodos , Bases de Dados Factuais , Regulação para Baixo/efeitos dos fármacos , Avaliação Pré-Clínica de Medicamentos/métodos , Perfilação da Expressão Gênica/métodos , Neoplasias Hepáticas/genética , Roedores , Transcriptoma/efeitos dos fármacos , Transcriptoma/genética , Regulação para Cima/efeitos dos fármacos , Regulação para Cima/genéticaRESUMO
MOTIVATION: Traditionally, microarrays were almost exclusively used for the genome-wide analysis of differential gene expression. But nowadays, their scope of application has been extended to various genomic features, such as microRNAs (miRNAs), proteins and DNA methylation (DNAm). Most available methods for the visualization of these datasets are focused on individual platforms and are not capable of integratively visualizing multiple microarray datasets from cross-platform studies. Above all, there is a demand for methods that can visualize genomic features that are not directly linked to protein-coding genes, such as regulatory RNAs (e.g. miRNAs) and epigenetic alterations (e.g. DNAm), in a pathway-centred manner. RESULTS: We present a novel pathway-based visualization method that is especially suitable for the visualization of high-throughput datasets from multiple different microarray platforms that were used for the analysis of diverse genomic features in the same set of biological samples. The proposed methodology includes concepts for linking DNAm and miRNA expression datasets to canonical signalling and metabolic pathways. We further point out strategies for displaying data from multiple proteins and protein modifications corresponding to the same gene. Ultimately, we show how data from four distinct platform types (messenger RNA, miRNA, protein and DNAm arrays) can be integratively visualized in the context of canonical pathways. AVAILABILITY: The described method is implemented as part of the InCroMAP application that is freely available at www.cogsys.cs.uni-tuebingen.de/software/InCroMAP. CONTACT: clemens.wrzodek@uni-tuebingen.de or andreas.zell@uni-tuebingen.de.
Assuntos
Gráficos por Computador , Análise em Microsséries/métodos , Software , Biologia Computacional/métodos , DNA/genética , Bases de Dados Genéticas , Perfilação da Expressão Gênica/métodos , Genômica/métodos , Redes e Vias Metabólicas , MicroRNAs/genética , Proteínas/genética , Proteômica/métodos , RNA Mensageiro/genéticaRESUMO
MOTIVATION: The biological pathway exchange language (BioPAX) and the systems biology markup language (SBML) belong to the most popular modeling and data exchange languages in systems biology. The focus of SBML is quantitative modeling and dynamic simulation of models, whereas the BioPAX specification concentrates mainly on visualization and qualitative analysis of pathway maps. BioPAX describes reactions and relations. In contrast, SBML core exclusively describes quantitative processes such as reactions. With the SBML qualitative models extension (qual), it has recently also become possible to describe relations in SBML. Before the development of SBML qual, relations could not be properly translated into SBML. Until now, there exists no BioPAX to SBML converter that is fully capable of translating both reactions and relations. RESULTS: The entire nature pathway interaction database has been converted from BioPAX (Level 2 and Level 3) into SBML (Level 3 Version 1) including both reactions and relations by using the new qual extension package. Additionally, we present the new webtool BioPAX2SBML for further BioPAX to SBML conversions. Compared with previous conversion tools, BioPAX2SBML is more comprehensive, more robust and more exact. AVAILABILITY: BioPAX2SBML is freely available at http://webservices.cs.uni-tuebingen.de/ and the complete collection of the PID models is available at http://www.cogsys.cs.uni-tuebingen.de/downloads/Qualitative-Models/.
Assuntos
Linguagens de Programação , Software , Biologia de Sistemas , Bases de Dados Factuais , Humanos , InternetRESUMO
BACKGROUND: Alternative splicing (AS) is a process which generates several distinct mRNA isoforms from the same gene by splicing different portions out of the precursor transcript. Due to the (patho-)physiological importance of AS, a complete inventory of AS is of great interest. While this is in reach for human and mammalian model organisms, our knowledge of AS in plants has remained more incomplete. Experimental approaches for monitoring AS are either based on transcript sequencing or rely on hybridization to DNA microarrays. Among the microarray platforms facilitating the discovery of AS events, tiling arrays are well-suited for identifying intron retention, the most prevalent type of AS in plants. However, analyzing tiling array data is challenging, because of high noise levels and limited probe coverage. RESULTS: In this work, we present a novel method to detect intron retentions (IR) and exon skips (ES) from tiling arrays. While statistical tests have typically been proposed for this purpose, our method instead utilizes support vector machines (SVMs) which are appreciated for their accuracy and robustness to noise. Existing EST and cDNA sequences served for supervised training and evaluation. Analyzing a large collection of publicly available microarray and sequence data for the model plant A. thaliana, we demonstrated that our method is more accurate than existing approaches. The method was applied in a genome-wide screen which resulted in the discovery of 1,355 IR events. A comparison of these IR events to the TAIR annotation and a large set of short-read RNA-seq data showed that 830 of the predicted IR events are novel and that 525 events (39%) overlap with either the TAIR annotation or the IR events inferred from the RNA-seq data. CONCLUSIONS: The method developed in this work expands the scarce repertoire of analysis tools for the identification of alternative mRNA splicing from whole-genome tiling arrays. Our predictions are highly enriched with known AS events and complement the A. thaliana genome annotation with respect to AS. Since all predicted AS events can be precisely attributed to experimental conditions, our work provides a basis for follow-up studies focused on the elucidation of the regulatory mechanisms underlying tissue-specific and stress-dependent AS in plants.
Assuntos
Processamento Alternativo , Arabidopsis/genética , Genoma de Planta , Íntrons , Análise de Sequência com Séries de Oligonucleotídeos/métodos , DNA Complementar/genética , Éxons , Etiquetas de Sequências Expressas , Análise de Sequência de RNARESUMO
Genome-wide association studies (GWAS) have identified over two hundred chromosomal loci that modulate risk of coronary artery disease (CAD). The genes affected by variants at these loci are largely unknown and an untapped resource to improve our understanding of CAD pathophysiology and identify potential therapeutic targets. Here, we prioritized 68 genes as the most likely causal genes at genome-wide significant loci identified by GWAS of CAD and examined their regulatory roles in 286 metabolic and vascular tissue gene-protein sub-networks ("modules"). The modules and genes within were scored for CAD druggability potential. The scoring enriched for targets of cardiometabolic drugs currently in clinical use and in-depth analysis of the top-scoring modules validated established and revealed novel target tissues, biological processes, and druggable targets. This study provides an unprecedented resource of tissue-defined gene-protein interactions directly affected by genetic variance in CAD risk loci.
Assuntos
Doença da Artéria Coronariana/genética , Redes Reguladoras de Genes , Doença da Artéria Coronariana/tratamento farmacológico , Descoberta de Drogas , Redes Reguladoras de Genes/efeitos dos fármacos , Predisposição Genética para Doença , Estudo de Associação Genômica Ampla , Humanos , Desequilíbrio de Ligação , Terapia de Alvo Molecular , Polimorfismo de Nucleotídeo Único/efeitos dos fármacos , Locos de Características Quantitativas/efeitos dos fármacosRESUMO
BACKGROUND: Despite a myriad of attempts in the last three decades to diagnose ovarian cancer (OC) earlier, this clinical aim still remains a significant challenge. Aberrant methylation patterns of linked CpGs analyzed in DNA fragments shed by cancers into the bloodstream (i.e. cell-free DNA) can provide highly specific signals indicating cancer presence. METHODS: We analyzed 699 cancerous and non-cancerous tissues using a methylation array or reduced representation bisulfite sequencing to discover the most specific OC methylation patterns. A three-DNA-methylation-serum-marker panel was developed using targeted ultra-high coverage bisulfite sequencing in 151 women and validated in 250 women with various conditions, particularly in those associated with high CA125 levels (endometriosis and other benign pelvic masses), serial samples from 25 patients undergoing neoadjuvant chemotherapy, and a nested case control study of 172 UKCTOCS control arm participants which included serum samples up to two years before OC diagnosis. RESULTS: The cell-free DNA amount and average fragment size in the serum samples was up to ten times higher than average published values (based on samples that were immediately processed) due to leakage of DNA from white blood cells owing to delayed time to serum separation. Despite this, the marker panel discriminated high grade serous OC patients from healthy women or patients with a benign pelvic mass with specificity/sensitivity of 90.7% (95% confidence interval [CI] = 84.3-94.8%) and 41.4% (95% CI = 24.1-60.9%), respectively. Levels of all three markers plummeted after exposure to chemotherapy and correctly identified 78% and 86% responders and non-responders (Fisher's exact test, p = 0.04), respectively, which was superior to a CA125 cut-off of 35 IU/mL (20% and 75%). 57.9% (95% CI 34.0-78.9%) of women who developed OC within two years of sample collection were identified with a specificity of 88.1% (95% CI = 77.3-94.3%). Sensitivity and specificity improved further when specifically analyzing CA125 negative samples only (63.6% and 87.5%, respectively). CONCLUSIONS: Our data suggest that DNA methylation patterns in cell-free DNA have the potential to detect a proportion of OCs up to two years in advance of diagnosis and may potentially guide personalized treatment. The prospective use of novel collection vials, which stabilize blood cells and reduce background DNA contamination in serum/plasma samples, will facilitate clinical implementation of liquid biopsy analyses.
Assuntos
Biomarcadores Tumorais/genética , Ácidos Nucleicos Livres/genética , Metilação de DNA , Neoplasias Ovarianas/genética , Adulto , Idoso , Biomarcadores Tumorais/sangue , Biomarcadores Tumorais/normas , Feminino , Humanos , Pessoa de Meia-Idade , Neoplasias Ovarianas/sangue , Distribuição Aleatória , Análise de Sequência de DNA/métodos , Análise de Sequência de DNA/normasRESUMO
BACKGROUND: Monitoring treatment and early detection of fatal breast cancer (BC) remains a major unmet need. Aberrant circulating DNA methylation (DNAme) patterns are likely to provide a highly specific cancer signal. We hypothesized that cell-free DNAme markers could indicate disseminated breast cancer, even in the presence of substantial quantities of background DNA. METHODS: We used reduced representation bisulfite sequencing (RRBS) of 31 tissues and established serum assays based on ultra-high coverage bisulfite sequencing in two independent prospective serum sets (n = 110). The clinical use of one specific region, EFC#93, was validated in 419 patients (in both pre- and post-adjuvant chemotherapy samples) from SUCCESS (Simultaneous Study of Gemcitabine-Docetaxel Combination adjuvant treatment, as well as Extended Bisphosphonate and Surveillance-Trial) and 925 women (pre-diagnosis) from the UKCTOCS (UK Collaborative Trial of Ovarian Cancer Screening) population cohort, with overall survival and occurrence of incident breast cancer (which will or will not lead to death), respectively, as primary endpoints. RESULTS: A total of 18 BC specific DNAme patterns were discovered in tissue, of which the top six were further tested in serum. The best candidate, EFC#93, was validated for clinical use. EFC#93 was an independent poor prognostic marker in pre-chemotherapy samples (hazard ratio [HR] for death = 7.689) and superior to circulating tumor cells (CTCs) (HR for death = 5.681). More than 70% of patients with both CTCs and EFC#93 serum DNAme positivity in their pre-chemotherapy samples relapsed within five years. EFC#93-positive disseminated disease in post-chemotherapy samples seems to respond to anti-hormonal treatment. The presence of EFC#93 serum DNAme identified 42.9% and 25% of women who were diagnosed with a fatal BC within 3-6 and 6-12 months of sample donation, respectively, with a specificity of 88%. The sensitivity with respect to detecting fatal BC was ~ 4-fold higher compared to non-fatal BC. CONCLUSIONS: Detection of EFC#93 serum DNAme patterns offers a new tool for early diagnosis and management of disseminated breast cancers. Clinical trials are required to assess whether EFC#93-positive women in the absence of radiological detectable breast cancers will benefit from anti-hormonal treatment before the breast lesions become clinically apparent.
Assuntos
Biomarcadores Tumorais/genética , Neoplasias da Mama/genética , Ácidos Nucleicos Livres/genética , Metilação de DNA , Adulto , Idoso , Biomarcadores Tumorais/sangue , Neoplasias da Mama/sangue , Neoplasias da Mama/patologia , Feminino , Humanos , Pessoa de Meia-Idade , Células Neoplásicas Circulantes/metabolismo , Células Neoplásicas Circulantes/patologiaRESUMO
Bioinformatics analysis has become an integral part of research in biology. However, installation and use of scientific software can be difficult and often requires technical expert knowledge. Reasons are dependencies on certain operating systems or required third-party libraries, missing graphical user interfaces and documentation, or nonstandard input and output formats. In order to make bioinformatics software easily accessible to researchers, we here present a web-based platform. The Center for Bioinformatics Tuebingen (ZBIT) Bioinformatics Toolbox provides web-based access to a collection of bioinformatics tools developed for systems biology, protein sequence annotation, and expression data analysis. Currently, the collection encompasses software for conversion and processing of community standards SBML and BioPAX, transcription factor analysis, and analysis of microarray data from transcriptomics and proteomics studies. All tools are hosted on a customized Galaxy instance and run on a dedicated computation cluster. Users only need a web browser and an active internet connection in order to benefit from this service. The web platform is designed to facilitate the usage of the bioinformatics tools for researchers without advanced technical background. Users can combine tools for complex analyses or use predefined, customizable workflows. All results are stored persistently and reproducible. For each tool, we provide documentation, tutorials, and example data to maximize usability. The ZBIT Bioinformatics Toolbox is freely available at https://webservices.cs.uni-tuebingen.de/.
Assuntos
Biologia Computacional/métodos , Expressão Gênica , Internet , Software , Estatística como Assunto , Biologia de Sistemas/métodos , Ceramidas/metabolismo , Humanos , Cinética , Modelos Biológicos , NF-kappa B/metabolismo , Transdução de Sinais , Fatores de Transcrição/metabolismoRESUMO
BACKGROUND: The size and complexity of published biochemical network reconstructions are steadily increasing, expanding the potential scale of derived computational models. However, the construction of large biochemical network models is a laborious and error-prone task. Automated methods have simplified the network reconstruction process, but building kinetic models for these systems is still a manually intensive task. Appropriate kinetic equations, based upon reaction rate laws, must be constructed and parameterized for each reaction. The complex test-and-evaluation cycles that can be involved during kinetic model construction would thus benefit from automated methods for rate law assignment. RESULTS: We present a high-throughput algorithm to automatically suggest and create suitable rate laws based upon reaction type according to several criteria. The criteria for choices made by the algorithm can be influenced in order to assign the desired type of rate law to each reaction. This algorithm is implemented in the software package SBMLsqueezer 2. In addition, this program contains an integrated connection to the kinetics database SABIO-RK to obtain experimentally-derived rate laws when desired. CONCLUSIONS: The described approach fills a heretofore absent niche in workflows for large-scale biochemical kinetic model construction. In several applications the algorithm has already been demonstrated to be useful and scalable. SBMLsqueezer is platform independent and can be used as a stand-alone package, as an integrated plugin, or through a web interface, enabling flexible solutions and use-case scenarios.
Assuntos
Modelos Biológicos , Software , Biologia de Sistemas/métodos , Algoritmos , Simulação por Computador , Cinética , Redes e Vias Metabólicas/fisiologiaRESUMO
The current gold-standard method for cancer safety assessment of drugs is a rodent two-year bioassay, which is associated with significant costs and requires testing a high number of animals over lifetime. Due to the absence of a comprehensive set of short-term assays predicting carcinogenicity, new approaches are currently being evaluated. One promising approach is toxicogenomics, which by virtue of genome-wide molecular profiling after compound treatment can lead to an increased mechanistic understanding, and potentially allow for the prediction of a carcinogenic potential via mathematical modeling. The latter typically involves the extraction of informative genes from omics datasets, which can be used to construct generalizable models allowing for the early classification of compounds with unknown carcinogenic potential. Here we formally describe and compare two novel methodologies for the reproducible extraction of characteristic mRNA signatures, which were employed to capture specific gene expression changes observed for nongenotoxic carcinogens. While the first method integrates multiple gene rankings, generated by diverse algorithms applied to data from different subsamplings of the training compounds, the second approach employs a statistical ratio for the identification of informative genes. Both methods were evaluated on a dataset obtained from the toxicogenomics database TG-GATEs to predict the outcome of a two-year bioassay based on profiles from 14-day treatments. Additionally, we applied our methods to datasets from previous studies and showed that the derived prediction models are on average more accurate than those built from the original signatures. The selected genes were mostly related to p53 signaling and to specific changes in anabolic processes or energy metabolism, which are typically observed in tumor cells. Among the genes most frequently incorporated into prediction models were Phlda3, Cdkn1a, Akr7a3, Ccng1 and Abcb4.
Assuntos
Algoritmos , Carcinógenos/toxicidade , Neoplasias Hepáticas/genética , RNA Mensageiro/genética , Toxicogenética/métodos , Transcriptoma , Animais , Proteínas Reguladoras de Apoptose/genética , Proteínas Reguladoras de Apoptose/metabolismo , Testes de Carcinogenicidade/métodos , Proteínas de Ciclo Celular/genética , Proteínas de Ciclo Celular/metabolismo , Bases de Dados Factuais , Avaliação Pré-Clínica de Medicamentos , Perfilação da Expressão Gênica , Fígado/efeitos dos fármacos , Fígado/metabolismo , Fígado/patologia , Neoplasias Hepáticas/induzido quimicamente , Neoplasias Hepáticas/metabolismo , Neoplasias Hepáticas/patologia , Modelos Genéticos , RNA Mensageiro/metabolismo , Ratos , Risco , Proteínas Supressoras de Tumor/genética , Proteínas Supressoras de Tumor/metabolismoRESUMO
In the area of omics profiling in toxicology, i.e. toxicogenomics, characteristic molecular profiles have previously been incorporated into prediction models for early assessment of a carcinogenic potential and mechanism-based classification of compounds. Traditionally, the biomarker signatures used for model construction were derived from individual high-throughput techniques, such as microarrays designed for monitoring global mRNA expression. In this study, we built predictive models by integrating omics data across complementary microarray platforms and introduced new concepts for modeling of pathway alterations and molecular interactions between multiple biological layers. We trained and evaluated diverse machine learning-based models, differing in the incorporated features and learning algorithms on a cross-omics dataset encompassing mRNA, miRNA, and protein expression profiles obtained from rat liver samples treated with a heterogeneous set of substances. Most of these compounds could be unambiguously classified as genotoxic carcinogens, non-genotoxic carcinogens, or non-hepatocarcinogens based on evidence from published studies. Since mixed characteristics were reported for the compounds Cyproterone acetate, Thioacetamide, and Wy-14643, we reclassified these compounds as either genotoxic or non-genotoxic carcinogens based on their molecular profiles. Evaluating our toxicogenomics models in a repeated external cross-validation procedure, we demonstrated that the prediction accuracy of our models could be increased by joining the biomarker signatures across multiple biological layers and by adding complex features derived from cross-platform integration of the omics data. Furthermore, we found that adding these features resulted in a better separation of the compound classes and a more confident reclassification of the three undefined compounds as non-genotoxic carcinogens.
Assuntos
Testes de Carcinogenicidade , Carcinógenos/química , Neoplasias Hepáticas Experimentais/metabolismo , Toxicogenética/métodos , Algoritmos , Animais , Área Sob a Curva , Inteligência Artificial , Análise por Conglomerados , Biologia Computacional , Perfilação da Expressão Gênica , Regulação Neoplásica da Expressão Gênica , Neoplasias Hepáticas Experimentais/induzido quimicamente , Masculino , Modelos Estatísticos , Análise Serial de Proteínas , RNA Mensageiro/metabolismo , Ratos , Ratos WistarRESUMO
In systems biology, the combination of multiple types of omics data, such as metabolomics, proteomics, transcriptomics, and genomics, yields more information on a biological process than the analysis of a single type of data. Thus, data from different omics platforms is usually combined in one experimental setup to obtain insight into a biological process or a disease state. Particularly high accuracy metabolomics data from modern mass spectrometry instruments is currently more and more integrated into biological studies. Reflecting this trend, we extended InCroMAP, a data integration, analysis and visualization tool for genomics, transcriptomics, and proteomics data. Now, the tool is able to perform an integrated enrichment analysis and pathway-based visualization of multi-omics data and thus, it is suitable for the evaluation of comprehensive systems biology studies.
Assuntos
Biologia Computacional/métodos , Sistemas de Gerenciamento de Base de Dados , Bases de Dados Genéticas , Software , Análise em Microsséries , Interface Usuário-ComputadorRESUMO
BACKGROUND AND SCOPE: Today, web-based data analysis pipelines exist for a wide variety of microarray platforms, such as ordinary gene-centered arrays, exon arrays and SNP arrays. However, most of the available software tools provide only limited support for reverse-phase protein arrays (RPPA), as relevant inherent properties of the corresponding datasets are not taken into account. Thus, we developed the web-based data analysis pipeline RPPApipe, which was specifically tailored to suit the characteristics of the RPPA platform and encompasses various tools for data preprocessing, statistical analysis, clustering and pathway analysis. IMPLEMENTATION AND PERFORMANCE: All tools which are part of the RPPApipe software were implemented using R/Bioconductor. The software was embedded into our web-based ZBIT Bioinformatics Toolbox which is a customized instance of the Galaxy platform. AVAILABILITY: RPPApipe is freely available under GNU Public License from http://webservices.cs.uni-tuebingen.de. A full documentation of the tool can be found on the corresponding website http://www.cogsys.cs.uni-tuebingen.de/software/RPPApipe.
Assuntos
Biologia Computacional/métodos , Interpretação Estatística de Dados , Análise Serial de Proteínas/métodos , Software , InternetRESUMO
Alternative splicing (AS) is an important mechanism implicated in eukaryotic gene expression, whereby exon segments of precursor-mRNA transcripts are joined together in different arrangements corresponding to diverse isoforms of mature mRNA. Accumulating evidence suggests that in many instances this process is specifically regulated and contributes to the structural and functional diversification of tissues and cell types. Furthermore, several studies support the view that environmental stresses dramatically impact on AS and reported the presence of novel transcript isoforms in response to biotic or abiotic stresses. Since specific regulation of AS in plants is a largely unexplored field of research, large-scale approaches aimed at monitoring AS on a genome-wide level are of increasing importance to gain insights into tissue-specific splicing regulation and to study the effects of changed environmental conditions on pre-mRNA splicing.Here, we describe the concepts of a traditional statistical approach, and a more recently developed machine learning-based method for AS detection from tiling arrays. The here presented approaches were employed for the detection and profiling of AS events in the model plant A. thaliana, and applied to a large dataset comprising transcriptomic expression data from 11 tissues and 13 stress conditions.