RESUMO
The cMonkey integrated biclustering algorithm identifies conditionally co-regulated modules of genes (biclusters). cMonkey integrates various orthogonal pieces of information which support evidence of gene co-regulation, and optimizes biclusters to be supported simultaneously by one or more of these prior constraints. The algorithm served as the cornerstone for constructing the first global, predictive Environmental Gene Regulatory Influence Network (EGRIN) model for a free-living cell, and has now been applied to many more organisms. However, due to its computational inefficiencies, long run-time and complexity of various input data types, cMonkey was not readily usable by the wider community. To address these primary concerns, we have significantly updated the cMonkey algorithm and refactored its implementation, improving its usability and extendibility. These improvements provide a fully functioning and user-friendly platform for building co-regulated gene modules and the tools necessary for their exploration and interpretation. We show, via three separate analyses of data for E. coli, M. tuberculosis and H. sapiens, that the updated algorithm and inclusion of novel scoring functions for new data types (e.g. ChIP-seq and transcription factor over-expression [TFOE]) improve discovery of biologically informative co-regulated modules. The complete cMonkey2 software package, including source code, is available at https://github.com/baliga-lab/cmonkey2.
Assuntos
Regulação da Expressão Gênica , Software , Algoritmos , Carcinoma de Células Escamosas/genética , Imunoprecipitação da Cromatina , Escherichia coli/genética , Regulação Bacteriana da Expressão Gênica , Neoplasias Pulmonares/genética , Mycobacterium tuberculosis/genética , Regulon , Análise de Sequência de DNA , Fatores de Transcrição/metabolismoRESUMO
The ease of generating high-throughput data has enabled investigations into organismal complexity at the systems level through the inference of networks of interactions among the various cellular components (genes, RNAs, proteins and metabolites). The wider scientific community, however, currently has limited access to tools for network inference, visualization and analysis because these tasks often require advanced computational knowledge and expensive computing resources. We have designed the network portal (http://networks.systemsbiology.net) to serve as a modular database for the integration of user uploaded and public data, with inference algorithms and tools for the storage, visualization and analysis of biological networks. The portal is fully integrated into the Gaggle framework to seamlessly exchange data with desktop and web applications and to allow the user to create, save and modify workspaces, and it includes social networking capabilities for collaborative projects. While the current release of the database contains networks for 13 prokaryotic organisms from diverse phylogenetic clades (4678 co-regulated gene modules, 3466 regulators and 9291 cis-regulatory motifs), it will be rapidly populated with prokaryotic and eukaryotic organisms as relevant data become available in public repositories and through user input. The modular architecture, simple data formats and open API support community development of the portal.
Assuntos
Bases de Dados Genéticas , Redes Reguladoras de Genes , Algoritmos , Archaea/genética , Archaea/metabolismo , Bactérias/genética , Bactérias/metabolismo , Gráficos por Computador , Perfilação da Expressão Gênica , Internet , Motivos de Nucleotídeos , Elementos Reguladores de Transcrição , Software , Integração de Sistemas , Fatores de Transcrição/metabolismoRESUMO
Microbes can tailor transcriptional responses to diverse environmental challenges despite having streamlined genomes and a limited number of regulators. Here, we present data-driven models that capture the dynamic interplay of the environment and genome-encoded regulatory programs of two types of prokaryotes: Escherichia coli (a bacterium) and Halobacterium salinarum (an archaeon). The models reveal how the genome-wide distributions of cis-acting gene regulatory elements and the conditional influences of transcription factors at each of those elements encode programs for eliciting a wide array of environment-specific responses. We demonstrate how these programs partition transcriptional regulation of genes within regulons and operons to re-organize gene-gene functional associations in each environment. The models capture fitness-relevant co-regulation by different transcriptional control mechanisms acting across the entire genome, to define a generalized, system-level organizing principle for prokaryotic gene regulatory networks that goes well beyond existing paradigms of gene regulation. An online resource (http://egrin2.systemsbiology.net) has been developed to facilitate multiscale exploration of conditional gene regulation in the two prokaryotes.
Assuntos
Redes Reguladoras de Genes , Genoma Microbiano , Modelos Genéticos , Algoritmos , Escherichia coli/genética , Regulação da Expressão Gênica , Aptidão Genética , Halobacterium salinarum/genética , Óperon , Elementos Reguladores de Transcrição , RegulonRESUMO
Poor prognosis and drug resistance in glioblastoma (GBM) can result from cellular heterogeneity and treatment-induced shifts in phenotypic states of tumor cells, including dedifferentiation into glioma stem-like cells (GSCs). This rare tumorigenic cell subpopulation resists temozolomide, undergoes proneural-to-mesenchymal transition (PMT) to evade therapy, and drives recurrence. Through inference of transcriptional regulatory networks (TRNs) of patient-derived GSCs (PD-GSCs) at single-cell resolution, we demonstrate how the topology of transcription factor interaction networks drives distinct trajectories of cell state transitions in PD-GSCs resistant or susceptible to cytotoxic drug treatment. By experimentally testing predictions based on TRN simulations, we show that drug treatment drives surviving PD-GSCs along a trajectory of intermediate states, exposing vulnerability to potentiated killing by siRNA or a second drug targeting treatment-induced transcriptional programs governing non-genetic cell plasticity. Our findings demonstrate an approach to uncover TRN topology and use it to rationally predict combinatorial treatments that disrupts acquired resistance in GBM.
RESUMO
Poor prognosis and drug resistance in glioblastoma (GBM) can result from cellular heterogeneity and treatment-induced shifts in phenotypic states of tumor cells, including dedifferentiation into glioma stem-like cells (GSCs). This rare tumorigenic cell subpopulation resists temozolomide, undergoes proneural-to-mesenchymal transition (PMT) to evade therapy, and drives recurrence. Through inference of transcriptional regulatory networks (TRNs) of patient-derived GSCs (PD-GSCs) at single-cell resolution, we demonstrate how the topology of transcription factor interaction networks drives distinct trajectories of cell-state transitions in PD-GSCs resistant or susceptible to cytotoxic drug treatment. By experimentally testing predictions based on TRN simulations, we show that drug treatment drives surviving PD-GSCs along a trajectory of intermediate states, exposing vulnerability to potentiated killing by siRNA or a second drug targeting treatment-induced transcriptional programs governing nongenetic cell plasticity. Our findings demonstrate an approach to uncover TRN topology and use it to rationally predict combinatorial treatments that disrupt acquired resistance in GBM.
Assuntos
Resistencia a Medicamentos Antineoplásicos , Regulação Neoplásica da Expressão Gênica , Redes Reguladoras de Genes , Glioma , Células-Tronco Neoplásicas , Humanos , Células-Tronco Neoplásicas/metabolismo , Células-Tronco Neoplásicas/efeitos dos fármacos , Células-Tronco Neoplásicas/patologia , Resistencia a Medicamentos Antineoplásicos/genética , Glioma/genética , Glioma/patologia , Glioma/metabolismo , Glioma/tratamento farmacológico , Temozolomida/farmacologia , Neoplasias Encefálicas/genética , Neoplasias Encefálicas/patologia , Neoplasias Encefálicas/metabolismo , Neoplasias Encefálicas/tratamento farmacológico , Linhagem Celular Tumoral , Glioblastoma/genética , Glioblastoma/patologia , Glioblastoma/metabolismo , Glioblastoma/tratamento farmacológicoRESUMO
The success of Mycobacterium tuberculosis (Mtb) is largely attributed to its ability to physiologically adapt and withstand diverse localized stresses within host microenvironments. Here, we present a data-driven model (EGRIN 2.0) that captures the dynamic interplay of environmental cues and genome-encoded regulatory programs in Mtb. Analysis of EGRIN 2.0 shows how modulation of the MtrAB two-component signaling system tunes Mtb growth in response to related host microenvironmental cues. Disruption of MtrAB by tunable CRISPR interference confirms that the signaling system regulates multiple peptidoglycan hydrolases, among other targets, that are important for cell division. Further, MtrA decreases the effectiveness of antibiotics by mechanisms of both intrinsic resistance and drug tolerance. Together, the model-enabled dissection of complex MtrA regulation highlights its importance as a drug target and illustrates how EGRIN 2.0 facilitates discovery and mechanistic characterization of Mtb adaptation to specific host microenvironments within the host.
Assuntos
Mycobacterium tuberculosis , Fatores de Transcrição , Fatores de Transcrição/genética , Proteínas de Bactérias/genética , Divisão Celular , Tolerância a MedicamentosRESUMO
The scale of post-transcriptional regulation and the implications of its interplay with other forms of regulation in environmental acclimation are underexplored for organisms of the domain Archaea. Here, we have investigated the scale of post-transcriptional regulation in the extremely halophilic archaeon Halobacterium salinarum NRC-1 by integrating the transcriptome-wide locations of transcript processing sites (TPSs) and SmAP1 binding, the genome-wide locations of antisense RNAs (asRNAs), and the consequences of RNase_2099C knockout on the differential expression of all genes. This integrated analysis has discovered that 54% of all protein-coding genes in the genome of this haloarchaeon are likely targeted by multiple mechanisms for putative post-transcriptional processing and regulation, with about 20% of genes likely being regulated by combinatorial schemes involving SmAP1, asRNAs, and RNase_2099C. Comparative analysis of mRNA levels (transcriptome sequencing [RNA-Seq]) and protein levels (sequential window acquisition of all theoretical fragment ion spectra mass spectrometry [SWATH-MS]) for 2,579 genes over four phases of batch culture growth in complex medium generated additional evidence for the conditional post-transcriptional regulation of 7% of all protein-coding genes. We demonstrate that post-transcriptional regulation may act to fine-tune specialized and rapid acclimation to stressful environments, e.g., as a switch to turn on gas vesicle biogenesis to promote vertical relocation under anoxic conditions and modulate the frequency of transposition by insertion sequence (IS) elements of the IS200/IS605, IS4, and ISH3 families. Findings from this study are provided as an atlas in a public Web resource (https://halodata.systemsbiology.net). IMPORTANCE While the transcriptional regulation landscape of archaea has been extensively investigated, we currently have limited knowledge about post-transcriptional regulation and its driving mechanisms in this domain of life. In this study, we collected and integrated omics data from multiple sources and technologies to infer post-transcriptionally regulated genes and the putative mechanisms modulating their expression at the protein level in Halobacterium salinarum NRC-1. The results suggest that post-transcriptional regulation may drive environmental acclimation by regulating hallmark biological processes. To foster discoveries by other research groups interested in the topic, we extended our integrated data to the public in the form of an interactive atlas (https://halodata.systemsbiology.net).
Assuntos
Archaea , Transcriptoma , Humanos , Archaea/genética , Transcriptoma/genética , Genoma , RNA Antissenso/genética , Ribonucleases/genéticaRESUMO
Glioblastoma (GBM) is a heterogeneous tumor made up of cell states that evolve over time. Here, we modeled tumor evolutionary trajectories during standard-of-care treatment using multi-omic single-cell analysis of a primary tumor sample, corresponding mouse xenografts subjected to standard of care therapy, and recurrent tumor at autopsy. We mined the multi-omic data with single-cell SYstems Genetics Network AnaLysis (scSYGNAL) to identify a network of 52 regulators that mediate treatment-induced shifts in xenograft tumor-cell states that were also reflected in recurrence. By integrating scSYGNAL-derived regulatory network information with transcription factor accessibility deviations derived from single-cell ATAC-seq data, we developed consensus networks that modulate cell state transitions across subpopulations of primary and recurrent tumor cells. Finally, by matching targeted therapies to active regulatory networks underlying tumor evolutionary trajectories, we provide a framework for applying single-cell-based precision medicine approaches to an individual patient in a concurrent, adjuvant, or recurrent setting.
RESUMO
We present predictive models for comprehensive systems analysis of Clostridioides difficile, the etiology of pseudomembranous colitis. By leveraging 151 published transcriptomes, we generated an EGRIN model that organizes 90% of C. difficile genes into a transcriptional regulatory network of 297 co-regulated modules, implicating genes in sporulation, carbohydrate transport, and metabolism. By advancing a metabolic model through addition and curation of metabolic reactions including nutrient uptake, we discovered 14 amino acids, diverse carbohydrates, and 10 metabolic genes as essential for C. difficile growth in the intestinal environment. Finally, we developed a PRIME model to uncover how EGRIN-inferred combinatorial gene regulation by transcription factors, such as CcpA and CodY, modulates essential metabolic processes to enable C. difficile growth relative to commensal colonization. The C. difficile interactive web portal provides access to these model resources to support collaborative systems-level studies of context-specific virulence mechanisms in C. difficile.
Assuntos
Clostridioides difficile , Proteínas de Bactérias/genética , Proteínas de Bactérias/metabolismo , Clostridioides , Clostridioides difficile/genética , Regulação Bacteriana da Expressão Gênica , Redes e Vias Metabólicas/genética , Análise de SistemasRESUMO
Despite recent advancements in the treatment of multiple myeloma (MM), nearly all patients ultimately relapse and many become refractory to multiple lines of therapies. Therefore, we not only need the ability to predict which patients are at high risk for disease progression but also a means to understand the mechanisms underlying their risk. Here, we report a transcriptional regulatory network (TRN) for MM inferred from cross-sectional multi-omics data from 881 patients that predicts how 124 chromosomal abnormalities and somatic mutations causally perturb 392 transcription regulators of 8549 genes to manifest in distinct clinical phenotypes and outcomes. We identified 141 genetic programs whose activity profiles stratify patients into 25 distinct transcriptional states and proved to be more predictive of outcomes than did mutations. The coherence of these programs and accuracy of our network-based risk prediction was validated in two independent datasets. We observed subtype-specific vulnerabilities to interventions with existing drugs and revealed plausible mechanisms for relapse, including the establishment of an immunosuppressive microenvironment. Investigation of the t(4;14) clinical subtype using the TRN revealed that 16% of these patients exhibit an extreme-risk combination of genetic programs (median progression-free survival of 5 months) that create a distinct phenotype with targetable genes and pathways.
RESUMO
BACKGROUND: There is no detailed report about the angiographic leakage of polypoidal choroidal vasculopathy (PCV) lesions on indocyanine green (ICG) angiography. This study aimed to investigate the angiographic leakage of polypoidal lesions in PCV on ICG angiography. METHODS: One hundred and forty-four eyes of 137 patients diagnosed as PCV were prospectively observed. Fundus examination, fluorescein angiography, and ICG angiography were performed. Leakage of polypoidal lesions and clinical features were recorded according to the angiograms. RESULTS: In all 144 eyes, 110 eyes showed angiographic leakage (leakage group) on ICG angiography and three subtypes of leakage group were noted, which were polypoidal dilations leakage (47 eyes, 42.7%), branching vascular networks leakage (14 eyes, 12.7%) and leakage of both (49 eyes, 44.5%). The other 34 eyes showed regression of polypoidal lesions (regression group). In leakage group, the rates of pigment epithelial detachment (PED), best corrected visual acuity (BCVA) < 0.1 and old subretinal hemorrhages were 56.4% (62 eyes), 19.1% (21 eyes), and 4.6% (5 eyes) respectively, compared with 8.8% (3 eyes), 50% (17 eyes) and 38.2% (13 eyes) of regression group (P < 0.001). The history of regression group was significantly longer (P < 0.001). CONCLUSIONS: Angiographic leakage and regression can be observed in PCV lesions. Leakage of both polypoidal dilations and branching vascular networks is the most common subtype in leakage group. PCV in leakage group is more likely to be related to PED, better BCVA and shorter history, while PCV regression group tends to relevant to old subretinal hemorrhage, worse BCVA and longer history. This may reflect that the former is active or in the early course while the later is resting or in the late phase of PCV.