Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 5 de 5
Filtrar
1.
Bioinformatics ; 38(9): 2519-2528, 2022 04 28.
Artículo en Inglés | MEDLINE | ID: mdl-35188184

RESUMEN

MOTIVATION: Gene regulatory networks define regulatory relationships between transcription factors and target genes within a biological system, and reconstructing them is essential for understanding cellular growth and function. Methods for inferring and reconstructing networks from genomics data have evolved rapidly over the last decade in response to advances in sequencing technology and machine learning. The scale of data collection has increased dramatically; the largest genome-wide gene expression datasets have grown from thousands of measurements to millions of single cells, and new technologies are on the horizon to increase to tens of millions of cells and above. RESULTS: In this work, we present the Inferelator 3.0, which has been significantly updated to integrate data from distinct cell types to learn context-specific regulatory networks and aggregate them into a shared regulatory network, while retaining the functionality of the previous versions. The Inferelator is able to integrate the largest single-cell datasets and learn cell-type-specific gene regulatory networks. Compared to other network inference methods, the Inferelator learns new and informative Saccharomyces cerevisiae networks from single-cell gene expression data, measured by recovery of a known gold standard. We demonstrate its scaling capabilities by learning networks for multiple distinct neuronal and glial cell types in the developing Mus musculus brain at E18 from a large (1.3 million) single-cell gene expression dataset with paired single-cell chromatin accessibility data. AVAILABILITY AND IMPLEMENTATION: The inferelator software is available on GitHub (https://github.com/flatironinstitute/inferelator) under the MIT license and has been released as python packages with associated documentation (https://inferelator.readthedocs.io/). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Redes Reguladoras de Genes , Programas Informáticos , Animales , Ratones , Genómica , Genoma , Cromatina
2.
Genome Res ; 29(3): 449-463, 2019 03.
Artículo en Inglés | MEDLINE | ID: mdl-30696696

RESUMEN

Transcriptional regulatory networks (TRNs) provide insight into cellular behavior by describing interactions between transcription factors (TFs) and their gene targets. The assay for transposase-accessible chromatin (ATAC)-seq, coupled with TF motif analysis, provides indirect evidence of chromatin binding for hundreds of TFs genome-wide. Here, we propose methods for TRN inference in a mammalian setting, using ATAC-seq data to improve gene expression modeling. We test our methods in the context of T Helper Cell Type 17 (Th17) differentiation, generating new ATAC-seq data to complement existing Th17 genomic resources. In this resource-rich mammalian setting, our extensive benchmarking provides quantitative, genome-scale evaluation of TRN inference, combining ATAC-seq and RNA-seq data. We refine and extend our previous Th17 TRN, using our new TRN inference methods to integrate all Th17 data (gene expression, ATAC-seq, TF knockouts, and ChIP-seq). We highlight newly discovered roles for individual TFs and groups of TFs ("TF-TF modules") in Th17 gene regulation. Given the popularity of ATAC-seq, which provides high-resolution with low sample input requirements, we anticipate that our methods will improve TRN inference in new mammalian systems, especially in vivo, for cells directly from humans and animal models.


Asunto(s)
Cromatina/genética , Redes Reguladoras de Genes , Células Th17/metabolismo , Factores de Transcripción/metabolismo , Diferenciación Celular , Cromatina/química , Ensamble y Desensamble de Cromatina , Humanos , Unión Proteica , Programas Informáticos , Células Th17/citología
3.
PLoS Comput Biol ; 15(1): e1006591, 2019 01.
Artículo en Inglés | MEDLINE | ID: mdl-30677040

RESUMEN

Gene regulatory networks are composed of sub-networks that are often shared across biological processes, cell-types, and organisms. Leveraging multiple sources of information, such as publicly available gene expression datasets, could therefore be helpful when learning a network of interest. Integrating data across different studies, however, raises numerous technical concerns. Hence, a common approach in network inference, and broadly in genomics research, is to separately learn models from each dataset and combine the results. Individual models, however, often suffer from under-sampling, poor generalization and limited network recovery. In this study, we explore previous integration strategies, such as batch-correction and model ensembles, and introduce a new multitask learning approach for joint network inference across several datasets. Our method initially estimates the activities of transcription factors, and subsequently, infers the relevant network topology. As regulatory interactions are context-dependent, we estimate model coefficients as a combination of both dataset-specific and conserved components. In addition, adaptive penalties may be used to favor models that include interactions derived from multiple sources of prior knowledge including orthogonal genomics experiments. We evaluate generalization and network recovery using examples from Bacillus subtilis and Saccharomyces cerevisiae, and show that sharing information across models improves network reconstruction. Finally, we demonstrate robustness to both false positives in the prior information and heterogeneity among datasets.


Asunto(s)
Biología Computacional/métodos , Regulación de la Expresión Génica/genética , Redes Reguladoras de Genes/genética , Modelos Genéticos , Bacillus subtilis/genética , Bases de Datos Genéticas , Saccharomyces cerevisiae/genética
4.
Elife ; 92020 01 27.
Artículo en Inglés | MEDLINE | ID: mdl-31985403

RESUMEN

Understanding how gene expression programs are controlled requires identifying regulatory relationships between transcription factors and target genes. Gene regulatory networks are typically constructed from gene expression data acquired following genetic perturbation or environmental stimulus. Single-cell RNA sequencing (scRNAseq) captures the gene expression state of thousands of individual cells in a single experiment, offering advantages in combinatorial experimental design, large numbers of independent measurements, and accessing the interaction between the cell cycle and environmental responses that is hidden by population-level analysis of gene expression. To leverage these advantages, we developed a method for scRNAseq in budding yeast (Saccharomyces cerevisiae). We pooled diverse transcriptionally barcoded gene deletion mutants in 11 different environmental conditions and determined their expression state by sequencing 38,285 individual cells. We benchmarked a framework for learning gene regulatory networks from scRNAseq data that incorporates multitask learning and constructed a global gene regulatory network comprising 12,228 interactions.


Organisms switch their genes on and off to adapt to changing environments. This takes place thanks to complex networks of regulators that control which genes are actively 'read' by the cell to create the RNA molecules that are needed at the time. Piecing together these networks is key to fully understand the inner workings of living organisms, and how to potentially modify or artificially create them. Single-cell RNA sequencing is a powerful new tool that can measure which genes are turned on (or 'expressed') in an individual cell. Datasets with millions of gene expression profiles for individual cells now exist for organisms such as mice or humans. Yet, it is difficult to use these data to reconstruct networks of regulators; this is partly because scientists are not sure if the computational methods normally used to build these networks also work for single-cell RNA sequencing data. One way to check if this is the case is to use the methods on single-cell datasets from organisms where the networks of regulators are already known, and check whether the computational tools help to reach the same conclusion. Unfortunately, the regulatory networks in the organisms for which scientists have a lot of single-cell RNA sequencing data are still poorly known. There are living beings in which the networks are well characterised ­ such as yeast ­ but it has been difficult to do single-cell sequencing in them at the scale seen in other organisms. Jackson, Castro et al. first adapted a system for single-cell sequencing so that it would work in yeast. This generated a gene expression dataset of over 40,000 yeast cells. They then used a computational method (called the Inferelator) on these data to construct networks of regulators, and the results showed that the method performed well. This allowed Jackson, Castro et al. to start mapping how different networks connect, for example those that control the response to the environment and cell division. This is one of the benefits of single-cell RNA methods: cell division for example is not a process that can be examined at the level of a population, since the cells may all be at different life stages. In the future, the dataset will also be useful to scientists to benchmark a variety of single cell computational tools.


Asunto(s)
Código de Barras del ADN Taxonómico , Redes Reguladoras de Genes , Genotipo , Saccharomyces cerevisiae/genética , Análisis de Secuencia de ARN/métodos , Análisis de la Célula Individual/métodos , Eliminación de Gen , Regulación Fúngica de la Expresión Génica , Genes Fúngicos , Saccharomyces cerevisiae/metabolismo , Factores de Transcripción/metabolismo
5.
Cancer Cell ; 37(1): 37-54.e9, 2020 01 13.
Artículo en Inglés | MEDLINE | ID: mdl-31883968

RESUMEN

Cyclin-dependent kinase 7 (CDK7) is a central regulator of the cell cycle and gene transcription. However, little is known about its impact on genomic instability and cancer immunity. Using a selective CDK7 inhibitor, YKL-5-124, we demonstrated that CDK7 inhibition predominately disrupts cell-cycle progression and induces DNA replication stress and genome instability in small cell lung cancer (SCLC) while simultaneously triggering immune-response signaling. These tumor-intrinsic events provoke a robust immune surveillance program elicited by T cells, which is further enhanced by the addition of immune-checkpoint blockade. Combining YKL-5-124 with anti-PD-1 offers significant survival benefit in multiple highly aggressive murine models of SCLC, providing a rationale for new combination regimens consisting of CDK7 inhibitors and immunotherapies.


Asunto(s)
Quinasas Ciclina-Dependientes/antagonistas & inhibidores , Quinasas Ciclina-Dependientes/genética , Inestabilidad Genómica , Neoplasias Pulmonares/genética , Carcinoma Pulmonar de Células Pequeñas/genética , Animales , Antineoplásicos/farmacología , Linfocitos T CD4-Positivos/citología , Linfocitos T CD8-positivos/citología , Quimiocina CXCL9/metabolismo , Daño del ADN , Femenino , Humanos , Sistema Inmunológico , Inflamación , Interferón gamma/metabolismo , Neoplasias Pulmonares/tratamiento farmacológico , Neoplasias Pulmonares/inmunología , Masculino , Ratones , Pruebas de Micronúcleos , Receptor de Muerte Celular Programada 1/antagonistas & inhibidores , Pirazoles/farmacología , Pirroles/farmacología , Transducción de Señal , Carcinoma Pulmonar de Células Pequeñas/tratamiento farmacológico , Carcinoma Pulmonar de Células Pequeñas/inmunología , Factor de Necrosis Tumoral alfa/metabolismo , Quinasa Activadora de Quinasas Ciclina-Dependientes
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA