Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 40
Filtrar
Mais filtros

Base de dados
Tipo de documento
Intervalo de ano de publicação
1.
Nat Immunol ; 24(10): 1735-1747, 2023 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-37679549

RESUMO

Neurodegenerative diseases, including Alzheimer's disease (AD), are characterized by innate immune-mediated inflammation, but functional and mechanistic effects of the adaptive immune system remain unclear. Here we identify brain-resident CD8+ T cells that coexpress CXCR6 and PD-1 and are in proximity to plaque-associated microglia in human and mouse AD brains. We also establish that CD8+ T cells restrict AD pathologies, including ß-amyloid deposition and cognitive decline. Ligand-receptor interaction analysis identifies CXCL16-CXCR6 intercellular communication between microglia and CD8+ T cells. Further, Cxcr6 deficiency impairs accumulation, tissue residency programming and clonal expansion of brain PD-1+CD8+ T cells. Ablation of Cxcr6 or CD8+ T cells ultimately increases proinflammatory cytokine production from microglia, with CXCR6 orchestrating brain CD8+ T cell-microglia colocalization. Collectively, our study reveals protective roles for brain CD8+ T cells and CXCR6 in mouse AD pathogenesis and highlights that microenvironment-specific, intercellular communication orchestrates tissue homeostasis and protection from neuroinflammation.

2.
J Cell Sci ; 135(10)2022 05 15.
Artigo em Inglês | MEDLINE | ID: mdl-35502723

RESUMO

The mammary gland epithelial tree contains two distinct cell populations, luminal and basal. The investigation of how this heterogeneity is developed and how it influences tumorigenesis has been hampered by the need to perform studies on these populations using animal models. Comma-1D is an immortalized mouse mammary epithelial cell line that has unique morphogenetic properties. By performing single-cell RNA-seq studies, we found that Comma-1D cultures consist of two main populations with luminal and basal features, and a smaller population with mixed lineage and bipotent characteristics. We demonstrated that multiple transcription factors associated with the differentiation of the mammary epithelium in vivo also modulate this process in Comma-1D cultures. Additionally, we found that only cells with luminal features were able to acquire transformed characteristics after an oncogenic HER2 (also known as ERBB2) mutant was introduced in their genomes. Overall, our studies characterize, at a single-cell level, the heterogeneity of the Comma-1D cell line and illustrate how Comma-1D cells can be used as an experimental model to study both the differentiation and the transformation processes in vitro.


Assuntos
Neoplasias da Mama , Linhagem Celular , Glândulas Mamárias Animais , Animais , Neoplasias da Mama/genética , Neoplasias da Mama/metabolismo , Células Epiteliais , Feminino , Glândulas Mamárias Animais/citologia , Camundongos , Análise de Célula Única
3.
Nat Methods ; 17(8): 807-814, 2020 08.
Artigo em Inglês | MEDLINE | ID: mdl-32737473

RESUMO

Enhancers are important non-coding elements, but they have traditionally been hard to characterize experimentally. The development of massively parallel assays allows the characterization of large numbers of enhancers for the first time. Here, we developed a framework using Drosophila STARR-seq to create shape-matching filters based on meta-profiles of epigenetic features. We integrated these features with supervised machine-learning algorithms to predict enhancers. We further demonstrated that our model could be transferred to predict enhancers in mammals. We comprehensively validated the predictions using a combination of in vivo and in vitro approaches, involving transgenic assays in mice and transduction-based reporter assays in human cell lines (153 enhancers in total). The results confirmed that our model can accurately predict enhancers in different species without re-parameterization. Finally, we examined the transcription factor binding patterns at predicted enhancers versus promoters. We demonstrated that these patterns enable the construction of a secondary model that effectively distinguishes enhancers and promoters.


Assuntos
Epigênese Genética/fisiologia , Reconhecimento Automatizado de Padrão/métodos , Animais , Linhagem Celular , Drosophila , Histonas/genética , Histonas/metabolismo , Humanos , Camundongos , Camundongos Transgênicos , Reprodutibilidade dos Testes
4.
EMBO Rep ; 22(12): e53201, 2021 12 06.
Artigo em Inglês | MEDLINE | ID: mdl-34633138

RESUMO

During the female lifetime, the expansion of the epithelium dictated by the ovarian cycles is supported by a transient increase in the mammary epithelial stem cell population (MaSCs). Notably, activation of Wnt/ß-catenin signaling is an important trigger for MaSC expansion. Here, we report that the miR-424/503 cluster is a modulator of canonical Wnt signaling in the mammary epithelium. We show that mammary tumors of miR-424(322)/503-depleted mice exhibit activated Wnt/ß-catenin signaling. Importantly, we show a strong association between miR-424/503 deletion and breast cancers with high levels of Wnt/ß-catenin signaling. Moreover, miR-424/503 cluster is required for Wnt-mediated MaSC expansion induced by the ovarian cycles. Lastly, we show that miR-424/503 exerts its function by targeting two binding sites at the 3'UTR of the LRP6 co-receptor and reducing its expression. These results unveil an unknown link between the miR-424/503, regulation of Wnt signaling, MaSC fate, and tumorigenesis.


Assuntos
Epitélio , Proteína-6 Relacionada a Receptor de Lipoproteína de Baixa Densidade , Glândulas Mamárias Animais/citologia , MicroRNAs , Via de Sinalização Wnt , Animais , Neoplasias da Mama , Carcinogênese , Linhagem Celular Tumoral , Células Epiteliais/citologia , Epitélio/metabolismo , Feminino , Proteína-6 Relacionada a Receptor de Lipoproteína de Baixa Densidade/genética , Proteína-6 Relacionada a Receptor de Lipoproteína de Baixa Densidade/metabolismo , Ciclo Menstrual , Camundongos , MicroRNAs/genética , Células-Tronco/citologia
5.
BMC Bioinformatics ; 21(1): 222, 2020 May 29.
Artigo em Inglês | MEDLINE | ID: mdl-32471347

RESUMO

BACKGROUND: Genome-wide ligation-based assays such as Hi-C provide us with an unprecedented opportunity to investigate the spatial organization of the genome. Results of a typical Hi-C experiment are often summarized in a chromosomal contact map, a matrix whose elements reflect the co-location frequencies of genomic loci. To elucidate the complex structural and functional interactions between those genomic loci, networks offer a natural and powerful framework. RESULTS: We propose a novel graph-theoretical framework, the Corrected Gene Proximity (CGP) map to study the effect of the 3D spatial organization of genes in transcriptional regulation. The starting point of the CGP map is a weighted network, the gene proximity map, whose weights are based on the contact frequencies between genes extracted from genome-wide Hi-C data. We derive a null model for the network based on the signal contributed by the 1D genomic distance and use it to "correct" the gene proximity for cell type 3D specific arrangements. The CGP map, therefore, provides a network framework for the 3D structure of the genome on a global scale. On human cell lines, we show that the CGP map can detect and quantify gene co-regulation and co-localization more effectively than the map obtained by raw contact frequencies. Analyzing the expression pattern of metabolic pathways of two hematopoietic cell lines, we find that the relative positioning of the genes, as captured and quantified by the CGP, is highly correlated with their expression change. We further show that the CGP map can be used to form an inter-chromosomal proximity map that allows large-scale abnormalities, such as chromosomal translocations, to be identified. CONCLUSIONS: The Corrected Gene Proximity map is a map of the 3D structure of the genome on a global scale. It allows the simultaneous analysis of intra- and inter- chromosomal interactions and of gene co-regulation and co-localization more effectively than the map obtained by raw contact frequencies, thus revealing hidden associations between global spatial positioning and gene expression. The flexible graph-based formalism of the CGP map can be easily generalized to study any existing Hi-C datasets.


Assuntos
Cromossomos Humanos , Regulação da Expressão Gênica , Genoma Humano , Linhagem Celular , Genômica/métodos , Humanos , Redes e Vias Metabólicas/genética
6.
Nature ; 512(7515): 445-8, 2014 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-25164755

RESUMO

The transcriptome is the readout of the genome. Identifying common features in it across distant species can reveal fundamental principles. To this end, the ENCODE and modENCODE consortia have generated large amounts of matched RNA-sequencing data for human, worm and fly. Uniform processing and comprehensive annotation of these data allow comparison across metazoan phyla, extending beyond earlier within-phylum transcriptome comparisons and revealing ancient, conserved features. Specifically, we discover co-expression modules shared across animals, many of which are enriched in developmental genes. Moreover, we use expression patterns to align the stages in worm and fly development and find a novel pairing between worm embryo and fly pupae, in addition to the embryo-to-embryo and larvae-to-larvae pairings. Furthermore, we find that the extent of non-canonical, non-coding transcription is similar in each organism, per base pair. Finally, we find in all three organisms that the gene-expression levels, both coding and non-coding, can be quantitatively predicted from chromatin features at the promoter using a 'universal model' based on a single set of organism-independent parameters.


Assuntos
Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Perfilação da Expressão Gênica , Transcriptoma/genética , Animais , Caenorhabditis elegans/embriologia , Caenorhabditis elegans/crescimento & desenvolvimento , Cromatina/genética , Análise por Conglomerados , Drosophila melanogaster/crescimento & desenvolvimento , Regulação da Expressão Gênica no Desenvolvimento/genética , Histonas/metabolismo , Humanos , Larva/genética , Larva/crescimento & desenvolvimento , Modelos Genéticos , Anotação de Sequência Molecular , Regiões Promotoras Genéticas/genética , Pupa/genética , Pupa/crescimento & desenvolvimento , RNA não Traduzido/genética , Análise de Sequência de RNA
7.
Nature ; 512(7515): 453-6, 2014 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-25164757

RESUMO

Despite the large evolutionary distances between metazoan species, they can show remarkable commonalities in their biology, and this has helped to establish fly and worm as model organisms for human biology. Although studies of individual elements and factors have explored similarities in gene regulation, a large-scale comparative analysis of basic principles of transcriptional regulatory features is lacking. Here we map the genome-wide binding locations of 165 human, 93 worm and 52 fly transcription regulatory factors, generating a total of 1,019 data sets from diverse cell types, developmental stages, or conditions in the three species, of which 498 (48.9%) are presented here for the first time. We find that structural properties of regulatory networks are remarkably conserved and that orthologous regulatory factor families recognize similar binding motifs in vivo and show some similar co-associations. Our results suggest that gene-regulatory properties previously observed for individual factors are general principles of metazoan regulation that are remarkably well-preserved despite extensive functional divergence of individual network connections. The comparative maps of regulatory circuitry provided here will drive an improved understanding of the regulatory underpinnings of model organism biology and how these relate to human biology, development and disease.


Assuntos
Caenorhabditis elegans/genética , Drosophila melanogaster/genética , Evolução Molecular , Regulação da Expressão Gênica/genética , Redes Reguladoras de Genes/genética , Fatores de Transcrição/metabolismo , Animais , Sítios de Ligação , Caenorhabditis elegans/crescimento & desenvolvimento , Imunoprecipitação da Cromatina , Sequência Conservada/genética , Drosophila melanogaster/crescimento & desenvolvimento , Regulação da Expressão Gênica no Desenvolvimento/genética , Genoma/genética , Humanos , Anotação de Sequência Molecular , Motivos de Nucleotídeos/genética , Especificidade de Órgãos/genética , Fatores de Transcrição/genética
8.
Trends Genet ; 32(5): 251-253, 2016 05.
Artigo em Inglês | MEDLINE | ID: mdl-27005445

RESUMO

The emergence of collective creative enterprise such as large scientific consortia is a unique feature in modern scientific research. We analyzed the temporal co-authorship network structures of ENCODE and modENCODE consortia. Our analysis revealed that the consortium members work closely as a community whereas non-members collaborate in the scale of a few laboratories. We also identified a few brokers playing an important role to facilitate collaborations with outside researchers.


Assuntos
Comportamento Cooperativo , Revisão da Pesquisa por Pares/tendências , Humanos
9.
Bioinformatics ; 33(14): 2199-2201, 2017 Jul 15.
Artigo em Inglês | MEDLINE | ID: mdl-28369339

RESUMO

SUMMARY: Genome-wide proximity ligation based assays like Hi-C have opened a window to the 3D organization of the genome. In so doing, they present data structures that are different from conventional 1D signal tracks. To exploit the 2D nature of Hi-C contact maps, matrix techniques like spectral analysis are particularly useful. Here, we present HiC-spector, a collection of matrix-related functions for analyzing Hi-C contact maps. In particular, we introduce a novel reproducibility metric for quantifying the similarity between contact maps based on spectral decomposition. The metric successfully separates contact maps mapped from Hi-C data coming from biological replicates, pseudo-replicates and different cell types. AVAILABILITY AND IMPLEMENTATION: Source code in Julia and Python, and detailed documentation is available at https://github.com/gersteinlab/HiC-spector . CONTACT: koonkiu.yan@gmail.com or mark@gersteinlab.org. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Assuntos
Cromossomos/química , Técnicas Genéticas , Genoma , Biotinilação , DNA/química , Biblioteca Gênica , Humanos , Reprodutibilidade dos Testes
10.
PLoS Comput Biol ; 13(7): e1005647, 2017 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-28742097

RESUMO

Genome-wide proximity ligation based assays such as Hi-C have revealed that eukaryotic genomes are organized into structural units called topologically associating domains (TADs). From a visual examination of the chromosomal contact map, however, it is clear that the organization of the domains is not simple or obvious. Instead, TADs exhibit various length scales and, in many cases, a nested arrangement. Here, by exploiting the resemblance between TADs in a chromosomal contact map and densely connected modules in a network, we formulate TAD identification as a network optimization problem and propose an algorithm, MrTADFinder, to identify TADs from intra-chromosomal contact maps. MrTADFinder is based on the network-science concept of modularity. A key component of it is deriving an appropriate background model for contacts in a random chain, by numerically solving a set of matrix equations. The background model preserves the observed coverage of each genomic bin as well as the distance dependence of the contact frequency for any pair of bins exhibited by the empirical map. Also, by introducing a tunable resolution parameter, MrTADFinder provides a self-consistent approach for identifying TADs at different length scales, hence the acronym "Mr" standing for Multiple Resolutions. We then apply MrTADFinder to various Hi-C datasets. The identified domain boundaries are marked by characteristic signatures in chromatin marks and transcription factors (TF) that are consistent with earlier work. Moreover, by calling TADs at different length scales, we observe that boundary signatures change with resolution, with different chromatin features having different characteristic length scales. Furthermore, we report an enrichment of HOT (high-occupancy target) regions near TAD boundaries and investigate the role of different TFs in determining boundaries at various resolutions. To further explore the interplay between TADs and epigenetic marks, as tumor mutational burden is known to be coupled to chromatin structure, we examine how somatic mutations are distributed across boundaries and find a clear stepwise pattern. Overall, MrTADFinder provides a novel computational framework to explore the multi-scale structures in Hi-C contact maps.


Assuntos
Cromatina , Cromossomos , Biologia Computacional/métodos , Modelos Genéticos , Algoritmos , Linhagem Celular , Núcleo Celular/química , Núcleo Celular/genética , Cromatina/química , Cromatina/genética , Cromatina/ultraestrutura , Cromossomos/química , Cromossomos/genética , Cromossomos/ultraestrutura , Genoma/genética , Genoma/fisiologia , Humanos , Ligação Proteica , Fatores de Transcrição/metabolismo
11.
Nature ; 489(7414): 91-100, 2012 Sep 06.
Artigo em Inglês | MEDLINE | ID: mdl-22955619

RESUMO

Transcription factors bind in a combinatorial fashion to specify the on-and-off states of genes; the ensemble of these binding events forms a regulatory network, constituting the wiring diagram for a cell. To examine the principles of the human transcriptional regulatory network, we determined the genomic binding information of 119 transcription-related factors in over 450 distinct experiments. We found the combinatorial, co-association of transcription factors to be highly context specific: distinct combinations of factors bind at specific genomic locations. In particular, there are significant differences in the binding proximal and distal to genes. We organized all the transcription factor binding into a hierarchy and integrated it with other genomic information (for example, microRNA regulation), forming a dense meta-network. Factors at different levels have different properties; for instance, top-level transcription factors more strongly influence expression and middle-level ones co-regulate targets to mitigate information-flow bottlenecks. Moreover, these co-regulations give rise to many enriched network motifs (for example, noise-buffering feed-forward loops). Finally, more connected network components are under stronger selection and exhibit a greater degree of allele-specific activity (that is, differential binding to the two parental alleles). The regulatory information obtained in this study will be crucial for interpreting personal genome sequences and understanding basic principles of human biology and disease.


Assuntos
DNA/genética , Enciclopédias como Assunto , Redes Reguladoras de Genes/genética , Genoma Humano/genética , Anotação de Sequência Molecular , Sequências Reguladoras de Ácido Nucleico/genética , Fatores de Transcrição/metabolismo , Alelos , Linhagem Celular , Fator de Transcrição GATA1/metabolismo , Perfilação da Expressão Gênica , Genômica , Humanos , Células K562 , Especificidade de Órgãos , Fosforilação/genética , Polimorfismo de Nucleotídeo Único/genética , Mapas de Interação de Proteínas , RNA não Traduzido/genética , RNA não Traduzido/metabolismo , Seleção Genética/genética , Sítio de Iniciação de Transcrição
12.
PLoS Comput Biol ; 11(4): e1004132, 2015 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-25884877

RESUMO

The topology of the gene-regulatory network has been extensively analyzed. Now, given the large amount of available functional genomic data, it is possible to go beyond this and systematically study regulatory circuits in terms of logic elements. To this end, we present Loregic, a computational method integrating gene expression and regulatory network data, to characterize the cooperativity of regulatory factors. Loregic uses all 16 possible two-input-one-output logic gates (e.g. AND or XOR) to describe triplets of two factors regulating a common target. We attempt to find the gate that best matches each triplet's observed gene expression pattern across many conditions. We make Loregic available as a general-purpose tool (github.com/gersteinlab/loregic). We validate it with known yeast transcription-factor knockout experiments. Next, using human ENCODE ChIP-Seq and TCGA RNA-Seq data, we are able to demonstrate how Loregic characterizes complex circuits involving both proximally and distally regulating transcription factors (TFs) and also miRNAs. Furthermore, we show that MYC, a well-known oncogenic driving TF, can be modeled as acting independently from other TFs (e.g., using OR gates) but antagonistically with repressing miRNAs. Finally, we inter-relate Loregic's gate logic with other aspects of regulation, such as indirect binding via protein-protein interactions, feed-forward loop motifs and global regulatory hierarchy.


Assuntos
Redes Reguladoras de Genes/genética , Genes Reguladores/genética , Modelos Logísticos , Modelos Genéticos , Fatores de Transcrição/genética , Ativação Transcricional/genética , Algoritmos , Animais , Simulação por Computador , Regulação da Expressão Gênica/genética , Humanos , Leucemia/genética , MicroRNAs/genética
13.
Genome Res ; 22(9): 1658-67, 2012 Sep.
Artigo em Inglês | MEDLINE | ID: mdl-22955978

RESUMO

Statistical models have been used to quantify the relationship between gene expression and transcription factor (TF) binding signals. Here we apply the models to the large-scale data generated by the ENCODE project to study transcriptional regulation by TFs. Our results reveal a notable difference in the prediction accuracy of expression levels of transcription start sites (TSSs) captured by different technologies and RNA extraction protocols. In general, the expression levels of TSSs with high CpG content are more predictable than those with low CpG content. For genes with alternative TSSs, the expression levels of downstream TSSs are more predictable than those of the upstream ones. Different TF categories and specific TFs vary substantially in their contributions to predicting expression. Between two cell lines, the differential expression of TSS can be precisely reflected by the difference of TF-binding signals in a quantitative manner, arguing against the conventional on-and-off model of TF binding. Finally, we explore the relationships between TF-binding signals and other chromatin features such as histone modifications and DNase hypersensitivity for determining expression. The models imply that these features regulate transcription in a highly coordinated manner.


Assuntos
Regulação da Expressão Gênica , Genômica , Fatores de Transcrição/metabolismo , Transcrição Gênica , Composição de Bases , Sítios de Ligação/genética , Linhagem Celular , Cromatina/genética , Cromatina/metabolismo , Biologia Computacional/métodos , Histonas/genética , Humanos , Modelos Biológicos , Regiões Promotoras Genéticas , Ligação Proteica/genética , Sítio de Iniciação de Transcrição
14.
Proc Natl Acad Sci U S A ; 107(15): 6841-6, 2010 Apr 13.
Artigo em Inglês | MEDLINE | ID: mdl-20351254

RESUMO

Gene regulatory networks have been shown to share some common aspects with commonplace social governance structures. Thus, we can get some intuition into their organization by arranging them into well-known hierarchical layouts. These hierarchies, in turn, can be placed between the extremes of autocracies, with well-defined levels and clear chains of command, and democracies, without such defined levels and with more co-regulatory partnerships between regulators. In general, the presence of partnerships decreases the variation in information flow amongst nodes within a level, more evenly distributing stress. Here we study various regulatory networks (transcriptional, modification, and phosphorylation) for five diverse species, Escherichia coli to human. We specify three levels of regulators--top, middle, and bottom--which collectively govern the non-regulator targets lying in the lowest fourth level. We define quantities for nodes, levels, and entire networks that measure their degree of collaboration and autocratic vs. democratic character. We show individual regulators have a range of partnership tendencies: Some regulate their targets in combination with other regulators in local instantiations of democratic structure, whereas others regulate mostly in isolation, in more autocratic fashion. Overall, we show that in all networks studied the middle level has the highest collaborative propensity and coregulatory partnerships occur most frequently amongst midlevel regulators, an observation that has parallels in corporate settings where middle managers must interact most to ensure organizational effectiveness. There is, however, one notable difference between networks in different species: The amount of collaborative regulation and democratic character increases markedly with overall genomic complexity.


Assuntos
Regulação Bacteriana da Expressão Gênica , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Animais , Escherichia coli/genética , Genoma , Humanos , Camundongos , Modelos Biológicos , Modelos Genéticos , Modelos Estatísticos , Mycobacterium tuberculosis/genética , Fosforilação , Ratos , Especificidade da Espécie
15.
Proc Natl Acad Sci U S A ; 107(20): 9186-91, 2010 May 18.
Artigo em Inglês | MEDLINE | ID: mdl-20439753

RESUMO

The genome has often been called the operating system (OS) for a living organism. A computer OS is described by a regulatory control network termed the call graph, which is analogous to the transcriptional regulatory network in a cell. To apply our firsthand knowledge of the architecture of software systems to understand cellular design principles, we present a comparison between the transcriptional regulatory network of a well-studied bacterium (Escherichia coli) and the call graph of a canonical OS (Linux) in terms of topology and evolution. We show that both networks have a fundamentally hierarchical layout, but there is a key difference: The transcriptional regulatory network possesses a few global regulators at the top and many targets at the bottom; conversely, the call graph has many regulators controlling a small set of generic functions. This top-heavy organization leads to highly overlapping functional modules in the call graph, in contrast to the relatively independent modules in the regulatory network. We further develop a way to measure evolutionary rates comparably between the two networks and explain this difference in terms of network evolution. The process of biological evolution via random mutation and subsequent selection tightly constrains the evolution of regulatory network hubs. The call graph, however, exhibits rapid evolution of its highly connected generic components, made possible by designers' continual fine-tuning. These findings stem from the design principles of the two systems: robustness for biological systems and cost effectiveness (reuse) for software systems.


Assuntos
Algoritmos , Evolução Molecular , Redes Reguladoras de Genes/genética , Genoma Bacteriano/genética , Metáfora , Design de Software , Escherichia coli
16.
bioRxiv ; 2023 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-36747870

RESUMO

The sparse nature of single-cell omics data makes it challenging to dissect the wiring and rewiring of the transcriptional and signaling drivers that regulate cellular states. Many of the drivers, referred to as "hidden drivers", are difficult to identify via conventional expression analysis due to low expression and inconsistency between RNA and protein activity caused by post-translational and other modifications. To address this issue, we developed scMINER, a mutual information (MI)-based computational framework for unsupervised clustering analysis and cell-type specific inference of intracellular networks, hidden drivers and network rewiring from single-cell RNA-seq data. We designed scMINER to capture nonlinear cell-cell and gene-gene relationships and infer driver activities. Systematic benchmarking showed that scMINER outperforms popular single-cell clustering algorithms, especially in distinguishing similar cell types. With respect to network inference, scMINER does not rely on the binding motifs which are available for a limited set of transcription factors, therefore scMINER can provide quantitative activity assessment for more than 6,000 transcription and signaling drivers from a scRNA-seq experiment. As demonstrations, we used scMINER to expose hidden transcription and signaling drivers and dissect their regulon rewiring in immune cell heterogeneity, lineage differentiation, and tissue specification. Overall, activity-based scMINER is a widely applicable, highly accurate, reproducible and scalable method for inferring cellular transcriptional and signaling networks in each cell state from scRNA-seq data. The scMINER software is publicly accessible via: https://github.com/jyyulab/scMINER.

17.
Res Sq ; 2023 Jan 27.
Artigo em Inglês | MEDLINE | ID: mdl-36747874

RESUMO

The sparse nature of single-cell omics data makes it challenging to dissect the wiring and rewiring of the transcriptional and signaling drivers that regulate cellular states. Many of the drivers, referred to as "hidden drivers", are difficult to identify via conventional expression analysis due to low expression and inconsistency between RNA and protein activity caused by post-translational and other modifications. To address this issue, we developed scMINER, a mutual information (MI)-based computational framework for unsupervised clustering analysis and cell-type specific inference of intracellular networks, hidden drivers and network rewiring from single-cell RNA-seq data. We designed scMINER to capture nonlinear cell-cell and gene-gene relationships and infer driver activities. Systematic benchmarking showed that scMINER outperforms popular single-cell clustering algorithms, especially in distinguishing similar cell types. With respect to network inference, scMINER does not rely on the binding motifs which are available for a limited set of transcription factors, therefore scMINER can provide quantitative activity assessment for more than 6,000 transcription and signaling drivers from a scRNA-seq experiment. As demonstrations, we used scMINER to expose hidden transcription and signaling drivers and dissect their regulon rewiring in immune cell heterogeneity, lineage differentiation, and tissue specification. Overall, activity-based scMINER is a widely applicable, highly accurate, reproducible and scalable method for inferring cellular transcriptional and signaling networks in each cell state from scRNA-seq data. The scMINER software is publicly accessible via: https://github.com/jyyulab/scMINER.

18.
Nat Commun ; 14(1): 2581, 2023 05 04.
Artigo em Inglês | MEDLINE | ID: mdl-37142594

RESUMO

Many signaling and other genes known as "hidden" drivers may not be genetically or epigenetically altered or differentially expressed at the mRNA or protein levels, but, rather, drive a phenotype such as tumorigenesis via post-translational modification or other mechanisms. However, conventional approaches based on genomics or differential expression are limited in exposing such hidden drivers. Here, we present a comprehensive algorithm and toolkit NetBID2 (data-driven network-based Bayesian inference of drivers, version 2), which reverse-engineers context-specific interactomes and integrates network activity inferred from large-scale multi-omics data, empowering the identification of hidden drivers that could not be detected by traditional analyses. NetBID2 has substantially re-engineered the previous prototype version by providing versatile data visualization and sophisticated statistical analyses, which strongly facilitate researchers for result interpretation through end-to-end multi-omics data analysis. We demonstrate the power of NetBID2 using three hidden driver examples. We deploy NetBID2 Viewer, Runner, and Cloud apps with 145 context-specific gene regulatory and signaling networks across normal tissues and paediatric and adult cancers to facilitate end-to-end analysis, real-time interactive visualization and cloud-based data sharing. NetBID2 is freely available at https://jyyulab.github.io/NetBID .


Assuntos
Algoritmos , Genômica , Humanos , Teorema de Bayes , Transformação Celular Neoplásica/genética , Projetos de Pesquisa , Software
19.
Sci Adv ; 9(40): eadg9959, 2023 10 06.
Artigo em Inglês | MEDLINE | ID: mdl-37801507

RESUMO

Lentiviral vector (LV)-based gene therapy holds promise for a broad range of diseases. Analyzing more than 280,000 vector integration sites (VISs) in 273 samples from 10 patients with X-linked severe combined immunodeficiency (SCID-X1), we discovered shared LV integrome signatures in 9 of 10 patients in relation to the genomics, epigenomics, and 3D structure of the human genome. VISs were enriched in the nuclear subcompartment A1 and integrated into super-enhancers close to nuclear pore complexes. These signatures were validated in T cells transduced with an LV encoding a CD19-specific chimeric antigen receptor. Intriguingly, the one patient whose VISs deviated from the identified integrome signatures had a distinct clinical course. Comparison of LV and gamma retrovirus integromes regarding their 3D genome signatures identified differences that might explain the lower risk of insertional mutagenesis in LV-based gene therapy. Our findings suggest that LV integrome signatures, shaped by common features such as genome organization, may affect the efficacy of LV-based cellular therapies.


Assuntos
Vetores Genéticos , Doenças por Imunodeficiência Combinada Ligada ao Cromossomo X , Humanos , Vetores Genéticos/genética , Terapia Genética , Retroviridae/genética , Doenças por Imunodeficiência Combinada Ligada ao Cromossomo X/genética , Doenças por Imunodeficiência Combinada Ligada ao Cromossomo X/terapia , Linfócitos T
20.
PLoS Comput Biol ; 7(1): e1001050, 2011 Jan 06.
Artigo em Inglês | MEDLINE | ID: mdl-21253555

RESUMO

We have accumulated a large amount of biological network data and expect even more to come. Soon, we anticipate being able to compare many different biological networks as we commonly do for molecular sequences. It has long been believed that many of these networks change, or "rewire", at different rates. It is therefore important to develop a framework to quantify the differences between networks in a unified fashion. We developed such a formalism based on analogy to simple models of sequence evolution, and used it to conduct a systematic study of network rewiring on all the currently available biological networks. We found that, similar to sequences, biological networks show a decreased rate of change at large time divergences, because of saturation in potential substitutions. However, different types of biological networks consistently rewire at different rates. Using comparative genomics and proteomics data, we found a consistent ordering of the rewiring rates: transcription regulatory, phosphorylation regulatory, genetic interaction, miRNA regulatory, protein interaction, and metabolic pathway network, from fast to slow. This ordering was found in all comparisons we did of matched networks between organisms. To gain further intuition on network rewiring, we compared our observed rewirings with those obtained from simulation. We also investigated how readily our formalism could be mapped to other network contexts; in particular, we showed how it could be applied to analyze changes in a range of "commonplace" networks such as family trees, co-authorships and linux-kernel function dependencies.


Assuntos
Evolução Biológica , Genômica , Proteômica
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA