Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 95
Filtrar
Mais filtros

Tipo de documento
Intervalo de ano de publicação
1.
Cell ; 184(11): 3041-3055.e21, 2021 05 27.
Artigo em Inglês | MEDLINE | ID: mdl-33964211

RESUMO

cis-regulatory elements (CREs) encode the genomic blueprints of spatiotemporal gene expression programs enabling highly specialized cell functions. Using single-cell genomics in six maize organs, we determined the cis- and trans-regulatory factors defining diverse cell identities and coordinating chromatin organization by profiling transcription factor (TF) combinatorics, identifying TFs with non-cell-autonomous activity, and uncovering TFs underlying higher-order chromatin interactions. Cell-type-specific CREs were enriched for enhancer activity and within unmethylated long terminal repeat retrotransposons. Moreover, we found cell-type-specific CREs are hotspots for phenotype-associated genetic variants and were targeted by selection during modern maize breeding, highlighting the biological implications of this CRE atlas. Through comparison of maize and Arabidopsis thaliana developmental trajectories, we identified TFs and CREs with conserved and divergent chromatin dynamics, showcasing extensive evolution of gene regulatory networks. In addition to this rich dataset, we developed single-cell analysis software, Socrates, which can be used to understand cis-regulatory variation in any species.


Assuntos
Regulação da Expressão Gênica de Plantas/genética , Elementos Reguladores de Transcrição/genética , Zea mays/genética , Arabidopsis/genética , Expressão Gênica/genética , Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica de Plantas/fisiologia , Redes Reguladoras de Genes/genética , Genoma , Genômica , Elementos Reguladores de Transcrição/fisiologia , Análise de Célula Única , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismo , Transcriptoma/genética
2.
Am J Hum Genet ; 111(2): 338-349, 2024 02 01.
Artigo em Inglês | MEDLINE | ID: mdl-38228144

RESUMO

Clinical exome and genome sequencing have revolutionized the understanding of human disease genetics. Yet many genes remain functionally uncharacterized, complicating the establishment of causal disease links for genetic variants. While several scoring methods have been devised to prioritize these candidate genes, these methods fall short of capturing the expression heterogeneity across cell subpopulations within tissues. Here, we introduce single-cell tissue-specific gene prioritization using machine learning (STIGMA), an approach that leverages single-cell RNA-seq (scRNA-seq) data to prioritize candidate genes associated with rare congenital diseases. STIGMA prioritizes genes by learning the temporal dynamics of gene expression across cell types during healthy organogenesis. To assess the efficacy of our framework, we applied STIGMA to mouse limb and human fetal heart scRNA-seq datasets. In a cohort of individuals with congenital limb malformation, STIGMA prioritized 469 variants in 345 genes, with UBA2 as a notable example. For congenital heart defects, we detected 34 genes harboring nonsynonymous de novo variants (nsDNVs) in two or more individuals from a set of 7,958 individuals, including the ortholog of Prdm1, which is associated with hypoplastic left ventricle and hypoplastic aortic arch. Overall, our findings demonstrate that STIGMA effectively prioritizes tissue-specific candidate genes by utilizing single-cell transcriptome data. The ability to capture the heterogeneity of gene expression across cell populations makes STIGMA a powerful tool for the discovery of disease-associated genes and facilitates the identification of causal variants underlying human genetic disorders.


Assuntos
Cardiopatias Congênitas , Transcriptoma , Humanos , Animais , Camundongos , Exoma/genética , Cardiopatias Congênitas/genética , Sequenciamento do Exoma , Aprendizado de Máquina , Análise de Célula Única/métodos , Enzimas Ativadoras de Ubiquitina/genética
3.
Development ; 151(10)2024 May 15.
Artigo em Inglês | MEDLINE | ID: mdl-38804879

RESUMO

Dorsal interneurons (dIs) in the spinal cord encode the perception of touch, pain, heat, itchiness and proprioception. Previous studies using genetic strategies in animal models have revealed important insights into dI development, but the molecular details of how dIs arise as distinct populations of neurons remain incomplete. We have developed a resource to investigate dI fate specification by combining a single-cell RNA-Seq atlas of mouse embryonic stem cell-derived dIs with pseudotime analyses. To validate this in silico resource as a useful tool, we used it to first identify genes that are candidates for directing the transition states that lead to distinct dI lineage trajectories, and then validated them using in situ hybridization analyses in the developing mouse spinal cord in vivo. We have also identified an endpoint of the dI5 lineage trajectory and found that dIs become more transcriptionally homogeneous during terminal differentiation. This study introduces a valuable tool for further discovery about the timing of gene expression during dI differentiation and demonstrates its utility in clarifying dI lineage relationships.


Assuntos
Diferenciação Celular , Linhagem da Célula , Regulação da Expressão Gênica no Desenvolvimento , Interneurônios , Medula Espinal , Animais , Camundongos , Medula Espinal/metabolismo , Medula Espinal/embriologia , Linhagem da Célula/genética , Interneurônios/metabolismo , Interneurônios/citologia , Diferenciação Celular/genética , Análise de Célula Única , Células-Tronco Embrionárias Murinas/metabolismo , Células-Tronco Embrionárias Murinas/citologia , RNA-Seq
4.
Proc Natl Acad Sci U S A ; 121(37): e2316256121, 2024 Sep 10.
Artigo em Inglês | MEDLINE | ID: mdl-39226366

RESUMO

Trajectory inference methods are essential for analyzing the developmental paths of cells in single-cell sequencing datasets. It provides insights into cellular differentiation, transitions, and lineage hierarchies, helping unravel the dynamic processes underlying development and disease progression. However, many existing tools lack a coherent statistical model and reliable uncertainty quantification, limiting their utility and robustness. In this paper, we introduce VITAE (Variational Inference for Trajectory by AutoEncoder), a statistical approach that integrates a latent hierarchical mixture model with variational autoencoders to infer trajectories. The statistical hierarchical model enhances the interpretability of our framework, while the posterior approximations generated by our variational autoencoder ensure computational efficiency and provide uncertainty quantification of cell projections along trajectories. Specifically, VITAE enables simultaneous trajectory inference and data integration, improving the accuracy of learning a joint trajectory structure in the presence of biological and technical heterogeneity across datasets. We show that VITAE outperforms other state-of-the-art trajectory inference methods on both real and synthetic data under various trajectory topologies. Furthermore, we apply VITAE to jointly analyze three distinct single-cell RNA sequencing datasets of the mouse neocortex, unveiling comprehensive developmental lineages of projection neurons. VITAE effectively reduces batch effects within and across datasets and uncovers finer structures that might be overlooked in individual datasets. Additionally, we showcase VITAE's efficacy in integrative analyses of multiomic datasets with continuous cell population structures.


Assuntos
Aprendizado Profundo , Genômica , Análise de Célula Única , Análise de Célula Única/métodos , Animais , Camundongos , Genômica/métodos , Análise de Sequência de RNA/métodos , Humanos
5.
Development ; 150(11)2023 06 01.
Artigo em Inglês | MEDLINE | ID: mdl-37260149

RESUMO

Inspired by Waddington's illustration of an epigenetic landscape, cell-fate transitions have been envisioned as bifurcating dynamical systems, wherein exogenous signaling dynamics couple to the enormously complex signaling and transcriptional machinery of a cell to elicit qualitative transitions in its collective state. Single-cell RNA sequencing (scRNA-seq), which measures the distributions of possible transcriptional states in large populations of differentiating cells, provides an alternate view, in which development is marked by the variations of a myriad of genes. Here, we present a mathematical formalism for rigorously evaluating, from a dynamical systems perspective, whether scRNA-seq trajectories display statistical signatures consistent with bifurcations and, as a case study, pinpoint regions of multistability along the neutrophil branch of hematopoeitic differentiation. Additionally, we leverage the geometric features of linear instability to identify the low-dimensional phase plane in gene expression space within which the multistability unfolds, highlighting novel genetic players that are crucial for neutrophil differentiation. Broadly, we show that a dynamical systems treatment of scRNA-seq data provides mechanistic insights into the high-dimensional processes of cellular differentiation, taking a step toward systematic construction of mathematical models for transcriptomic dynamics.


Assuntos
Hematopoese , Transcriptoma , Transcriptoma/genética , Diferenciação Celular/genética , Hematopoese/genética , Perfilação da Expressão Gênica/métodos , Modelos Teóricos , Análise de Célula Única/métodos
6.
Brief Bioinform ; 25(3)2024 Mar 27.
Artigo em Inglês | MEDLINE | ID: mdl-38725155

RESUMO

Single-cell RNA sequencing (scRNA-seq) experiments have become instrumental in developmental and differentiation studies, enabling the profiling of cells at a single or multiple time-points to uncover subtle variations in expression profiles reflecting underlying biological processes. Benchmarking studies have compared many of the computational methods used to reconstruct cellular dynamics; however, researchers still encounter challenges in their analysis due to uncertainty with respect to selecting the most appropriate methods and parameters. Even among universal data processing steps used by trajectory inference methods such as feature selection and dimension reduction, trajectory methods' performances are highly dataset-specific. To address these challenges, we developed Escort, a novel framework for evaluating a dataset's suitability for trajectory inference and quantifying trajectory properties influenced by analysis decisions. Escort evaluates the suitability of trajectory analysis and the combined effects of processing choices using trajectory-specific metrics. Escort navigates single-cell trajectory analysis through these data-driven assessments, reducing uncertainty and much of the decision burden inherent to trajectory inference analyses. Escort is implemented in an accessible R package and R/Shiny application, providing researchers with the necessary tools to make informed decisions during trajectory analysis and enabling new insights into dynamic biological processes at single-cell resolution.


Assuntos
RNA-Seq , Análise da Expressão Gênica de Célula Única , Humanos , Algoritmos , Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , RNA-Seq/métodos , Análise da Expressão Gênica de Célula Única/métodos , Software , Animais
7.
J Biol Chem ; 300(7): 107442, 2024 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-38838779

RESUMO

Sebaceous glands (SG) and their oily secretion (sebum) are indispensable for maintaining skin structure and function, and their deregulation causes skin disorders including but not limited to acne. Recent studies also indicate that sebum may have important immunomodulatory activities and may influence whole-body energy metabolism. However, the progressive transcriptional changes of sebocytes that lead to sebum production have never been characterized in detail. Here, we exploited the high cellular resolution provided by sebaceous hyperplasia and integrated spatial transcriptomics, pseudo time analysis, RNA velocity, and functional enrichment to map the landscape of sebaceous differentiation. Our results were validated by comparison with published SG transcriptome data and further corroborated by assessing the protein expression pattern of a subset of the transcripts in the public repository Human Protein Atlas. Departing from four sebocyte differentiation stages generated by unsupervised clustering, we demonstrate consecutive modulation of cellular functions associable with specific gene sets, from cell proliferation and oxidative phosphorylation via lipid synthesis to cell death. Both validation methods confirmed the biological significance of our results. Our report is complemented by a freely available and browsable online tool. Our data provide the first high-resolution spatial portrait of the SG transcriptional landscape and deliver starting points for experimentally assessing novel candidate molecules for regulating SG homeostasis in health and disease.


Assuntos
Diferenciação Celular , Glândulas Sebáceas , Humanos , Glândulas Sebáceas/metabolismo , Glândulas Sebáceas/citologia , Transcriptoma , Sebo/metabolismo , Transcrição Gênica
8.
Bioinformatics ; 2024 Jul 08.
Artigo em Inglês | MEDLINE | ID: mdl-38976653

RESUMO

MOTIVATION: Understanding the dynamics of gene expression across different cellular states is crucial for discerning the mechanisms underneath cellular differentiation. Genes that exhibit variation in mean expression as a function of Pseudotime and between branching trajectories are expected to govern cell fate decisions. We introduce scMaSigPro, a method for the identification of differential gene expression patterns along Pseudotime and branching paths simultaneously. RESULTS: We assessed the performance of scMaSigPro using synthetic and public datasets. Our evaluation shows that scMaSigPro outperforms existing methods in controlling the False Positive Rate and is computationally efficient. AVAILABILITY AND IMPLEMENTATION: scMaSigPro is available as a free R package (version 4.0 or higher) under the GPL(≥2) license on GitHub at 'github.com/BioBam/scMaSigPro' and archived with version 0.03 on Zenodo at 'zenodo.org/records/12568922'.

9.
Dev Biol ; 502: 39-49, 2023 10.
Artigo em Inglês | MEDLINE | ID: mdl-37437860

RESUMO

As the source of embryonic stem cells (ESCs), inner cell mass (ICM) can form all tissues of the embryo proper, however, its role in early human lineage specification remains controversial. Although a stepwise differentiation model has been proposed suggesting the existence of ICM as a distinct developmental stage, the underlying molecular mechanism remains unclear. In the present study, we perform an integrated analysis on the public human preimplantation embryonic single-cell transcriptomic data and apply a trajectory inference algorithm to measure the cell plasticity. In our results, ICM population can be clearly discriminated on the dimension-reduced graph and confirmed by compelling evidences, thus validating the two-step hypothesis of lineage commitment. According to the branch probabilities and differentiation potential, we determine the precise time points for two lineage segregations. Further analysis on gene expression dynamics and regulatory network indicates that transcription factors including GSC, PRDM1, and SPIC may underlie the decisions of ICM fate. In addition, new human ICM marker genes, such as EPHA4 and CCR8 are discovered and validated by immunofluorescence. Given the potential clinical applications of ESCs, our analysis provides a further understanding of human ICM cells and facilitates the exploration of more unique characteristics in early human development.


Assuntos
Blastocisto , Transcriptoma , Humanos , Transcriptoma/genética , Linhagem da Célula/genética , Blastocisto/metabolismo , Embrião de Mamíferos , Diferenciação Celular/genética , Regulação da Expressão Gênica no Desenvolvimento
10.
Apoptosis ; 29(3-4): 460-481, 2024 Apr.
Artigo em Inglês | MEDLINE | ID: mdl-38017206

RESUMO

Previous research has demonstrated that the conversion of hepatocellular carcinoma (HCC) to intrahepatic cholangiocarcinoma (iCCA) can be stimulated by manipulating the tumor microenvironment linked with necroptosis. However, the specific cells regulating the necroptosis microenvironment have not yet been identified. Additionally, further inquiry into the mechanism of how the tumor microenvironment regulates necroptosis and its impact on primary liver cancer(PLC) progression may be beneficial for precision therapy. We recruited a single-cell RNA sequencing dataset (scRNA-seq) with 34 samples from 4 HCC patients and 3 iCCA patients, and a Spatial Transcriptomic (ST) dataset including one each of HCC, iCCA, and combined hepatocellular-cholangiocarcinoma (cHCC-CCA). Quality control, dimensionality reduction and clustering were based on Seurat software (v4.2.2) process and batch effects were removed by harmony (v0.1.1) software. The pseudotime analysis (also known as cell trajectory) in the single cell dataset was performed by monocle2 software (v2.24.0). Calculation of necroptosis fraction was performed by AUCell (v1.16.0) software. Switch gene analysis was performed by geneSwitches(v0.1.0) software. Dimensionality reduction, clustering, and spatial image in ST dataset were performed by Seurat (v4.0.2). Tumor cell identification, tumor subtype characterization, and cell type deconvolution in spot were performed by SpaCET (v1.0.0) software. Immunofluorescence and immunohistochemistry experiments were used to prove our conclusions. Analysis of intercellular communication was performed using CellChat software (v1.4.0). ScRNA-seq analysis of HCC and iCCA revealed that necroptosis predominantly occurred in the myeloid cell subset, particularly in FCGBP + SPP1 + tumor-associated macrophages (TAMs), which had the highest likelihood of undergoing necroptosis. The existence of macrophages undergoing necroptosis cell death was further confirmed by immunofluorescence. Regions of HCC with poor differentiation, cHCC-CCA with more cholangiocarcinoma features, and the tumor region of iCCA shared spatial colocalization with FCGBP + macrophages, as confirmed by spatial transcriptomics, immunohistochemistry and immunofluorescence. Pseudotime analysis showed that premalignant cells could progress into two directions, one towards HCC and the other towards iCCA and cHCC-CCA. Immunofluorescence and immunohistochemistry experiments demonstrated that the number of macrophages undergoing necroptosis in cHCC-CCA was higher than in iCCA and HCC, the number of macrophages undergoing necroptosis in cHCC-CCA with cholangiocarcinoma features was more than in cHCC-CCA with hepatocellular carcinoma features. Further investigation showed that myeloid cells with the highest necroptosis score were derived from the HCC_4 case, which had a severe inflammatory background on pathological histology and was likely to progress towards iCCA and cHCC-CCA. Switchgene analysis indicated that S100A6 may play a significant role in the progression of premalignant cells towards iCCA and cHCC-CCA. Immunohistochemistry confirmed the expression of S100A6 in PLC, the more severe inflammatory background of the tumor area, the more cholangiocellular carcinoma features of the tumor area, S100A6 expression was higher. The emergence of necroptosis microenvironment was found to be significantly associated with FCGBP + SPP1 + TAMs in PLC. In the presence of necroptosis microenvironment, premalignant cells appeared to transform into iCCA or cHCC-CCA. In contrast, without a necroptosis microenvironment, premalignant cells tended to develop into HCC, exhibiting amplified stemness-related genes (SRGs) and heightened malignancy.


Assuntos
Neoplasias dos Ductos Biliares , Carcinoma Hepatocelular , Colangiocarcinoma , Neoplasias Hepáticas , Humanos , Carcinoma Hepatocelular/genética , Carcinoma Hepatocelular/patologia , Neoplasias Hepáticas/genética , Neoplasias Hepáticas/patologia , Macrófagos Associados a Tumor/patologia , Necroptose , Apoptose , Colangiocarcinoma/genética , Colangiocarcinoma/patologia , Ductos Biliares Intra-Hepáticos/patologia , Neoplasias dos Ductos Biliares/genética , Neoplasias dos Ductos Biliares/patologia , Estudos Retrospectivos , Microambiente Tumoral/genética , Moléculas de Adesão Celular
11.
Mamm Genome ; 35(2): 296-307, 2024 Jun.
Artigo em Inglês | MEDLINE | ID: mdl-38600211

RESUMO

Varicella-zoster virus (VZV), a common pathogen with humans as the sole host, causes primary infection and undergoes a latent period in sensory ganglia. The recurrence of VZV is often accompanied by severe neuralgia in skin tissue, which has a serious impact on the life of patients. During the acute infection of VZV, there are few related studies on the pathophysiological mechanism of skin tissue. In this study, transcriptome sequencing data from the acute response period within 2 days of VZV antigen stimulation of the skin were used to explore a model of the trajectory of skin tissue changes during VZV infection. It was found that early VZV antigen stimulation caused activation of mainly natural immune-related signaling pathways, while in the late phase activation of mainly active immune-related signaling pathways. JAK-STAT, NFκB, and TNFα signaling pathways are gradually activated with the progression of infection, while Hypoxia is progressively inhibited. In addition, we found that dendritic cell-mediated immune responses play a dominant role in the lesion damage caused by VZV antigen stimulation of the skin. This study provides a theoretical basis for the study of the molecular mechanisms of skin lesions during acute VZV infection.


Assuntos
Herpesvirus Humano 3 , Transdução de Sinais , Pele , Infecção pelo Vírus da Varicela-Zoster , Herpesvirus Humano 3/genética , Pele/patologia , Pele/virologia , Pele/imunologia , Animais , Infecção pelo Vírus da Varicela-Zoster/virologia , Infecção pelo Vírus da Varicela-Zoster/imunologia , Infecção pelo Vírus da Varicela-Zoster/genética , Infecção pelo Vírus da Varicela-Zoster/patologia , Humanos , Camundongos , Células Dendríticas/imunologia , Herpes Zoster/virologia , Herpes Zoster/patologia , Herpes Zoster/genética , Herpes Zoster/imunologia , Transcriptoma , Modelos Animais de Doenças , Antígenos Virais/imunologia , Antígenos Virais/genética , NF-kappa B/metabolismo , NF-kappa B/genética
12.
Beijing Da Xue Xue Bao Yi Xue Ban ; 56(2): 199-206, 2024 Apr 18.
Artigo em Zh | MEDLINE | ID: mdl-38595234

RESUMO

OBJECTIVE: To delve deeply into the dynamic trajectories of cell subpopulations and the communication network among immune cell subgroups during the malignant progression of glioblastoma (GBM), and to endeavor to unearth key risk biomarkers in the GBM malignancy progression, so as to provide a more profound understanding for the treatment and prognosis of this disease by integrating transcriptomic data and clinical information of the GBM patients. METHODS: Utilizing single-cell sequencing data analysis, we constructed a cell subgroup atlas during the malignant progression of GBM. The Monocle2 tool was employed to build dynamic progression trajectories of the tumor cell subgroups in GBM. Through gene enrichment analysis, we explored the biological processes enriched in genes that significantly changed with the malignancy progression of GBM tumor cell subpopulations. CellChat was used to identify the communication network between the different immune cell subgroups. Survival analysis helped in identifying risk molecular markers that impacted the patient prognosis during the malignant progression of GBM. This method ological approach offered a comprehensive and detailed examination of the cellular and molecular dynamics within GBM, providing a robust framework for understanding the disease' s progression and potential therapeutic targets. RESULTS: The analysis of single-cell sequencing data identified 6 different cell types, including lymphocytes, pericytes, oligodendrocytes, macrophages, glioma cells, and microglia. The 27 151 cells in the single-cell dataset included 3 881 cells from the patients with low-grade glioma (LGG), 10 166 cells from the patients with newly diagnosed GBM, and 13 104 cells from the patients with recurrent glioma (rGBM). The pseudo-time analysis of the glioma cell subgroups indicated significant cellular heterogeneity during malignant progression. The cell interaction analysis of immune cell subgroups revealed the communication network among the different immune subgroups in GBM malignancy, identifying 22 biologically significant ligand-receptor pairs across 12 key biological pathways. Survival analysis had identified 8 genes related to the prognosis of the GBM patients, among which SERPINE1, COL6A1, SPP1, LTF, C1S, AEBP1, and SAA1L were high-risk genes in the GBM patients, and ABCC8 was low-risk genes in the GBM patients. These findings not only provided new theoretical bases for the treatment of GBM, but also offered fresh insights for the prognosis assessment and treatment decision-making for the GBM patients. CONCLUSION: This research comprehensively and profoundly reveals the dynamic changes in glioma cell subpopulations and the communication patterns among the immune cell subgroups during the malignant progression of GBM. These findings are of significant importance for understanding the complex biological processes of GBM, providing crucial new insights for precision medicine and treatment decisions in GBM. Through these studies, we hope to provide more effective treatment options and more accurate prognostic assessments for the patients with GBM.


Assuntos
Neoplasias Encefálicas , Glioblastoma , Glioma , Humanos , Glioblastoma/genética , Glioblastoma/metabolismo , Glioblastoma/patologia , Neoplasias Encefálicas/genética , Recidiva Local de Neoplasia , Prognóstico , Comunicação Celular , Carboxipeptidases , Proteínas Repressoras
13.
BMC Bioinformatics ; 24(1): 55, 2023 Feb 20.
Artigo em Inglês | MEDLINE | ID: mdl-36803767

RESUMO

BACKGROUND: The advance in single-cell RNA sequencing technology has enhanced the analysis of cell development by profiling heterogeneous cells in individual cell resolution. In recent years, many trajectory inference methods have been developed. They have focused on using the graph method to infer the trajectory using single-cell data, and then calculate the geodesic distance as the pseudotime. However, these methods are vulnerable to errors caused by the inferred trajectory. Therefore, the calculated pseudotime suffers from such errors. RESULTS: We proposed a novel framework for trajectory inference called the single-cell data Trajectory inference method using Ensemble Pseudotime inference (scTEP). scTEP utilizes multiple clustering results to infer robust pseudotime and then uses the pseudotime to fine-tune the learned trajectory. We evaluated the scTEP using 41 real scRNA-seq data sets, all of which had the ground truth development trajectory. We compared the scTEP with state-of-the-art methods using the aforementioned data sets. Experiments on real linear and non-linear data sets demonstrate that our scTEP performed superior on more data sets than any other method. The scTEP also achieved a higher average and lower variance on most metrics than other state-of-the-art methods. In terms of trajectory inference capacity, the scTEP outperforms those methods. In addition, the scTEP is more robust to the unavoidable errors resulting from clustering and dimension reduction. CONCLUSION: The scTEP demonstrates that utilizing multiple clustering results for the pseudotime inference procedure enhances its robustness. Furthermore, robust pseudotime strengthens the accuracy of trajectory inference, which is the most crucial component in the pipeline. scTEP is available at https://cran.r-project.org/package=scTEP .


Assuntos
Perfilação da Expressão Gênica , Software , Perfilação da Expressão Gênica/métodos , Análise de Célula Única/métodos , Análise por Conglomerados , Benchmarking , Análise de Sequência de RNA/métodos
14.
EMBO J ; 38(1)2019 01 03.
Artigo em Inglês | MEDLINE | ID: mdl-30257965

RESUMO

An intricate link is becoming apparent between metabolism and cellular identities. Here, we explore the basis for such a link in an in vitro model for early mouse embryonic development: from naïve pluripotency to the specification of primordial germ cells (PGCs). Using single-cell RNA-seq with statistical modelling and modulation of energy metabolism, we demonstrate a functional role for oxidative mitochondrial metabolism in naïve pluripotency. We link mitochondrial tricarboxylic acid cycle activity to IDH2-mediated production of alpha-ketoglutarate and through it, the activity of key epigenetic regulators. Accordingly, this metabolite has a role in the maintenance of naïve pluripotency as well as in PGC differentiation, likely through preserving a particular histone methylation status underlying the transient state of developmental competence for the PGC fate. We reveal a link between energy metabolism and epigenetic control of cell state transitions during a developmental trajectory towards germ cell specification, and establish a paradigm for stabilizing fleeting cellular states through metabolic modulation.


Assuntos
Diferenciação Celular/efeitos dos fármacos , Células-Tronco Embrionárias/efeitos dos fármacos , Células Germinativas/efeitos dos fármacos , Ácidos Cetoglutáricos/farmacologia , Células-Tronco Pluripotentes/efeitos dos fármacos , Animais , Diferenciação Celular/genética , Células Cultivadas , Embrião de Mamíferos , Células-Tronco Embrionárias/fisiologia , Epigênese Genética/efeitos dos fármacos , Epigênese Genética/genética , Feminino , Regulação da Expressão Gênica no Desenvolvimento/efeitos dos fármacos , Células Germinativas/fisiologia , Ácidos Cetoglutáricos/metabolismo , Masculino , Redes e Vias Metabólicas/efeitos dos fármacos , Redes e Vias Metabólicas/genética , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Transgênicos , Células-Tronco Pluripotentes/fisiologia
15.
Biostatistics ; 2022 Dec 13.
Artigo em Inglês | MEDLINE | ID: mdl-36511385

RESUMO

In the analysis of single-cell RNA sequencing data, researchers often characterize the variation between cells by estimating a latent variable, such as cell type or pseudotime, representing some aspect of the cell's state. They then test each gene for association with the estimated latent variable. If the same data are used for both of these steps, then standard methods for computing p-values in the second step will fail to achieve statistical guarantees such as Type 1 error control. Furthermore, approaches such as sample splitting that can be applied to solve similar problems in other settings are not applicable in this context. In this article, we introduce count splitting, a flexible framework that allows us to carry out valid inference in this setting, for virtually any latent variable estimation technique and inference approach, under a Poisson assumption. We demonstrate the Type 1 error control and power of count splitting in a simulation study and apply count splitting to a data set of pluripotent stem cells differentiating to cardiomyocytes.

16.
Brief Bioinform ; 22(5)2021 09 02.
Artigo em Inglês | MEDLINE | ID: mdl-34370020

RESUMO

Recent advances in bioinformatics analyses have led to the development of novel tools enabling the capture and trajectory mapping of single-cell RNA sequencing (scRNAseq) data. However, there is a lack of methods to assess the contributions of biological pathways and transcription factors to an overall developmental trajectory mapped from scRNAseq data. In this manuscript, we present a simplified approach for trajectory inference of pathway significance (TIPS) that leverages existing knowledgebases of functional pathways and other gene lists to provide further mechanistic insights into a biological process. TIPS identifies key pathways which contribute to a process of interest, as well as the individual genes that best reflect these changes. TIPS also provides insight into the relative timing of pathway changes, as well as a suite of visualizations to enable simplified data interpretation of scRNAseq libraries generated using a wide range of techniques. The TIPS package can be run through either a web server or downloaded as a user-friendly GUI run in R, and may serve as a useful tool to help biologists perform deeper functional analyses and visualization of their single-cell data.


Assuntos
Biologia Computacional/métodos , Perfilação da Expressão Gênica/métodos , RNA-Seq/métodos , Transdução de Sinais/genética , Análise de Célula Única/métodos , Linfócitos T CD8-Positivos/metabolismo , Células Cultivadas , Humanos , Internet , Reprodutibilidade dos Testes , Fatores de Tempo
17.
Biometrics ; 79(4): 3191-3202, 2023 12.
Artigo em Inglês | MEDLINE | ID: mdl-36807295

RESUMO

Bayesian networks have been widely used to generate causal hypotheses from multivariate data. Despite their popularity, the vast majority of existing causal discovery approaches make the strong assumption of a (partially) homogeneous sampling scheme. However, such assumption can be seriously violated, causing significant biases when the underlying population is inherently heterogeneous. To this end, we propose a novel causal Bayesian network model, termed BN-LTE, that embeds heterogeneous samples onto a low-dimensional manifold and builds Bayesian networks conditional on the embedding. This new framework allows for more precise network inference by improving the estimation resolution from the population level to the observation level. Moreover, while causal Bayesian networks are in general not identifiable with purely observational, cross-sectional data due to Markov equivalence, with the blessing of causal effect heterogeneity, we prove that the proposed BN-LTE is uniquely identifiable under relatively mild assumptions. Through extensive experiments, we demonstrate the superior performance of BN-LTE in causal structure learning as well as inferring observation-specific gene regulatory networks from observational data.


Assuntos
Algoritmos , Redes Reguladoras de Genes , Humanos , Teorema de Bayes , Estudos Transversais , Causalidade
18.
Genomics ; 114(2): 110316, 2022 03.
Artigo em Inglês | MEDLINE | ID: mdl-35202721

RESUMO

The problem of human hair loss has caused widespread concern, however, such research is difficult because the periodicity is not obvious and the deeper levels knowledge of dermal papilla (DP) stem cells' differentiation are limited. Here, cashmere goats which have obvious periodicity of hair follicles were used, based on unbiased scRNA sequencing, we constructed DP cell lineage differentiation trajectory and revealed the key genes, signals and functions involved in cell fate decisions. And then we revealed the molecular landscape of hair follicle on regeneration. Revealed that DP cells differentiate into four intermediate cell states at different periodicity: Intermediate-cell-10 showed important functions in the growth and maintenance of cashmere; intermediate-cell-1 acting on apoptosis and cashmere shedding; intermediate-cell-0 initiated new follicular cycles, the migration of hair follicles and the occurrence of cashmere; and intermediate-cell-15 are suggested to be DP progenitor cells. In general, we provide new insights for hair regrowth. At the same time, it provides a new research ideas, directions and molecular landscape for the mechanism of dermal papilla cells.


Assuntos
Cabras , Folículo Piloso , Animais , Diferenciação Celular/genética , Cabras/genética , Cabras/metabolismo , Cabelo , Regeneração/genética
19.
Glia ; 70(10): 1938-1949, 2022 10.
Artigo em Inglês | MEDLINE | ID: mdl-35735919

RESUMO

Morphological and emerging molecular studies have provided evidence for heterogeneity within the oligodendrocyte population. To address the regional and age-related heterogeneity of human mature oligodendrocytes (MOLs) we applied single-cell RNA sequencing to cells isolated from cortical/subcortical, subventricular zone brain tissue samples, and thoracolumbar spinal cord samples. Unsupervised clustering of cells identified transcriptionally distinct MOL subpopulations across regions. Spinal cord MOLs, but not microglia, exhibited cell-type-specific upregulation of immune-related markers compared to the other adult regions. SVZ MOLs showed an upregulation of select number of development-linked transcription factors compared to other regions; however, pseudotime trajectory analyses did not identify a global developmental difference. Age-related analysis of cortical/subcortical samples indicated that pediatric MOLs, especially from under age 5, retain higher expression of genes linked to development and to immune activity with pseudotime analysis favoring a distinct developmental stage. Our regional and age-related studies indicate heterogeneity of MOL populations in the human CNS that may reflect developmental and environmental influences.


Assuntos
Oligodendroglia , Medula Espinal , Encéfalo , Criança , Pré-Escolar , Humanos , Microglia , Oligodendroglia/metabolismo
20.
BMC Genomics ; 23(1): 56, 2022 Jan 15.
Artigo em Inglês | MEDLINE | ID: mdl-35033004

RESUMO

BACKGROUND: Pseudotime estimation from dynamic single-cell transcriptomic data enables characterisation and understanding of the underlying processes, for example developmental processes. Various pseudotime estimation methods have been proposed during the last years. Typically, these methods start with a dimension reduction step because the low-dimensional representation is usually easier to analyse. Approaches such as PCA, ICA or t-SNE belong to the most widely used methods for dimension reduction in pseudotime estimation methods. However, these methods usually make assumptions on the derived dimensions, which can result in important dataset properties being missed. In this paper, we suggest a new dictionary learning based approach, dynDLT, for dimension reduction and pseudotime estimation of dynamic transcriptomic data. Dictionary learning is a matrix factorisation approach that does not restrict the dependence of the derived dimensions. To evaluate the performance, we conduct a large simulation study and analyse 8 real-world datasets. RESULTS: The simulation studies reveal that firstly, dynDLT preserves the simulated patterns in low-dimension and the pseudotimes can be derived from the low-dimensional representation. Secondly, the results show that dynDLT is suitable for the detection of genes exhibiting the simulated dynamic patterns, thereby facilitating the interpretation of the compressed representation and thus the dynamic processes. For the real-world data analysis, we select datasets with samples that are taken at different time points throughout an experiment. The pseudotimes found by dynDLT have high correlations with the experimental times. We compare the results to other approaches used in pseudotime estimation, or those that are method-wise closely connected to dictionary learning: ICA, NMF, PCA, t-SNE, and UMAP. DynDLT has the best overall performance for the simulated and real-world datasets. CONCLUSIONS: We introduce dynDLT, a method that is suitable for pseudotime estimation. Its main advantages are: (1) It presents a model-free approach, meaning that it does not restrict the dependence of the derived dimensions; (2) Genes that are relevant in the detected dynamic processes can be identified from the dictionary matrix; (3) By a restriction of the dictionary entries to positive values, the dictionary atoms are highly interpretable.


Assuntos
Algoritmos , Transcriptoma , Simulação por Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA