RESUMEN
Despite a high response rate in chimeric antigen receptor (CAR) T cell therapy for acute lymphocytic leukaemia (ALL)1-3, approximately 50% of patients relapse within the first year4-6, representing an urgent question to address in the next stage of cellular immunotherapy. Here, to investigate the molecular determinants of ultralong CAR T cell persistence, we obtained a single-cell multi-omics atlas from 695,819 pre-infusion CAR T cells at the basal level or after CAR-specific stimulation from 82 paediatric patients with ALL enrolled in the first two CAR T ALL clinical trials and 6 healthy donors. We identified that elevated type 2 functionality in CAR T infusion products is significantly associated with patients maintaining a median B cell aplasia duration of 8.4 years. Analysis of ligand-receptor interactions revealed that type 2 cells regulate a dysfunctional subset to maintain whole-population homeostasis, and the addition of IL-4 during antigen-specific activation alleviates CAR T cell dysfunction while enhancing fitness at both transcriptomic and epigenomic levels. Serial proteomic profiling of sera after treatment revealed a higher level of circulating type 2 cytokines in 5-year or 8-year relapse-free responders. In a leukaemic mouse model, type 2high CAR T cell products demonstrated superior expansion and antitumour activity, particularly after leukaemia rechallenge. Restoring antitumour efficacy in type 2low CAR T cells was attainable by enhancing their type 2 functionality, either through incorporating IL-4 into the manufacturing process or by priming manufactured CAR T products with IL-4 before infusion. Our findings provide insights into the mediators of durable CAR T therapy response and suggest potential therapeutic strategies to sustain long-term remission by boosting type 2 functionality in CAR T cells.
Asunto(s)
Inmunoterapia Adoptiva , Leucemia-Linfoma Linfoblástico de Células Precursoras , Receptores Quiméricos de Antígenos , Inducción de Remisión , Análisis de la Célula Individual , Humanos , Leucemia-Linfoma Linfoblástico de Células Precursoras/terapia , Leucemia-Linfoma Linfoblástico de Células Precursoras/inmunología , Ratones , Animales , Receptores Quiméricos de Antígenos/inmunología , Receptores Quiméricos de Antígenos/metabolismo , Niño , Interleucina-4/metabolismo , Linfocitos T/inmunología , Femenino , Masculino , Citocinas/metabolismo , Proteómica , Modelos Animales de Enfermedad , Factores de Tiempo , RecurrenciaRESUMEN
The Human Developmental Cell Atlas (HDCA) initiative, which is part of the Human Cell Atlas, aims to create a comprehensive reference map of cells during development. This will be critical to understanding normal organogenesis, the effect of mutations, environmental factors and infectious agents on human development, congenital and childhood disorders, and the cellular basis of ageing, cancer and regenerative medicine. Here we outline the HDCA initiative and the challenges of mapping and modelling human development using state-of-the-art technologies to create a reference atlas across gestation. Similar to the Human Genome Project, the HDCA will integrate the output from a growing community of scientists who are mapping human development into a unified atlas. We describe the early milestones that have been achieved and the use of human stem-cell-derived cultures, organoids and animal models to inform the HDCA, especially for prenatal tissues that are hard to acquire. Finally, we provide a roadmap towards a complete atlas of human development.
Asunto(s)
Movimiento Celular , Rastreo Celular , Células/citología , Biología Evolutiva/métodos , Embrión de Mamíferos/citología , Feto/citología , Difusión de la Información , Organogénesis , Adulto , Animales , Atlas como Asunto , Técnicas de Cultivo de Célula , Supervivencia Celular , Visualización de Datos , Femenino , Humanos , Imagenología Tridimensional , Masculino , Modelos Animales , Organogénesis/genética , Organoides/citología , Células Madre/citologíaRESUMEN
Single-cell RNA sequencing offers snapshots of whole transcriptomes but obscures the temporal RNA dynamics. Here we present single-cell metabolically labeled new RNA tagging sequencing (scNT-seq), a method for massively parallel analysis of newly transcribed and pre-existing mRNAs from the same cell. This droplet microfluidics-based method enables high-throughput chemical conversion on barcoded beads, efficiently marking newly transcribed mRNAs with T-to-C substitutions. Using scNT-seq, we jointly profiled new and old transcriptomes in ~55,000 single cells. These data revealed time-resolved transcription factor activities and cell-state trajectories at the single-cell level in response to neuronal activation. We further determined rates of RNA biogenesis and decay to uncover RNA regulatory strategies during stepwise conversion between pluripotent and rare totipotent two-cell embryo (2C)-like stem cell states. Finally, integrating scNT-seq with genetic perturbation identifies DNA methylcytosine dioxygenase as an epigenetic barrier into the 2C-like cell state. Time-resolved single-cell transcriptomic analysis thus opens new lines of inquiry regarding cell-type-specific RNA regulatory mechanisms.
Asunto(s)
Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ARN/métodos , Animales , Línea Celular , Embrión de Mamíferos , Fibroblastos/efectos de los fármacos , Fibroblastos/metabolismo , Regulación de la Expresión Génica , Humanos , Ratones , Neuronas/metabolismo , Análisis de Componente Principal , ARN Mensajero , Análisis de la Célula Individual , Factores de TiempoRESUMEN
The prevailing paradigm for the analysis of biological data involves comparing groups of replicates from different conditions (e.g. control and treatment) to statistically infer features that discriminate them (e.g. differentially expressed genes). However, many situations in modern genomics such as single-cell omics experiments do not fit well into this paradigm because they lack true replicates. In such instances, spectral techniques could be used to rank features according to their degree of consistency with an underlying metric structure without the need to cluster samples. Here, we extend spectral methods for feature selection to abstract simplicial complexes and present a general framework for clustering-independent analysis. Combinatorial Laplacian scores take into account the topology spanned by the data and reduce to the ordinary Laplacian score when restricted to graphs. We demonstrate the utility of this framework with several applications to the analysis of gene expression and multi-modal genomic data. Specifically, we perform differential expression analysis in situations where samples cannot be grouped into distinct classes, and we disaggregate differentially expressed genes according to the topology of the expression space (e.g. alternative paths of differentiation). We also apply this formalism to identify genes with spatial patterns of expression using fluorescence in-situ hybridization data and to establish associations between genetic alterations and global expression patterns in large cross-sectional studies. Our results provide a unifying perspective on topological data analysis and manifold learning approaches to the analysis of large-scale biological datasets.
Asunto(s)
Biología Computacional/métodos , Análisis de Secuencia por Matrices de Oligonucleótidos/métodos , Algoritmos , Análisis por Conglomerados , Perfilación de la Expresión Génica/métodos , Genómica/métodos , Modelos TeóricosRESUMEN
Amidst the growing literature on cancer genomics and intratumor heterogeneity, essential principles in evolutionary biology recur time and time again. Here we use these principles to guide the reader through major advances in cancer research, highlighting issues of "hit hard, hit early" treatment strategies, drug resistance, and metastasis. We distinguish between two frameworks for understanding heterogeneous tumors, both of which can inform treatment strategies: (1) The tumor as diverse ecosystem, a Darwinian population of sometimes-competing, sometimes-cooperating cells; (2) The tumor as tightly integrated, self-regulating organ, which may hijack developmental signals to restore functional heterogeneity after treatment. While the first framework dominates literature on cancer evolution, the second framework enjoys support as well. Throughout this review, we illustrate how mathematical models inform understanding of tumor progression and treatment outcomes. Connecting models to genomic data faces computational and technical hurdles, but high-throughput single-cell technologies show promise to clear these hurdles. This article is part of a Special Issue entitled: Evolutionary principles - heterogeneity in cancer?, edited by Dr. Robert A. Gatenby.
Asunto(s)
Biomarcadores de Tumor/genética , Transformación Celular Neoplásica/genética , Evolución Molecular , Aptitud Genética , Neoplasias/genética , Microambiente Tumoral , Adaptación Fisiológica , Animales , Antineoplásicos/uso terapéutico , Biomarcadores de Tumor/metabolismo , Transformación Celular Neoplásica/metabolismo , Transformación Celular Neoplásica/patología , Resistencia a Antineoplásicos/genética , Regulación Neoplásica de la Expresión Génica , Predisposición Genética a la Enfermedad , Herencia , Humanos , Modelos Genéticos , Mutación , Neoplasias/tratamiento farmacológico , Neoplasias/metabolismo , Neoplasias/patología , Linaje , Fenotipo , Transducción de Señal/genética , Factores de TiempoRESUMEN
The recent explosion of genomic data has underscored the need for interpretable and comprehensive analyses that can capture complex phylogenetic relationships within and across species. Recombination, reassortment and horizontal gene transfer constitute examples of pervasive biological phenomena that cannot be captured by tree-like representations. Starting from hundreds of genomes, we are interested in the reconstruction of potential evolutionary histories leading to the observed data. Ancestral recombination graphs represent potential histories that explicitly accommodate recombination and mutation events across orthologous genomes. However, they are computationally costly to reconstruct, usually being infeasible for more than few tens of genomes. Recently, Topological Data Analysis (TDA) methods have been proposed as robust and scalable methods that can capture the genetic scale and frequency of recombination. We build upon previous TDA developments for detecting and quantifying recombination, and present a novel framework that can be applied to hundreds of genomes and can be interpreted in terms of minimal histories of mutation and recombination events, quantifying the scales and identifying the genomic locations of recombinations. We implement this framework in a software package, called TARGet, and apply it to several examples, including small migration between different populations, human recombination, and horizontal evolution in finches inhabiting the Galápagos Islands.
Asunto(s)
Biología Computacional/métodos , Evolución Molecular , Recombinación Genética/genética , Programas Informáticos , Animales , Pinzones/genética , Transferencia de Gen Horizontal , Humanos , Modelos Genéticos , FilogeniaRESUMEN
Single-cell RNA-sequencing has transformed the study of biological tissues by enabling transcriptomic characterizations of their constituent cell states. Computational methods for gene expression deconvolution use this information to infer the cell composition of related tissues profiled at the bulk level. However, current deconvolution methods are restricted to discrete cell types and have limited power to make inferences about continuous cellular processes like cell differentiation or immune cell activation. We present ConDecon, a clustering-independent method for inferring the likelihood for each cell in a single-cell dataset to be present in a bulk tissue. ConDecon represents an improvement in functionality and accuracy with respect to current deconvolution methods. Using ConDecon, we discover the implication of neurodegenerative microglial inflammatory pathways in the mesenchymal transformation of ependymoma, recapitulate spatial patterns of cell differentiation during zebrafish embryogenesis, and make temporal inferences from bulk ATAC-seq data. Overall, ConDecon significantly enhances our understanding of dynamic cellular processes within bulk tissue samples.
RESUMEN
High-resolution imaging has revolutionized the study of single cells in their spatial context. However, summarizing the great diversity of complex cell shapes found in tissues and inferring associations with other single-cell data remains a challenge. Here, we present CAJAL, a general computational framework for the analysis and integration of single-cell morphological data. By building upon metric geometry, CAJAL infers cell morphology latent spaces where distances between points indicate the amount of physical deformation required to change the morphology of one cell into that of another. We show that cell morphology spaces facilitate the integration of single-cell morphological data across technologies and the inference of relations with other data, such as single-cell transcriptomic data. We demonstrate the utility of CAJAL with several morphological datasets of neurons and glia and identify genes associated with neuronal plasticity in C. elegans. Our approach provides an effective strategy for integrating cell morphology data into single-cell omics analyses.
Asunto(s)
Caenorhabditis elegans , Neuronas , Animales , Caenorhabditis elegans/genética , Perfilación de la Expresión Génica , TranscriptomaRESUMEN
The thalamus is the principal information hub of the vertebrate brain, with essential roles in sensory and motor information processing, attention, and memory. The complex array of thalamic nuclei develops from a restricted pool of neural progenitors. We apply longitudinal single-cell RNA sequencing and regional abrogation of Sonic hedgehog (Shh) to map the developmental trajectories of thalamic progenitors, intermediate progenitors, and post-mitotic neurons as they coalesce into distinct thalamic nuclei. These data reveal that the complex architecture of the thalamus is established early during embryonic brain development through the coordinated action of four cell differentiation lineages derived from Shh-dependent and -independent progenitors. We systematically characterize the gene expression programs that define these thalamic lineages across time and demonstrate how their disruption upon Shh depletion causes pronounced locomotor impairment resembling infantile Parkinson's disease. These results reveal key principles of thalamic development and provide mechanistic insights into neurodevelopmental disorders resulting from thalamic dysfunction.
Asunto(s)
Tálamo , Tálamo/citologíaRESUMEN
Pediatric ependymoma is a devastating brain cancer marked by its relapsing pattern and lack of effective chemotherapies. This shortage of treatments is due to limited knowledge about ependymoma tumorigenic mechanisms. By means of single-nucleus chromatin accessibility and gene expression profiling of posterior fossa primary tumors and distal metastases, we reveal key transcription factors and enhancers associated with the differentiation of ependymoma tumor cells into tumor-derived cell lineages and their transition into a mesenchymal-like state. We identify NFκB, AP-1, and MYC as mediators of this transition, and show that the gene expression profiles of tumor cells and infiltrating microglia are consistent with abundant pro-inflammatory signaling between these populations. In line with these results, both TGF-ß1 and TNF-α induce the expression of mesenchymal genes on a patient-derived cell model, and TGF-ß1 leads to an invasive phenotype. Altogether, these data suggest that tumor gliosis induced by inflammatory cytokines and oxidative stress underlies the mesenchymal phenotype of posterior fossa ependymoma.
Asunto(s)
Ependimoma , Factor de Crecimiento Transformador beta1 , Citocinas/genética , Citocinas/farmacología , Ependimoma/genética , Transición Epitelial-Mesenquimal/genética , Humanos , Recurrencia Local de Neoplasia , Factor de Crecimiento Transformador beta1/metabolismoRESUMEN
A notable number of acute lymphoblastic leukemia (ALL) patients develop CD19-positive relapse within 1 year after receiving chimeric antigen receptor (CAR) T cell therapy. It remains unclear if the long-term response is associated with the characteristics of CAR T cells in infusion products, hindering the identification of biomarkers to predict therapeutic outcomes. Here, we present 101,326 single-cell transcriptomes and surface protein landscape from the infusion products of 12 ALL patients. We observed substantial heterogeneity in the antigen-specific activation states, among which a deficiency of T helper 2 function was associated with CD19-positive relapse compared with durable responders (remission, >54 months). Proteomic data revealed that the frequency of early memory T cells, rather than activation or coinhibitory signatures, could distinguish the relapse. These findings were corroborated by independent functional profiling of 49 patients, and an integrative model was developed to predict the response. Our data unveil the molecular mechanisms that may inform strategies to boost specific T cell function to maintain long-term remission.
Asunto(s)
Inmunoterapia Adoptiva , Leucemia-Linfoma Linfoblástico de Células Precursoras , Antígenos CD19 , Humanos , Leucemia-Linfoma Linfoblástico de Células Precursoras/terapia , Proteómica , Receptores Quiméricos de Antígenos/metabolismo , RecurrenciaRESUMEN
Highly multiplexed immunohistochemistry (mIHC) enables the staining and quantification of dozens of antigens in a tissue section with single-cell resolution. However, annotating cell populations that differ little in the profiled antigens or for which the antibody panel does not include specific markers is challenging. To overcome this obstacle, we have developed an approach for enriching mIHC images with single-cell RNA sequencing data, building upon recent experimental procedures for augmenting single-cell transcriptomes with concurrent antigen measurements. Spatially-resolved Transcriptomics via Epitope Anchoring (STvEA) performs transcriptome-guided annotation of highly multiplexed cytometry datasets. It increases the level of detail in histological analyses by enabling the systematic annotation of nuanced cell populations, spatial patterns of transcription, and interactions between cell types. We demonstrate the utility of STvEA by uncovering the architecture of poorly characterized cell types in the murine spleen using published cytometry and mIHC data of this organ.
Asunto(s)
Análisis de la Célula Individual , Transcriptoma , Animales , Inmunohistoquímica , Ratones , Coloración y Etiquetado , Secuenciación del ExomaRESUMEN
BACKGROUND: Autologous T cells engineered to express a chimeric antigen receptor (CAR) specific for CD19 molecule have transformed the therapeutic landscape in patients with highly refractory leukemia and lymphoma, and the use of donor-generated allogeneic CAR T is paving the way for further breakthroughs in the treatment of cancer. However, it remains unknown how the intrinsic heterogeneities of these engineered cells mediate therapeutic efficacy and whether allogeneic products match the effectiveness of autologous therapies. METHODS: Using single-cell mRNA sequencing in conjunction with CITE-seq, we performed multiomics characterization of CAR T cells generated from healthy donor and patients with acute lymphoblastic leukemia. CAR T cells used in this study were manufactured at the University of Pennsylvania through lentiviral transduction with a CD19-4-1BB-CD3ζ construct. Besides the baseline condition, we engineered NIH-3T3 cells with human CD19 or mesothelin expression to conduct ex vivo antigen-specific or non-antigen stimulation of CAR T cells through 6-hour coculture at a 1:1 ratio. RESULTS: We delineated the global cellular and molecular CAR T landscape and identified that transcriptional CAR tonic signaling was regulated by a mixture of early activation, exhaustion signatures, and cytotoxic activities. On CD19 stimulation, we illuminated the disparities of CAR T cells derived from different origins and found that donor CAR T had more pronounced activation level in correlation with the upregulation of major histocompatibility complex class II genes compared with patient CAR T cells. This finding was independently validated in additional datasets from literature. Furthermore, GM-CSF(CSF2) expression was found to be associated with functional gene productions, but it induced little impact on the CAR T activation. CONCLUSIONS: Through integrated multiomics profiling and unbiased canonical pathway analyses, our results unveil heterogeneities in the transcriptional, phenotypic, functional, and metabolic profiles of donor and patient CAR T cells, providing mechanistic basis for ameliorating clinical outcomes and developing next-generation 'off- the-shelf' allogeneic products.
Asunto(s)
Antígenos CD19/genética , Perfilación de la Expresión Génica , Inmunoterapia Adoptiva , Activación de Linfocitos/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/terapia , Receptores Quiméricos de Antígenos/genética , Análisis de la Célula Individual , Linfocitos T/trasplante , Transcriptoma , Animales , Antígenos CD19/inmunología , Antígenos CD19/metabolismo , Estudios de Casos y Controles , Línea Celular Tumoral , Técnicas de Cocultivo , Citotoxicidad Inmunológica/genética , Humanos , Ratones , Células 3T3 NIH , Fenotipo , Leucemia-Linfoma Linfoblástico de Células Precursoras/genética , Leucemia-Linfoma Linfoblástico de Células Precursoras/inmunología , Leucemia-Linfoma Linfoblástico de Células Precursoras/metabolismo , RNA-Seq , Receptores Quiméricos de Antígenos/inmunología , Receptores Quiméricos de Antígenos/metabolismo , Linfocitos T/inmunología , Linfocitos T/metabolismoRESUMEN
The anterior pituitary gland drives highly conserved physiologic processes in mammalian species. These hormonally controlled processes are central to somatic growth, pubertal transformation, fertility, lactation, and metabolism. Current cellular models of mammalian anteiror pituitary, largely built on candidate gene based immuno-histochemical and mRNA analyses, suggest that each of the seven hormones synthesized by the pituitary is produced by a specific and exclusive cell lineage. However, emerging evidence suggests more complex relationship between hormone specificity and cell plasticity. Here we have applied massively parallel single-cell RNA sequencing (scRNA-seq), in conjunction with complementary imaging-based single-cell analyses of mRNAs and proteins, to systematically map both cell-type diversity and functional state heterogeneity in adult male and female mouse pituitaries at single-cell resolution and in the context of major physiologic demands. These quantitative single-cell analyses reveal sex-specific cell-type composition under normal pituitary homeostasis, identify an array of cells associated with complex complements of hormone-enrichment, and undercover non-hormone producing interstitial and supporting cell-types. Interestingly, we also identified a Pou1f1-expressing cell population that is characterized by a unique multi-hormone gene expression profile. In response to two well-defined physiologic stresses, dynamic shifts in cellular diversity and transcriptome profiles were observed for major hormone producing and the putative multi-hormone cells. These studies reveal unanticipated cellular complexity and plasticity in adult pituitary, and provide a rich resource for further validating and expanding our molecular understanding of pituitary gene expression programs and hormone production.
Asunto(s)
Plasticidad de la Célula/genética , Hipófisis/citología , Hipófisis/metabolismo , ARN Mensajero/genética , RNA-Seq , Caracteres Sexuales , Análisis de la Célula Individual , Transcriptoma , Animales , Femenino , Homeostasis/genética , Masculino , Ratones , Ratones TransgénicosRESUMEN
Large-scale cancer genomic studies enable the systematic identification of mutations that lead to the genesis and progression of tumors, uncovering the underlying molecular mechanisms and potential therapies. While some such mutations are recurrently found in many tumors, many others exist solely within a few samples, precluding detection by conventional recurrence-based statistical approaches. Integrated analysis of somatic mutations and RNA expression data across 12 tumor types reveals that mutations of cancer genes are usually accompanied by substantial changes in expression. We use topological data analysis to leverage this observation and uncover 38 elusive candidate cancer-associated genes, including inactivating mutations of the metalloproteinase ADAMTS12 in lung adenocarcinoma. We show that ADAMTS12-/- mice have a five-fold increase in the susceptibility to develop lung tumors, confirming the role of ADAMTS12 as a tumor suppressor gene. Our results demonstrate that data integration through topological techniques can increase our ability to identify previously unreported cancer-related alterations.
Asunto(s)
Proteínas ADAMTS/genética , Adenocarcinoma del Pulmón/genética , Predisposición Genética a la Enfermedad/genética , Neoplasias Pulmonares/genética , Animales , Línea Celular Tumoral , Biología Computacional/métodos , Análisis de Datos , Ratones , Ratones Endogámicos C57BL , Ratones Noqueados , Mutación/genética , Recurrencia Local de Neoplasia/genética , Oncogenes/genéticaRESUMEN
Personalized medicine is being realized by our ability to measure biological and environmental information about patients. Much of these data are being stored in electronic health records yielding big data that presents challenges for its management and analysis. Here, we review several areas of knowledge that are necessary for next-generation scientists to fully realize the potential of biomedical big data. We begin with an overview of big data and its storage and management. We then review statistics and data science as foundational topics followed by a core curriculum of artificial intelligence, machine learning and natural language processing that are needed to develop predictive models for clinical decision making. We end with some specific training recommendations for preparing next-generation scientists for biomedical big data.
Asunto(s)
Ciencia de los Datos/métodos , Medicina de Precisión/métodos , Macrodatos , Toma de Decisiones Clínicas , Minería de Datos , Registros Electrónicos de Salud , Humanos , Aprendizaje Automático , Procesamiento de Lenguaje NaturalRESUMEN
Acute kidney injury (AKI) currently is diagnosed by a temporal trend of a single blood analyte: serum creatinine. This measurement is neither sensitive nor specific to kidney injury or its protean forms. Newer biomarkers, neutrophil gelatinase-associated lipocalin (NGAL, Lipocalin 2, Siderocalin), or kidney injury molecule-1 (KIM-1, Hepatitis A Virus Cellular Receptor 1), accelerate the diagnosis of AKI as well as prospectively distinguish rapidly reversible from prolonged causes of serum creatinine increase. Nonetheless, these biomarkers lack the capacity to subfractionate AKI further (eg, sepsis versus ischemia versus nephrotoxicity from medications, enzymes, or metals) or inform us about the primary and secondary sites of injury. It also is unknown whether all nephrons are injured in AKI, whether all cells in a nephron are affected, and whether injury responses can be stimulus-specific or cell type-specific or both. In this review, we summarize fully agnostic tissue interrogation approaches that may help to redefine AKI in cellular and molecular terms, including single-cell and single-nuclei RNA sequencing technology. These approaches will empower a shift in the current paradigm of AKI diagnosis, classification, and staging, and provide the renal community with a significant advance toward precision medicine in the analysis AKI.
Asunto(s)
Lesión Renal Aguda/diagnóstico , Medicina de Precisión , Lesión Renal Aguda/genética , Lesión Renal Aguda/patología , Creatinina/sangre , Humanos , Sitios de Carácter Cuantitativo , Análisis de Secuencia de ARNRESUMEN
Topological methods are emerging as a new set of tools for the analysis of large genomic datasets. They are mathematically grounded methods that extract information from the geometric structure of data. In the last few years, applications to evolutionary biology, cancer genomics, and the analysis of complex diseases have uncovered significant biological results, highlighting their utility for fulfilling some of the current analytic needs of genomics. In this review, the state of the art in the application of topological methods to genomics is summarized, and some of the present limitations and possible future developments are reviewed.
RESUMEN
Transcriptional programs control cellular lineage commitment and differentiation during development. Understanding of cell fate has been advanced by studying single-cell RNA-sequencing (RNA-seq) but is limited by the assumptions of current analytic methods regarding the structure of data. We present single-cell topological data analysis (scTDA), an algorithm for topology-based computational analyses to study temporal, unbiased transcriptional regulation. Unlike other methods, scTDA is a nonlinear, model-independent, unsupervised statistical framework that can characterize transient cellular states. We applied scTDA to the analysis of murine embryonic stem cell (mESC) differentiation in vitro in response to inducers of motor neuron differentiation. scTDA resolved asynchrony and continuity in cellular identity over time and identified four transient states (pluripotent, precursor, progenitor, and fully differentiated cells) based on changes in stage-dependent combinations of transcription factors, RNA-binding proteins, and long noncoding RNAs (lncRNAs). scTDA can be applied to study asynchronous cellular responses to either developmental cues or environmental perturbations.
Asunto(s)
Algoritmos , Diferenciación Celular/genética , Células Madre Embrionarias/fisiología , ARN/genética , Análisis de Secuencia de ARN/métodos , Transcripción Genética/genética , Animales , Células Cultivadas , Células Madre Embrionarias/citología , Regulación de la Expresión Génica/genética , Ratones , Neuronas Motoras/citología , Neuronas Motoras/fisiología , Análisis de la Célula Individual/métodos , Activación Transcripcional/genéticaRESUMEN
Precision medicine in cancer proposes that genomic characterization of tumors can inform personalized targeted therapies. However, this proposition is complicated by spatial and temporal heterogeneity. Here we study genomic and expression profiles across 127 multisector or longitudinal specimens from 52 individuals with glioblastoma (GBM). Using bulk and single-cell data, we find that samples from the same tumor mass share genomic and expression signatures, whereas geographically separated, multifocal tumors and/or long-term recurrent tumors are seeded from different clones. Chemical screening of patient-derived glioma cells (PDCs) shows that therapeutic response is associated with genetic similarity, and multifocal tumors that are enriched with PIK3CA mutations have a heterogeneous drug-response pattern. We show that targeting truncal events is more efficacious than targeting private events in reducing the tumor burden. In summary, this work demonstrates that evolutionary inference from integrated genomic analysis in multisector biopsies can inform targeted therapeutic interventions for patients with GBM.