RESUMO
Discovery of genomic safe harbor sites (SHSs) is fundamental for multiple transgene integrations, such as reporter genes, chimeric antigen receptors (CARs), and safety switches, which are required for safe cell products for regenerative cell therapies and immunotherapies. Here we identified and characterized potential SHS in human cells. Using the CRISPR-MAD7 system, we integrated transgenes at these sites in induced pluripotent stem cells (iPSCs), primary T and natural killer (NK) cells, and Jurkat cell line, and demonstrated efficient and stable expression at these loci. Subsequently, we validated the differentiation potential of engineered iPSC toward CD34+ hematopoietic stem and progenitor cells (HSPCs), lymphoid progenitor cells (LPCs), and NK cells and showed that transgene expression was perpetuated in these lineages. Finally, we demonstrated that engineered iPSC-derived NK cells retained expression of a non-virally integrated anti-CD19 CAR, suggesting that several of the investigated SHSs can be used to engineer cells for adoptive immunotherapies.
RESUMO
Single-cell resolution analysis of complex biological tissues is fundamental to capture cell-state heterogeneity and distinct cellular signaling patterns that remain obscured with population-based techniques. The limited amount of material encapsulated in a single cell however, raises significant technical challenges to molecular profiling. Due to extensive optimization efforts, single-cell proteomics by Mass Spectrometry (scp-MS) has emerged as a powerful tool to facilitate proteome profiling from ultra-low amounts of input, although further development is needed to realize its full potential. To this end, we carry out comprehensive analysis of orbitrap-based data-independent acquisition (DIA) for limited material proteomics. Notably, we find a fundamental difference between optimal DIA methods for high- and low-load samples. We further improve our low-input DIA method by relying on high-resolution MS1 quantification, thus enhancing sensitivity by more efficiently utilizing available mass analyzer time. With our ultra-low input tailored DIA method, we are able to accommodate long injection times and high resolution, while keeping the scan cycle time low enough to ensure robust quantification. Finally, we demonstrate the capability of our approach by profiling mouse embryonic stem cell culture conditions, showcasing heterogeneity in global proteomes and highlighting distinct differences in key metabolic enzyme expression in distinct cell subclusters.
Assuntos
Células-Tronco Embrionárias Murinas , Proteômica , Animais , Camundongos , Espectrometria de Massas , Proteoma , Análise de Célula ÚnicaRESUMO
Accurate prediction of gene regulatory rules is important towards understanding of cellular processes. Existing computational algorithms devised for bulk transcriptomics typically require a large number of time points to infer gene regulatory networks (GRNs), are applicable for a small number of genes and fail to detect potential causal relationships effectively. Here, we propose a novel approach 'TENET' to reconstruct GRNs from single cell RNA sequencing (scRNAseq) datasets. Employing transfer entropy (TE) to measure the amount of causal relationships between genes, TENET predicts large-scale gene regulatory cascades/relationships from scRNAseq data. TENET showed better performance than other GRN reconstructors, in identifying key regulators from public datasets. Specifically from scRNAseq, TENET identified key transcriptional factors in embryonic stem cells (ESCs) and during direct cardiomyocytes reprogramming, where other predictors failed. We further demonstrate that known target genes have significantly higher TE values, and TENET predicted higher TE genes were more influenced by the perturbation of their regulator. Using TENET, we identified and validated that Nme2 is a culture condition specific stem cell factor. These results indicate that TENET is uniquely capable of identifying key regulators from scRNAseq data.
Assuntos
Algoritmos , Biologia Computacional/métodos , Entropia , Redes Reguladoras de Genes , Análise de Célula Única/métodos , Transcriptoma , Fosfatase Alcalina/metabolismo , Animais , Proliferação de Células/genética , Perfilação da Expressão Gênica/métodos , Ontologia Genética , Camundongos , Células-Tronco Embrionárias Murinas/citologia , Células-Tronco Embrionárias Murinas/metabolismo , Análise de Sequência de RNA/métodos , Fatores de Transcrição/genética , Fatores de Transcrição/metabolismoRESUMO
BACKGROUND AND AIMS: Hepatic sinusoidal cells are known actors in the fibrogenic response to injury. Activated hepatic stellate cells (HSCs), liver sinusoidal endothelial cells, and Kupffer cells are responsible for sinusoidal capillarization and perisinusoidal matrix deposition, impairing vascular exchange and heightening the risk of advanced fibrosis. While the overall pathogenesis is well understood, functional relations between cellular transitions during fibrogenesis are only beginning to be resolved. At single-cell resolution, we here explored the heterogeneity of individual cell types and dissected their transitions and crosstalk during fibrogenesis. APPROACH AND RESULTS: We applied single-cell transcriptomics to map the heterogeneity of sinusoid-associated cells in healthy and injured livers and reconstructed the single-lineage HSC trajectory from pericyte to myofibroblast. Stratifying each sinusoidal cell population by activation state, we projected shifts in sinusoidal communication upon injury. Weighted gene correlation network analysis of the HSC trajectory led to the identification of core genes whose expression proved highly predictive of advanced fibrosis in patients with nonalcoholic steatohepatitis (NASH). Among the core members of the injury-repressed gene module, we identified plasmalemma vesicle-associated protein (PLVAP) as a protein amply expressed by mouse and human HSCs. PLVAP expression was suppressed in activated HSCs upon injury and may hence define hitherto unknown roles for HSCs in the regulation of microcirculatory exchange and its breakdown in chronic liver disease. CONCLUSIONS: Our study offers a single-cell resolved account of drug-induced injury of the mammalian liver and identifies key genes that may serve important roles in sinusoidal integrity and as markers of advanced fibrosis in human NASH.
Assuntos
Doença Hepática Induzida por Substâncias e Drogas/patologia , Células Endoteliais/patologia , Redes Reguladoras de Genes , Cirrose Hepática/genética , Hepatopatia Gordurosa não Alcoólica/patologia , Animais , Biópsia , Capilares/citologia , Capilares/patologia , Tetracloreto de Carbono/administração & dosagem , Tetracloreto de Carbono/toxicidade , Doença Hepática Induzida por Substâncias e Drogas/etiologia , Modelos Animais de Doenças , Endotélio Vascular/citologia , Endotélio Vascular/patologia , Feminino , Veias Hepáticas/citologia , Veias Hepáticas/patologia , Humanos , Fígado/irrigação sanguínea , Fígado/patologia , Cirrose Hepática/patologia , Proteínas de Membrana/genética , Camundongos , Camundongos Transgênicos , Análise de Sequência com Séries de Oligonucleotídeos , RNA-Seq , Análise de Célula ÚnicaRESUMO
The endothelial to haematopoietic transition (EHT) is the process whereby haemogenic endothelium differentiates into haematopoietic stem and progenitor cells (HSPCs). The intermediary steps of this process are unclear, in particular the identity of endothelial cells that give rise to HSPCs is unknown. Using single-cell transcriptome analysis and antibody screening, we identify CD44 as a marker of EHT enabling us to isolate robustly the different stages of EHT in the aorta-gonad-mesonephros (AGM) region. This allows us to provide a detailed phenotypical and transcriptional profile of CD44-positive arterial endothelial cells from which HSPCs emerge. They are characterized with high expression of genes related to Notch signalling, TGFbeta/BMP antagonists, a downregulation of genes related to glycolysis and the TCA cycle, and a lower rate of cell cycle. Moreover, we demonstrate that by inhibiting the interaction between CD44 and its ligand hyaluronan, we can block EHT, identifying an additional regulator of HSPC development.
Assuntos
Biomarcadores , Endotélio/metabolismo , Células-Tronco Hematopoéticas/metabolismo , Receptores de Hialuronatos/metabolismo , Transcriptoma , Animais , Aorta , Artérias , Ciclo Celular , Ciclo do Ácido Cítrico/genética , Biologia Computacional , Subunidade alfa 2 de Fator de Ligação ao Core/genética , Regulação para Baixo , Glicólise/genética , Gônadas , Hematopoese/fisiologia , Receptores de Hialuronatos/sangue , Receptores de Hialuronatos/genética , Ácido Hialurônico , Mesonefro , Camundongos , Camundongos Endogâmicos C57BL , Camundongos Knockout , Fator de Crescimento Transformador beta/metabolismoRESUMO
Single-cell RNA-seq enables the quantitative characterization of cell types based on global transcriptome profiles. We present single-cell consensus clustering (SC3), a user-friendly tool for unsupervised clustering, which achieves high accuracy and robustness by combining multiple clustering solutions through a consensus approach (http://bioconductor.org/packages/SC3). We demonstrate that SC3 is capable of identifying subclones from the transcriptomes of neoplastic cells collected from patients.
Assuntos
Perfilação da Expressão Gênica/métodos , Sequenciamento de Nucleotídeos em Larga Escala/métodos , Análise de Sequência de RNA/métodos , Análise de Célula Única/métodos , Análise por Conglomerados , Conjuntos de Dados como Assunto , Células-Tronco Hematopoéticas/citologia , Humanos , Máquina de Vetores de SuporteRESUMO
BACKGROUND: Interoperability between formats is a recurring problem in systems biology research. Many tools have been developed to convert computational models from one format to another. However, they have been developed independently, resulting in redundancy of efforts and lack of synergy. RESULTS: Here we present the System Biology Format Converter (SBFC), which provide a generic framework to potentially convert any format into another. The framework currently includes several converters translating between the following formats: SBML, BioPAX, SBGN-ML, Matlab, Octave, XPP, GPML, Dot, MDL and APM. This software is written in Java and can be used as a standalone executable or web service. CONCLUSIONS: The SBFC framework is an evolving software project. Existing converters can be used and improved, and new converters can be easily added, making SBFC useful to both modellers and developers. The source code and documentation of the framework are freely available from the project web site.
Assuntos
Interface Usuário-Computador , Bases de Dados Factuais , Internet , Biologia de SistemasRESUMO
Embryonic stem cell (ESC) culture conditions are important for maintaining long-term self-renewal, and they influence cellular pluripotency state. Here, we report single cell RNA-sequencing of mESCs cultured in three different conditions: serum, 2i, and the alternative ground state a2i. We find that the cellular transcriptomes of cells grown in these conditions are distinct, with 2i being the most similar to blastocyst cells and including a subpopulation resembling the two-cell embryo state. Overall levels of intercellular gene expression heterogeneity are comparable across the three conditions. However, this masks variable expression of pluripotency genes in serum cells and homogeneous expression in 2i and a2i cells. Additionally, genes related to the cell cycle are more variably expressed in the 2i and a2i conditions. Mining of our dataset for correlations in gene expression allowed us to identify additional components of the pluripotency network, including Ptma and Zfp640, illustrating its value as a resource for future discovery.
Assuntos
Células-Tronco Embrionárias Murinas/fisiologia , RNA/genética , Transcriptoma , Animais , Diferenciação Celular/genética , Células Cultivadas , Quinase 3 da Glicogênio Sintase/antagonistas & inibidores , Quinase 3 da Glicogênio Sintase/metabolismo , MAP Quinase Quinase 1/antagonistas & inibidores , MAP Quinase Quinase 1/metabolismo , Camundongos , Células-Tronco Embrionárias Murinas/citologia , RNA/metabolismo , Proteínas Repressoras/genética , Proteínas Repressoras/metabolismo , Análise de Célula ÚnicaRESUMO
The transcriptome of single cells can reveal important information about cellular states and heterogeneity within populations of cells. Recently, single-cell RNA-sequencing has facilitated expression profiling of large numbers of single cells in parallel. To fully exploit these data, it is critical that suitable computational approaches are developed. One key challenge, especially pertinent when considering dividing populations of cells, is to understand the cell-cycle stage of each captured cell. Here we describe and compare five established supervised machine learning methods and a custom-built predictor for allocating cells to their cell-cycle stage on the basis of their transcriptome. In particular, we assess the impact of different normalisation strategies and the usage of prior knowledge on the predictive power of the classifiers. We tested the methods on previously published datasets and found that a PCA-based approach and the custom predictor performed best. Moreover, our analysis shows that the performance depends strongly on normalisation and the usage of prior knowledge. Only by leveraging prior knowledge in form of cell-cycle annotated genes and by preprocessing the data using a rank-based normalisation, is it possible to robustly capture the transcriptional cell-cycle signature across different cell types, organisms and experimental protocols.
Assuntos
Ciclo Celular/fisiologia , Perfilação da Expressão Gênica/métodos , Aprendizado de Máquina , Análise de Célula Única/métodos , Transcriptoma/fisiologia , Animais , Linhagem Celular Tumoral , Biologia Computacional/métodos , Células-Tronco Embrionárias/fisiologia , Hepatócitos/fisiologia , Humanos , CamundongosRESUMO
Recent technical developments have enabled the transcriptomes of hundreds of cells to be assayed in an unbiased manner, opening up the possibility that new subpopulations of cells can be found. However, the effects of potential confounding factors, such as the cell cycle, on the heterogeneity of gene expression and therefore on the ability to robustly identify subpopulations remain unclear. We present and validate a computational approach that uses latent variable models to account for such hidden factors. We show that our single-cell latent variable model (scLVM) allows the identification of otherwise undetectable subpopulations of cells that correspond to different stages during the differentiation of naive T cells into T helper 2 cells. Our approach can be used not only to identify cellular subpopulations but also to tease apart different sources of gene expression heterogeneity in single-cell transcriptomes.