ABSTRACT
BACKGROUND: Highly polymorphic human leukocyte antigen (HLA) genes are responsible for fine-tuning the adaptive immune system. High-resolution HLA typing is important for the treatment of autoimmune and infectious diseases. Additionally, it is routinely performed for identifying matched donors in transplantation medicine. Although many HLA typing approaches have been developed, the complexity, low-efficiency and high-cost of current HLA-typing assays limit their application in population-based high-throughput HLA typing for donors, which is required for creating large-scale databases for transplantation and precision medicine. RESULTS: Here, we present a cost-efficient Saturated Tiling Capture Sequencing (STC-Seq) approach to capturing 14 HLA class I and II genes. The highly efficient capture (an approximately 23,000-fold enrichment) of these genes allows for simplified allele calling. Tests on five genes (HLA-A/B/C/DRB1/DQB1) from 31 human samples and 351 datasets using STC-Seq showed results that were 98% consistent with the known two sets of digitals (field1 and field2) genotypes. Additionally, STC can capture genomic DNA fragments longer than 3 kb from HLA loci, making the library compatible with the third-generation sequencing. CONCLUSIONS: STC-Seq is a highly accurate and cost-efficient method for HLA typing which can be used to facilitate the establishment of population-based HLA databases for the precision and transplantation medicine.
Subject(s)
High-Throughput Nucleotide Sequencing , Histocompatibility Testing/methods , Sequence Analysis, DNA , HLA Antigens/genetics , HumansABSTRACT
Single cell chromatin accessibility profiling and transcriptome sequencing are the most widely used technologies for single-cell genomics. Here, we present Microwell-seq3, a high-throughput and facile platform for high-sensitivity single-nucleus chromatin accessibility or full-length transcriptome profiling. The method combines a preindexing strategy and a penetrable chip-in-a-tube for single nucleus loading and DNA amplification and therefore does not require specialized equipment. We used Microwell-seq3 to profile chromatin accessibility in more than 200,000 single nuclei and the full-length transcriptome in ~50,000 nuclei from multiple adult mouse tissues. Compared with the existing polyadenylated transcript capture methods, integrative analysis of cell type-specific regulatory elements and total RNA expression uncovered comprehensive cell type heterogeneity in the brain. Gene regulatory networks based on chromatin accessibility profiling provided an improved cell type communication model. Finally, we demonstrated that Microwell-seq3 can identify malignant cells and their specific regulons in spontaneous lung tumors of aged mice. We envision a broad application of Microwell-seq3 in many areas of research.
ABSTRACT
In vivo differentiation of human pluripotent stem cells (hPSCs) has unique advantages, such as multilineage differentiation, angiogenesis, and close cell-cell interactions. To systematically investigate multilineage differentiation mechanisms of hPSCs, we constructed the in vivo hPSC differentiation landscape containing 239,670 cells using teratoma models. We identified 43 cell types, inferred 18 cell differentiation trajectories, and characterized common and specific gene regulation patterns during hPSC differentiation at both transcriptional and epigenetic levels. Additionally, we developed the developmental single-cell Basic Local Alignment Search Tool (dscBLAST), an R-based cell identification tool, to simplify the identification processes of developmental cells. Using dscBLAST, we aligned cells in multiple differentiation models to normally developing cells to further understand their differentiation states. Overall, our study offers new insights into stem cell differentiation and human embryonic development; dscBLAST shows favorable cell identification performance, providing a powerful identification tool for developmental cells.
Subject(s)
Pluripotent Stem Cells , Humans , Cell Differentiation/genetics , Pluripotent Stem Cells/metabolism , Gene Expression Regulation , Embryonic DevelopmentABSTRACT
Waddington's epigenetic landscape is a metaphor frequently used to illustrate cell differentiation. Recent advances in single-cell genomics are altering our understanding of the Waddington landscape, yet the molecular mechanisms of cell-fate decisions remain poorly understood. We constructed a cell landscape of mouse lineage differentiation during development at the single-cell level and described both lineage-common and lineage-specific regulatory programs during cell-type maturation. We also found lineage-common regulatory programs that are broadly active during the development of invertebrates and vertebrates. In particular, we identified Xbp1 as an evolutionarily conserved regulator of cell-fate determinations across different species. We demonstrated that Xbp1 transcriptional regulation is important for the stabilization of the gene-regulatory networks for a wide range of mouse cell types. Our results offer genetic and molecular insights into cellular gene-regulatory programs and will serve as a basis for further advancing the understanding of cell-fate decisions.
Subject(s)
Epigenesis, Genetic , Models, Genetic , Animals , Cell Differentiation/genetics , Cell Lineage/genetics , Epigenomics , Gene Regulatory Networks/genetics , MiceABSTRACT
Despite extensive efforts to generate and analyze reference genomes, genetic models to predict gene regulation and cell fate decisions are lacking for most species. Here, we generated whole-body single-cell transcriptomic landscapes of zebrafish, Drosophila and earthworm. We then integrated cell landscapes from eight representative metazoan species to study gene regulation across evolution. Using these uniformly constructed cross-species landscapes, we developed a deep-learning-based strategy, Nvwa, to predict gene expression and identify regulatory sequences at the single-cell level. We systematically compared cell-type-specific transcription factors to reveal conserved genetic regulation in vertebrates and invertebrates. Our work provides a valuable resource and offers a new strategy for studying regulatory grammar in diverse biological systems.
Subject(s)
Deep Learning , Zebrafish , Animals , Zebrafish/genetics , Zebrafish/metabolism , Transcription Factors/genetics , Transcription Factors/metabolism , Gene Expression Regulation , Drosophila/genetics , Drosophila/metabolism , Conserved Sequence/geneticsABSTRACT
[This corrects the article DOI: 10.1371/journal.pone.0206844.].
ABSTRACT
BACKGROUND: Key regulators of developmental processes can be prioritized through integrated analysis of ChIP-Seq data of master transcriptional factors (TFs) such as Nanog and Oct4, active histone modifications (HMs) such as H3K4me3 and H3K27ac, and repressive HMs such as H3K27me3. Recent studies show that broad enrichment signals such as super-enhancers and broad H3K4me3 enrichment signals play more dominant roles than short enrichment signals of the master TFs and H3K4me3 in epigenetic regulatory mechanism. Besides the broad enrichment signals, up to ten thousands of short enrichment signals of these TFs and HMs exist in genome. Prioritization of these broad enrichment signals from ChIP-Seq data is a prerequisite for such integrated analysis. RESULTS: Here, we present a method named Clustering-Local-Unique-Enriched-Signals (CLUES), which uses an adaptive-size-windows strategy to identify enriched regions (ERs) and cluster them into broad enrichment signals. Tested on 62 ENCODE ChIP-Seq datasets of Ctcf and Nrsf, CLUES performs equally well as MACS2 regarding prioritization of ERs with the TF's motif. Tested on 165 ENCODE ChIP-Seq datasets of H3K4me3, H3K27me3, and H3K36me3, CLUES performs better than existing algorithms on prioritizing broad enrichment signals implicating cell functions influenced by epigenetic regulatory mechanism in cells. Most importantly, CLUES helps to confirm several novel regulators of mouse ES cell self-renewal and pluripotency through integrated analysis of prioritized broad enrichment signals of H3K4me3, H3K27me3, Nanog and Oct4 with the support of a CRISPR/Cas9 negative selection genetic screen. CONCLUSIONS: CLUES holds promise for prioritizing broad enrichment signals from ChIP-Seq data. The download site for CLUES is https://github.com/Wuchao1984/CLUESv1.
Subject(s)
Cell Self Renewal/genetics , Embryonic Stem Cells , Epigenesis, Genetic , Histone-Lysine N-Methyltransferase/genetics , Animals , CRISPR-Cas Systems/genetics , Chromatin Immunoprecipitation , Histone Code/genetics , Mice , Promoter Regions, Genetic , Protein Processing, Post-Translational , Regulatory Sequences, Nucleic AcidABSTRACT
Current single-cell RNA-seq approaches are hindered by preamplification bias, loss of strand of origin information, and the inability to observe small-RNA and mRNA dual transcriptomes. Here, we introduce a single-cell holo-transcriptome sequencing (Holo-Seq) that overcomes all three hurdles. Holo-Seq has the same quantitative accuracy and uniform coverage with a complete strand of origin information as bulk RNA-seq. Most importantly, Holo-Seq can simultaneously observe small RNAs and mRNAs in a single cell. Furthermore, we acquire small RNA and mRNA dual transcriptomes of 32 human hepatocellular carcinoma single cells, which display the genome-wide super-enhancer activity and hepatic neoplasm kinetics of these cells.