RESUMEN
Single cell chromatin accessibility assays reveal epigenomic variability at cis-regulatory elements among individual cells. We previously developed a single-cell DNase-seq assay (scDNase-seq) to profile accessible chromatin in a limited number of single cells. Here, we report a novel indexing strategy to resolve single-cell DNase hypersensitivity profiles based on bulk cell analysis. This new technique, termed indexing single-cell DNase sequencing (iscDNase-seq), employs the activities of terminal DNA transferase (TdT) and T4 DNA ligase to add unique cell barcodes to DNase-digested chromatin ends. By a three-layer indexing strategy, it allows profiling genome-wide DHSs for >15 000 single-cells in a single experiment. Application of iscDNase-seq to human white blood cells accurately revealed specific cell types and inferred regulatory transcription factors (TF) specific to each cell type. We found that iscDNase-seq detected DHSs with specific properties related to gene expression and conservation missed by scATAC-seq for the same cell type. Also, we found that the cell-to-cell variation in accessibility computed using iscDNase-seq data is significantly correlated with the cell-to-cell variation in gene expression. Importantly, this correlation is significantly higher than that between scATAC-seq and scRNA-seq, suggesting that iscDNase-seq data can better predict the cellular heterogeneity in gene expression compared to scATAC-seq. Thus, iscDNase-seq is an attractive alternative method for single-cell epigenomics studies.
Asunto(s)
Epigénesis Genética , Expresión Génica , Secuenciación de Nucleótidos de Alto Rendimiento/métodos , Análisis de Secuencia de ADN/métodos , Análisis de la Célula Individual/métodos , Cromatina/metabolismo , Desoxirribonucleasa I/metabolismo , HumanosRESUMEN
An amendment to this paper has been published and can be accessed via a link at the top of the paper.
RESUMEN
Modern next-generation sequencing-based methods have empowered researchers to assay the epigenetic states of individual cells. Existing techniques for profiling epigenetic marks in single cells often require the use and optimization of time-intensive procedures such as drop fluidics, chromatin fragmentation, and end repair. Here we describe ACT-seq, a streamlined method for mapping genome-wide distributions of histone tail modifications, histone variants, and chromatin-binding proteins in a small number of or single cells. ACT-seq utilizes a fusion of Tn5 transposase to Protein A that is targeted to chromatin by a specific antibody, allowing chromatin fragmentation and sequence tag insertion specifically at genomic sites presenting the relevant antigen. The Tn5 transposase enables the use of an index multiplexing strategy (iACT-seq), which enables construction of thousands of single-cell libraries in one day by a single researcher without the need for drop-based fluidics or visual sorting. We conclude that ACT-seq present an attractive alternative to existing techniques for mapping epigenetic marks in single cells.
Asunto(s)
Mapeo Cromosómico/métodos , Código de Histonas , Histonas/genética , Análisis de la Célula Individual/métodos , Anticuerpos/inmunología , Anticuerpos/metabolismo , Cromatina/inmunología , Cromatina/metabolismo , Inmunoprecipitación de Cromatina , Epigenómica/métodos , Células HEK293 , Histonas/metabolismo , Humanos , Procesamiento Proteico-Postraduccional , Proteínas Recombinantes de Fusión/genética , Proteínas Recombinantes de Fusión/metabolismo , Análisis de Secuencia de ADN/métodos , Coloración y Etiquetado/métodos , Proteína Estafilocócica A/genética , Proteína Estafilocócica A/metabolismo , Transposasas/genética , Transposasas/metabolismoRESUMEN
Taxonomic markers such as the 16S ribosomal RNA gene are widely used in microbial community analysis. A common first step in marker-gene analysis is grouping genes into clusters to reduce data sets to a more manageable size and potentially mitigate the effects of sequencing error. Instead of clustering based on sequence identity, marker-gene data sets collected over time can be clustered based on temporal correlation to reveal ecologically meaningful associations. We present Ananke, a free and open-source algorithm and software package that complements existing sequence-identity-based clustering approaches by clustering marker-gene data based on time-series profiles and provides interactive visualization of clusters, including highlighting of internal OTU inconsistencies. Ananke is able to cluster distinct temporal patterns from simulations of multiple ecological patterns, such as periodic seasonal dynamics and organism appearances/disappearances. We apply our algorithm to two longitudinal marker gene data sets: faecal communities from the human gut of an individual sampled over one year, and communities from a freshwater lake sampled over eleven years. Within the gut, the segregation of the bacterial community around a food-poisoning event was immediately clear. In the freshwater lake, we found that high sequence identity between marker genes does not guarantee similar temporal dynamics, and Ananke time-series clusters revealed patterns obscured by clustering based on sequence identity or taxonomy. Ananke is free and open-source software available at https://github.com/beiko-lab/ananke.