Búsqueda | Portal de Búsqueda de la BVS España

ScSmOP: a universal computational pipeline for single-cell single-molecule multiomics data analysis.

Jing, Kai; Xu, Yewen; Yang, Yang; Yin, Pengfei; Ning, Duo; Huang, Guangyu; Deng, Yuqing; Chen, Gengzhan; Li, Guoliang; Tian, Simon Zhongyuan; Zheng, Meizhen.

Brief Bioinform ; 24(6)2023 09 22.

Artículo en Inglés | MEDLINE | ID: mdl-37779245

RESUMEN

Single-cell multiomics techniques have been widely applied to detect the key signature of cells. These methods have achieved a single-molecule resolution and can even reveal spatial localization. These emerging methods provide insights elucidating the features of genomic, epigenomic and transcriptomic heterogeneity in individual cells. However, they have given rise to new computational challenges in data processing. Here, we describe Single-cell Single-molecule multiple Omics Pipeline (ScSmOP), a universal pipeline for barcode-indexed single-cell single-molecule multiomics data analysis. Essentially, the C language is utilized in ScSmOP to set up spaced-seed hash table-based algorithms for barcode identification according to ligation-based barcoding data and synthesis-based barcoding data, followed by data mapping and deconvolution. We demonstrate high reproducibility of data processing between ScSmOP and published pipelines in comprehensive analyses of single-cell omics data (scRNA-seq, scATAC-seq, scARC-seq), single-molecule chromatin interaction data (ChIA-Drop, SPRITE, RD-SPRITE), single-cell single-molecule chromatin interaction data (scSPRITE) and spatial transcriptomic data from various cell types and species. Additionally, ScSmOP shows more rapid performance and is a versatile, efficient, easy-to-use and robust pipeline for single-cell single-molecule multiomics data analysis.

Asunto(s)

Genómica , Multiómica , Reproducibilidad de los Resultados , Cromatina/genética , Análisis de Datos

MCIBox: a toolkit for single-molecule multi-way chromatin interaction visualization and micro-domains identification.

Tian, Simon Zhongyuan; Li, Guoliang; Ning, Duo; Jing, Kai; Xu, Yewen; Yang, Yang; Fullwood, Melissa J; Yin, Pengfei; Huang, Guangyu; Plewczynski, Dariusz; Zhai, Jixian; Dai, Ziwei; Chen, Wei; Zheng, Meizhen.

Brief Bioinform ; 23(6)2022 11 19.

Artículo en Inglés | MEDLINE | ID: mdl-36094071

RESUMEN

The emerging ligation-free three-dimensional (3D) genome mapping technologies can identify multiplex chromatin interactions with single-molecule precision. These technologies not only offer new insight into high-dimensional chromatin organization and gene regulation, but also introduce new challenges in data visualization and analysis. To overcome these challenges, we developed MCIBox, a toolkit for multi-way chromatin interaction (MCI) analysis, including a visualization tool and a platform for identifying micro-domains with clustered single-molecule chromatin complexes. MCIBox is based on various clustering algorithms integrated with dimensionality reduction methods that can display multiplex chromatin interactions at single-molecule level, allowing users to explore chromatin extrusion patterns and super-enhancers regulation modes in transcription, and to identify single-molecule chromatin complexes that are clustered into micro-domains. Furthermore, MCIBox incorporates a two-dimensional kernel density estimation algorithm to identify micro-domains boundaries automatically. These micro-domains were stratified with distinctive signatures of transcription activity and contained different cell-cycle-associated genes. Taken together, MCIBox represents an invaluable tool for the study of multiple chromatin interactions and inaugurates a previously unappreciated view of 3D genome structure.

Asunto(s)

Cromatina , Secuencias Reguladoras de Ácidos Nucleicos , Cromatina/genética , Genoma , Regulación de la Expresión Génica

MCI-frcnn: A deep learning method for topological micro-domain boundary detection.

Tian, Simon Zhongyuan; Yin, Pengfei; Jing, Kai; Yang, Yang; Xu, Yewen; Huang, Guangyu; Ning, Duo; Fullwood, Melissa J; Zheng, Meizhen.

Front Cell Dev Biol ; 10: 1050769, 2022.

Artículo en Inglés | MEDLINE | ID: mdl-36531953

RESUMEN

Chromatin structural domains, or topologically associated domains (TADs), are a general organizing principle in chromatin biology. RNA polymerase II (RNAPII) mediates multiple chromatin interactive loops, tethering together as RNAPII-associated chromatin interaction domains (RAIDs) to offer a framework for gene regulation. RAID and TAD alterations have been found to be associated with diseases. They can be further dissected as micro-domains (micro-TADs and micro-RAIDs) by clustering single-molecule chromatin-interactive complexes from next-generation three-dimensional (3D) genome techniques, such as ChIA-Drop. Currently, there are few tools available for micro-domain boundary identification. In this work, we developed the MCI-frcnn deep learning method to train a Faster Region-based Convolutional Neural Network (Faster R-CNN) for micro-domain boundary detection. At the training phase in MCI-frcnn, 50 images of RAIDs from Drosophila RNAPII ChIA-Drop data, containing 261 micro-RAIDs with ground truth boundaries, were trained for 7 days. Using this well-trained MCI-frcnn, we detected micro-RAID boundaries for the input new images, with a fast speed (5.26 fps), high recognition accuracy (AUROC = 0.85, mAP = 0.69), and high boundary region quantification (genomic IoU = 76%). We further applied MCI-frcnn to detect human micro-TADs boundaries using human GM12878 SPRITE data and obtained a high region quantification score (mean gIoU = 85%). In all, the MCI-frcnn deep learning method which we developed in this work is a general tool for micro-domain boundary detection.

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA