Your browser doesn't support javascript.
loading
Single-cell multi-ome regression models identify functional and disease-associated enhancers and enable chromatin potential analysis.
Mitra, Sneha; Malik, Rohan; Wong, Wilfred; Rahman, Afsana; Hartemink, Alexander J; Pritykin, Yuri; Dey, Kushal K; Leslie, Christina S.
Affiliation
  • Mitra S; Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York City, NY, USA.
  • Malik R; Rye Country Day School, Rye, NY, USA.
  • Wong W; Computational and Systems Biology Program, Memorial Sloan Kettering Cancer Center, New York City, NY, USA.
  • Rahman A; Tri-Institutional Training Program in Computational Biology and Medicine, New York City, NY, USA.
  • Hartemink AJ; Hunter College, City University of New York, New York City, NY, USA.
  • Pritykin Y; Department of Computer Science, Duke University, Durham, NC, USA.
  • Dey KK; Program in Computational Biology and Bioinformatics, Duke University, Durham, NC, USA.
  • Leslie CS; Center for Genomic and Computational Biology, Duke University, Durham, NC, USA.
Nat Genet ; 56(4): 627-636, 2024 Apr.
Article in En | MEDLINE | ID: mdl-38514783
ABSTRACT
We present a gene-level regulatory model, single-cell ATAC + RNA linking (SCARlink), which predicts single-cell gene expression and links enhancers to target genes using multi-ome (scRNA-seq and scATAC-seq co-assay) sequencing data. The approach uses regularized Poisson regression on tile-level accessibility data to jointly model all regulatory effects at a gene locus, avoiding the limitations of pairwise gene-peak correlations and dependence on peak calling. SCARlink outperformed existing gene scoring methods for imputing gene expression from chromatin accessibility across high-coverage multi-ome datasets while giving comparable to improved performance on low-coverage datasets. Shapley value analysis on trained models identified cell-type-specific gene enhancers that are validated by promoter capture Hi-C and are 11× to 15× and 5× to 12× enriched in fine-mapped eQTLs and fine-mapped genome-wide association study (GWAS) variants, respectively. We further show that SCARlink-predicted and observed gene expression vectors provide a robust way to compute a chromatin potential vector field to enable developmental trajectory analysis.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Chromatin / Genome-Wide Association Study Language: En Journal: Nat Genet Journal subject: GENETICA MEDICA Year: 2024 Document type: Article Affiliation country: Estados Unidos

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Chromatin / Genome-Wide Association Study Language: En Journal: Nat Genet Journal subject: GENETICA MEDICA Year: 2024 Document type: Article Affiliation country: Estados Unidos