Your browser doesn't support javascript.
loading
Identification of robust cellular programs using reproducible LDA that impact sex-specific disease progression in different genotypes of a mouse model of AD.
Rezaie, Narges; Rebboah, Elisabeth; Williams, Brian A; Liang, Heidi Yahan; Reese, Fairlie; Balderrama-Gutierrez, Gabriela; Dionne, Louise A; Reinholdt, Laura; Trout, Diane; Wold, Barbara J; Mortazavi, Ali.
Afiliación
  • Rezaie N; Department of Developmental and Cell Biology, University of California, Irvine, CA, USA.
  • Rebboah E; Center for Complex Biological Systems, University of California, Irvine, CA, USA.
  • Williams BA; Department of Developmental and Cell Biology, University of California, Irvine, CA, USA.
  • Liang HY; Center for Complex Biological Systems, University of California, Irvine, CA, USA.
  • Reese F; Division of Biology, California Institute of Technology, Pasadena, CA, USA.
  • Balderrama-Gutierrez G; Department of Developmental and Cell Biology, University of California, Irvine, CA, USA.
  • Dionne LA; Center for Complex Biological Systems, University of California, Irvine, CA, USA.
  • Reinholdt L; Department of Developmental and Cell Biology, University of California, Irvine, CA, USA.
  • Trout D; Center for Complex Biological Systems, University of California, Irvine, CA, USA.
  • Wold BJ; Department of Developmental and Cell Biology, University of California, Irvine, CA, USA.
  • Mortazavi A; Center for Complex Biological Systems, University of California, Irvine, CA, USA.
bioRxiv ; 2024 Feb 29.
Article en En | MEDLINE | ID: mdl-38464087
ABSTRACT
The gene expression profiles of distinct cell types reflect complex genomic interactions among multiple simultaneous biological processes within each cell that can be altered by disease progression as well as genetic background. The identification of these active cellular programs is an open challenge in the analysis of single-cell RNA-seq data. Latent Dirichlet Allocation (LDA) is a generative method used to identify recurring patterns in counts data, commonly referred to as topics that can be used to interpret the state of each cell. However, LDA's interpretability is hindered by several key factors including the hyperparameter selection of the number of topics as well as the variability in topic definitions due to random initialization. We developed Topyfic, a Reproducible LDA (rLDA) package, to accurately infer the identity and activity of cellular programs in single-cell data, providing insights into the relative contributions of each program in individual cells. We apply Topyfic to brain single-cell and single-nucleus datasets of two 5xFAD mouse models of Alzheimer's disease crossed with C57BL6/J or CAST/EiJ mice to identify distinct cell types and states in different cell types such as microglia. We find that 8-month 5xFAD/Cast F1 males show higher level of microglial activation than matching 5xFAD/BL6 F1 males, whereas female mice show similar levels of microglial activation. We show that regulatory genes such as TFs, microRNA host genes, and chromatin regulatory genes alone capture cell types and cell states. Our study highlights how topic modeling with a limited vocabulary of regulatory genes can identify gene expression programs in single-cell data in order to quantify similar and divergent cell states in distinct genotypes.

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Idioma: En Revista: BioRxiv Año: 2024 Tipo del documento: Article País de afiliación: Estados Unidos