Your browser doesn't support javascript.
loading
Accelerated dimensionality reduction of single-cell RNA sequencing data with fastglmpca.
Weine, Eric; Carbonetto, Peter; Stephens, Matthew.
Affiliation
  • Weine E; Laboratory for Information and Decision Systems, Massachusetts Institute of Technology, Cambridge, MA 02139, United States.
  • Carbonetto P; Department of Data Science, Dana Farber Cancer Institute, Boston, MA 02215, United States.
  • Stephens M; Department of Human Genetics, University of Chicago, Chicago, IL 60637, United States.
Bioinformatics ; 40(8)2024 08 02.
Article in En | MEDLINE | ID: mdl-39110511
ABSTRACT

SUMMARY:

Motivated by theoretical and practical issues that arise when applying Principal component analysis (PCA) to count data, Townes et al. introduced "Poisson GLM-PCA", a variation of PCA adapted to count data, as a tool for dimensionality reduction of single-cell RNA sequencing (scRNA-seq) data. However, fitting GLM-PCA is computationally challenging. Here we study this problem, and show that a simple algorithm, which we call "Alternating Poisson Regression" (APR), produces better quality fits, and in less time, than existing algorithms. APR is also memory-efficient and lends itself to parallel implementation on multi-core processors, both of which are helpful for handling large scRNA-seq datasets. We illustrate the benefits of this approach in three publicly available scRNA-seq datasets. The new algorithms are implemented in an R package, fastglmpca. AVAILABILITY AND IMPLEMENTATION The fastglmpca R package is released on CRAN for Windows, macOS and Linux, and the source code is available at github.com/stephenslab/fastglmpca under the open source GPL-3 license. Scripts to reproduce the results in this paper are also available in the GitHub repository and on Zenodo.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Software / Sequence Analysis, RNA / Single-Cell Analysis Limits: Humans Language: En Journal: Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: United States Country of publication: United kingdom

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Algorithms / Software / Sequence Analysis, RNA / Single-Cell Analysis Limits: Humans Language: En Journal: Bioinformatics Journal subject: INFORMATICA MEDICA Year: 2024 Document type: Article Affiliation country: United States Country of publication: United kingdom