Your browser doesn't support javascript.
loading
Modeling expression ranks for noise-tolerant differential expression analysis of scRNA-seq data.
Gupta, Krishan; Lalit, Manan; Biswas, Aditya; Sanada, Chad D; Greene, Cassandra; Hukari, Kyle; Maulik, Ujjwal; Bandyopadhyay, Sanghamitra; Ramalingam, Naveen; Ahuja, Gaurav; Ghosh, Abhik; Sengupta, Debarka.
Afiliación
  • Gupta K; Department of Computer Science and Engineering, Indraprastha Institute of Information Technology, Delhi 110020, India.
  • Lalit M; Max Planck Institute of Molecular Cell Biology and Genetics, Dresden 01307, Germany.
  • Biswas A; Microsoft India Private Limited, Hyderabad, Telangana 500032, India.
  • Sanada CD; Fluidigm Corporation, South San Francisco, California 94080, USA.
  • Greene C; Fluidigm Corporation, South San Francisco, California 94080, USA.
  • Hukari K; Fluidigm Corporation, South San Francisco, California 94080, USA.
  • Maulik U; Department of Computer Science, Jadavpur University, Kolkata, West Bengal 700032, India.
  • Bandyopadhyay S; Machine Intelligence Unit, Indian Statistical Institute, Kolkata 700108, India.
  • Ramalingam N; Fluidigm Corporation, South San Francisco, California 94080, USA.
  • Ahuja G; Department of Computational Biology, Indraprastha Institute of Information Technology, Delhi 110020, India.
  • Ghosh A; Interdisciplinary Statistical Research Unit, Indian Statistical Institute, Kolkata 700108, India.
  • Sengupta D; Department of Computer Science and Engineering, Indraprastha Institute of Information Technology, Delhi 110020, India.
Genome Res ; 31(4): 689-697, 2021 04.
Article en En | MEDLINE | ID: mdl-33674351
ABSTRACT
Systematic delineation of complex biological systems is an ever-challenging and resource-intensive process. Single-cell transcriptomics allows us to study cell-to-cell variability in complex tissues at an unprecedented resolution. Accurate modeling of gene expression plays a critical role in the statistical determination of tissue-specific gene expression patterns. In the past few years, considerable efforts have been made to identify appropriate parametric models for single-cell expression data. The zero-inflated version of Poisson/negative binomial and log-normal distributions have emerged as the most popular alternatives owing to their ability to accommodate high dropout rates, as commonly observed in single-cell data. Although the majority of the parametric approaches directly model expression estimates, we explore the potential of modeling expression ranks, as robust surrogates for transcript abundance. Here we examined the performance of the discrete generalized beta distribution (DGBD) on real data and devised a Wald-type test for comparing gene expression across two phenotypically divergent groups of single cells. We performed a comprehensive assessment of the proposed method to understand its advantages compared with some of the existing best-practice approaches. We concluded that besides striking a reasonable balance between Type I and Type II errors, ROSeq, the proposed differential expression test, is exceptionally robust to expression noise and scales rapidly with increasing sample size. For wider dissemination and adoption of the method, we created an R package called ROSeq and made it available on the Bioconductor platform.
Asunto(s)

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Perfilación de la Expresión Génica / Análisis de la Célula Individual / RNA-Seq Tipo de estudio: Guideline Idioma: En Revista: Genome Res Asunto de la revista: BIOLOGIA MOLECULAR / GENETICA Año: 2021 Tipo del documento: Article País de afiliación: India

Texto completo: 1 Colección: 01-internacional Banco de datos: MEDLINE Asunto principal: Perfilación de la Expresión Génica / Análisis de la Célula Individual / RNA-Seq Tipo de estudio: Guideline Idioma: En Revista: Genome Res Asunto de la revista: BIOLOGIA MOLECULAR / GENETICA Año: 2021 Tipo del documento: Article País de afiliación: India