Your browser doesn't support javascript.
loading
bayNorm: Bayesian gene expression recovery, imputation and normalization for single-cell RNA-sequencing data.
Tang, Wenhao; Bertaux, François; Thomas, Philipp; Stefanelli, Claire; Saint, Malika; Marguerat, Samuel; Shahrezaei, Vahid.
Afiliação
  • Tang W; Department of Mathematics, Faculty of Natural Sciences, Imperial College, London SW7 2AZ, UK.
  • Bertaux F; Department of Mathematics, Faculty of Natural Sciences, Imperial College, London SW7 2AZ, UK.
  • Thomas P; MRC London Institute of Medical Sciences (LMS), London W12 0NN, UK.
  • Stefanelli C; Faculty of Medicine, Institute of Clinical Sciences (ICS), Imperial College London, London W12 0NN, UK.
  • Saint M; Department of Mathematics, Faculty of Natural Sciences, Imperial College, London SW7 2AZ, UK.
  • Marguerat S; Department of Mathematics, Faculty of Natural Sciences, Imperial College, London SW7 2AZ, UK.
  • Shahrezaei V; MRC London Institute of Medical Sciences (LMS), London W12 0NN, UK.
Bioinformatics ; 36(4): 1174-1181, 2020 02 15.
Article em En | MEDLINE | ID: mdl-31584606
ABSTRACT
MOTIVATION Normalization of single-cell RNA-sequencing (scRNA-seq) data is a prerequisite to their interpretation. The marked technical variability, high amounts of missing observations and batch effect typical of scRNA-seq datasets make this task particularly challenging. There is a need for an efficient and unified approach for normalization, imputation and batch effect correction.

RESULTS:

Here, we introduce bayNorm, a novel Bayesian approach for scaling and inference of scRNA-seq counts. The method's likelihood function follows a binomial model of mRNA capture, while priors are estimated from expression values across cells using an empirical Bayes approach. We first validate our assumptions by showing this model can reproduce different statistics observed in real scRNA-seq data. We demonstrate using publicly available scRNA-seq datasets and simulated expression data that bayNorm allows robust imputation of missing values generating realistic transcript distributions that match single molecule fluorescence in situ hybridization measurements. Moreover, by using priors informed by dataset structures, bayNorm improves accuracy and sensitivity of differential expression analysis and reduces batch effect compared with other existing methods. Altogether, bayNorm provides an efficient, integrated solution for global scaling normalization, imputation and true count recovery of gene expression measurements from scRNA-seq data. AVAILABILITY AND IMPLEMENTATION The R package 'bayNorm' is publishd on bioconductor at https//bioconductor.org/packages/release/bioc/html/bayNorm.html. The code for analyzing data in this article is available at https//github.com/WT215/bayNorm_papercode. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Assuntos

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / RNA Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2020 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Software / RNA Tipo de estudo: Prognostic_studies Idioma: En Ano de publicação: 2020 Tipo de documento: Article