Your browser doesn't support javascript.
loading
RgCop-A regularized copula based method for gene selection in single-cell RNA-seq data.
Lall, Snehalika; Ray, Sumanta; Bandyopadhyay, Sanghamitra.
Afiliação
  • Lall S; Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India.
  • Ray S; Department of Computer Science and Engineering, Aliah University, Kolkata, India.
  • Bandyopadhyay S; Machine Intelligence Unit, Indian Statistical Institute, Kolkata, India.
PLoS Comput Biol ; 17(10): e1009464, 2021 10.
Article em En | MEDLINE | ID: mdl-34665808
ABSTRACT
Gene selection in unannotated large single cell RNA sequencing (scRNA-seq) data is important and crucial step in the preliminary step of downstream analysis. The existing approaches are primarily based on high variation (highly variable genes) or significant high expression (highly expressed genes) failed to provide stable and predictive feature set due to technical noise present in the data. Here, we propose RgCop, a novel regularized copula based method for gene selection from large single cell RNA-seq data. RgCop utilizes copula correlation (Ccor), a robust equitable dependence measure that captures multivariate dependency among a set of genes in single cell expression data. We formulate an objective function by adding l1 regularization term with Ccor to penalizes the redundant co-efficient of features/genes, resulting non-redundant effective features/genes set. Results show a significant improvement in the clustering/classification performance of real life scRNA-seq data over the other state-of-the-art. RgCop performs extremely well in capturing dependence among the features of noisy data due to the scale invariant property of copula, thereby improving the stability of the method. Moreover, the differentially expressed (DE) genes identified from the clusters of scRNA-seq data are found to provide an accurate annotation of cells. Finally, the features/genes obtained from RgCop is able to annotate the unknown cells with high accuracy.
Assuntos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Biologia Computacional / Análise de Célula Única / RNA-Seq Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Biologia Computacional / Análise de Célula Única / RNA-Seq Tipo de estudo: Prognostic_studies Limite: Humans Idioma: En Ano de publicação: 2021 Tipo de documento: Article