Search | Brasil - Virtual Health Library

Directly selecting cell-type marker genes for single-cell clustering analyses.

Chen, Zihao; Wang, Changhu; Huang, Siyuan; Shi, Yang; Xi, Ruibin.

Cell Rep Methods ; 4(7): 100810, 2024 Jul 15.

Article in English | MEDLINE | ID: mdl-38981475

ABSTRACT

In single-cell RNA sequencing (scRNA-seq) studies, cell types and their marker genes are often identified by clustering and differentially expressed gene (DEG) analysis. A common practice is to select genes using surrogate criteria such as variance and deviance, then cluster them using selected genes and detect markers by DEG analysis assuming known cell types. The surrogate criteria can miss important genes or select unimportant genes, while DEG analysis has the selection-bias problem. We present Festem, a statistical method for the direct selection of cell-type markers for downstream clustering. Festem distinguishes marker genes with heterogeneous distribution across cells that are cluster informative. Simulation and scRNA-seq applications demonstrate that Festem can sensitively select markers with high precision and enables the identification of cell types often missed by other methods. In a large intrahepatic cholangiocarcinoma dataset, we identify diverse CD8+ T cell types and potential prognostic marker genes.

Subject(s)

Single-Cell Analysis , Single-Cell Analysis/methods , Humans , Cluster Analysis , Gene Expression Profiling/methods , Sequence Analysis, RNA/methods , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , CD8-Positive T-Lymphocytes/metabolism , Cholangiocarcinoma/genetics , Cholangiocarcinoma/pathology , Genetic Markers/genetics

Evaluation of tools for highly variable gene discovery from single-cell RNA-seq data.

Yip, Shun H; Sham, Pak Chung; Wang, Junwen.

Brief Bioinform ; 20(4): 1583-1589, 2019 07 19.

Article in English | MEDLINE | ID: mdl-29481632

ABSTRACT

Traditional RNA sequencing (RNA-seq) allows the detection of gene expression variations between two or more cell populations through differentially expressed gene (DEG) analysis. However, genes that contribute to cell-to-cell differences are not discoverable with RNA-seq because RNA-seq samples are obtained from a mixture of cells. Single-cell RNA-seq (scRNA-seq) allows the detection of gene expression in each cell. With scRNA-seq, highly variable gene (HVG) discovery allows the detection of genes that contribute strongly to cell-to-cell variation within a homogeneous cell population, such as a population of embryonic stem cells. This analysis is implemented in many software packages. In this study, we compare seven HVG methods from six software packages, including BASiCS, Brennecke, scLVM, scran, scVEGs and Seurat. Our results demonstrate that reproducibility in HVG analysis requires a larger sample size than DEG analysis. Discrepancies between methods and potential issues in these tools are discussed and recommendations are made.

Subject(s)

RNA-Seq/statistics & numerical data , Software , Animals , Cluster Analysis , Computational Biology , Computer Simulation , Databases, Nucleic Acid/statistics & numerical data , Gene Expression Profiling/statistics & numerical data , Genetic Variation , Humans , Reproducibility of Results , Single-Cell Analysis/statistics & numerical data

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL