Synthetic DNA barcodes identify singlets in scRNA-seq datasets and evaluate doublet algorithms.
Cell Genom
; 4(7): 100592, 2024 Jul 10.
Article
em En
| MEDLINE
| ID: mdl-38925122
ABSTRACT
Single-cell RNA sequencing (scRNA-seq) datasets contain true single cells, or singlets, in addition to cells that coalesce during the protocol, or doublets. Identifying singlets with high fidelity in scRNA-seq is necessary to avoid false negative and false positive discoveries. Although several methodologies have been proposed, they are typically tested on highly heterogeneous datasets and lack a priori knowledge of true singlets. Here, we leveraged datasets with synthetically introduced DNA barcodes for a hitherto unexplored application to extract ground-truth singlets. We demonstrated the feasibility of our framework, "singletCode," to evaluate existing doublet detection methods across a range of contexts. We also leveraged our ground-truth singlets to train a proof-of-concept machine learning classifier, which outperformed other doublet detection algorithms. Our integrative framework can identify ground-truth singlets and enable robust doublet detection in non-barcoded datasets.
Palavras-chave
Texto completo:
1
Base de dados:
MEDLINE
Assunto principal:
Algoritmos
/
Análise de Célula Única
/
Código de Barras de DNA Taxonômico
Limite:
Animals
/
Humans
Idioma:
En
Ano de publicação:
2024
Tipo de documento:
Article