scDOT: enhancing single-cell RNA-Seq data annotation and uncovering novel cell types through multi-reference integration.
Brief Bioinform
; 25(2)2024 Jan 22.
Article
em En
| MEDLINE
| ID: mdl-38436563
ABSTRACT
The proliferation of single-cell RNA-seq data has greatly enhanced our ability to comprehend the intricate nature of diverse tissues. However, accurately annotating cell types in such data, especially when handling multiple reference datasets and identifying novel cell types, remains a significant challenge. To address these issues, we introduce Single Cell annotation based on Distance metric learning and Optimal Transport (scDOT), an innovative cell-type annotation method adept at integrating multiple reference datasets and uncovering previously unseen cell types. scDOT introduces two key innovations. First, by incorporating distance metric learning and optimal transport, it presents a novel optimization framework. This framework effectively learns the predictive power of each reference dataset for new query data and simultaneously establishes a probabilistic mapping between cells in the query data and reference-defined cell types. Secondly, scDOT develops an interpretable scoring system based on the acquired probabilistic mapping, enabling the precise identification of previously unseen cell types within the data. To rigorously assess scDOT's capabilities, we systematically evaluate its performance using two diverse collections of benchmark datasets encompassing various tissues, sequencing technologies and diverse cell types. Our experimental results consistently affirm the superior performance of scDOT in cell-type annotation and the identification of previously unseen cell types. These advancements provide researchers with a potent tool for precise cell-type annotation, ultimately enriching our understanding of complex biological tissues.
Palavras-chave
Texto completo:
1
Eixos temáticos:
Capacitacao_em_gestao_de_ciencia
/
Inovacao_tecnologica
Base de dados:
MEDLINE
Assunto principal:
Curadoria de Dados
/
Análise da Expressão Gênica de Célula Única
Limite:
Humans
Idioma:
En
Ano de publicação:
2024
Tipo de documento:
Article