scMMT: a multi-use deep learning approach for cell annotation, protein prediction and embedding in single-cell RNA-seq data.
Brief Bioinform
; 25(2)2024 Jan 22.
Article
in En
| MEDLINE
| ID: mdl-38300515
ABSTRACT
Accurate cell type annotation in single-cell RNA-sequencing data is essential for advancing biological and medical research, particularly in understanding disease progression and tumor microenvironments. However, existing methods are constrained by single feature extraction approaches, lack of adaptability to immune cell types with similar molecular profiles but distinct functions and a failure to account for the impact of cell label noise on model accuracy, all of which compromise the precision of annotation. To address these challenges, we developed a supervised approach called scMMT. We proposed a novel feature extraction technique to uncover more valuable information. Additionally, we constructed a multi-task learning framework based on the GradNorm method to enhance the recognition of challenging immune cells and reduce the impact of label noise by facilitating mutual reinforcement between cell type annotation and protein prediction tasks. Furthermore, we introduced logarithmic weighting and label smoothing mechanisms to enhance the recognition ability of rare cell types and prevent model overconfidence. Through comprehensive evaluations on multiple public datasets, scMMT has demonstrated state-of-the-art performance in various aspects including cell type annotation, rare cell identification, dropout and label noise resistance, protein expression prediction and low-dimensional embedding representation.
Key words
Full text:
1
Collection:
01-internacional
Database:
MEDLINE
Main subject:
Biomedical Research
/
Deep Learning
Type of study:
Prognostic_studies
/
Risk_factors_studies
Limits:
Humans
Language:
En
Journal:
Brief Bioinform
Journal subject:
BIOLOGIA
/
INFORMATICA MEDICA
Year:
2024
Type:
Article
Affiliation country:
China