Your browser doesn't support javascript.
loading
scMMT: a multi-use deep learning approach for cell annotation, protein prediction and embedding in single-cell RNA-seq data.
Zhou, Songqi; Li, Yang; Wu, Wenyuan; Li, Li.
Affiliation
  • Zhou S; Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China.
  • Li Y; Chongqing School, University of Chinese Academy of Sciences, Chongqing, China.
  • Wu W; Chongqing Institute of Green and Intelligent Technology, Chinese Academy of Sciences, Chongqing, China.
  • Li L; Chongqing School, University of Chinese Academy of Sciences, Chongqing, China.
Brief Bioinform ; 25(2)2024 Jan 22.
Article in En | MEDLINE | ID: mdl-38300515
ABSTRACT
Accurate cell type annotation in single-cell RNA-sequencing data is essential for advancing biological and medical research, particularly in understanding disease progression and tumor microenvironments. However, existing methods are constrained by single feature extraction approaches, lack of adaptability to immune cell types with similar molecular profiles but distinct functions and a failure to account for the impact of cell label noise on model accuracy, all of which compromise the precision of annotation. To address these challenges, we developed a supervised approach called scMMT. We proposed a novel feature extraction technique to uncover more valuable information. Additionally, we constructed a multi-task learning framework based on the GradNorm method to enhance the recognition of challenging immune cells and reduce the impact of label noise by facilitating mutual reinforcement between cell type annotation and protein prediction tasks. Furthermore, we introduced logarithmic weighting and label smoothing mechanisms to enhance the recognition ability of rare cell types and prevent model overconfidence. Through comprehensive evaluations on multiple public datasets, scMMT has demonstrated state-of-the-art performance in various aspects including cell type annotation, rare cell identification, dropout and label noise resistance, protein expression prediction and low-dimensional embedding representation.
Subject(s)
Key words

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Biomedical Research / Deep Learning Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2024 Type: Article Affiliation country: China

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Biomedical Research / Deep Learning Type of study: Prognostic_studies / Risk_factors_studies Limits: Humans Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2024 Type: Article Affiliation country: China