Your browser doesn't support javascript.
loading
Assessing parameter efficient methods for pre-trained language model in annotating scRNA-seq data.
Xia, Yucheng; Liu, Yuhang; Li, Tianhao; He, Sihan; Chang, Hong; Wang, Yaqing; Zhang, Yongqing; Ge, Wenyi.
Affiliation
  • Xia Y; Institute of Optics and Electronics, Chinese Academy of Sciences, Chengdu, 610209, China.
  • Liu Y; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
  • Li T; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
  • He S; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
  • Chang H; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
  • Wang Y; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
  • Zhang Y; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China.
  • Ge W; School of Computer Science, Chengdu University of Information Technology, Chengdu, 610225, China. Electronic address: gewenyi15@cuit.edu.cn.
Methods ; 228: 12-21, 2024 Aug.
Article in En | MEDLINE | ID: mdl-38759908
ABSTRACT
Annotating cell types of single-cell RNA sequencing (scRNA-seq) data is crucial for studying cellular heterogeneity in the tumor microenvironment. Recently, large-scale pre-trained language models (PLMs) have achieved significant progress in cell-type annotation of scRNA-seq data. This approach effectively addresses previous methods' shortcomings in performance and generalization. However, fine-tuning PLMs for different downstream tasks demands considerable computational resources, rendering it impractical. Hence, a new research branch introduces parameter-efficient fine-tuning (PEFT). This involves optimizing a few parameters while leaving the majority unchanged, leading to substantial reductions in computational expenses. Here, we utilize scBERT, a large-scale pre-trained model, to explore the capabilities of three PEFT methods in scRNA-seq cell type annotation. Extensive benchmark studies across several datasets demonstrate the superior applicability of PEFT methods. Furthermore, downstream analysis using models obtained through PEFT showcases their utility in novel cell type discovery and model interpretability for potential marker genes. Our findings underscore the considerable potential of PEFT in PLM-based cell type annotation, presenting novel perspectives for the analysis of scRNA-seq data.
Subject(s)
Key words

Full text: 1 Database: MEDLINE Main subject: Single-Cell Analysis / RNA-Seq Limits: Humans Language: En Journal: Methods Journal subject: BIOQUIMICA Year: 2024 Type: Article Affiliation country: China

Full text: 1 Database: MEDLINE Main subject: Single-Cell Analysis / RNA-Seq Limits: Humans Language: En Journal: Methods Journal subject: BIOQUIMICA Year: 2024 Type: Article Affiliation country: China