Your browser doesn't support javascript.
loading
Test-Time Training for Deep MS/MS Spectrum Prediction Improves Peptide Identification.
Ye, Jianbai; He, Xiangnan; Wang, Shujuan; Dong, Meng-Qiu; Wu, Feng; Lu, Shan; Feng, Fuli.
  • Ye J; MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • He X; MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Wang S; National Institute of Biological Sciences, Beijing 102206, China.
  • Dong MQ; National Institute of Biological Sciences, Beijing 102206, China.
  • Wu F; MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei, Anhui 230026, China.
  • Lu S; Department of Cellular and Molecular Medicine, University of California, San Diego, La Jolla, California 92093, United States.
  • Feng F; MoE Key Laboratory of Brain-inspired Intelligent Perception and Cognition, University of Science and Technology of China, Hefei, Anhui 230026, China.
J Proteome Res ; 23(2): 550-559, 2024 02 02.
Article en En | MEDLINE | ID: mdl-38153036
ABSTRACT
In bottom-up proteomics, peptide-spectrum matching is critical for peptide and protein identification. Recently, deep learning models have been used to predict tandem mass spectra of peptides, enabling the calculation of similarity scores between the predicted and experimental spectra for peptide-spectrum matching. These models follow the supervised learning paradigm, which trains a general model using paired peptides and spectra from standard data sets and directly employs the model on experimental data. However, this approach can lead to inaccurate predictions due to differences between the training data and the experimental data, such as sample types, enzyme specificity, and instrument calibration. To tackle this problem, we developed a test-time training paradigm that adapts the pretrained model to generate experimental data-specific models, namely, PepT3. PepT3 yields a 10-40% increase in peptide identification depending on the variability in training and experimental data. Intriguingly, when applied to a patient-derived immunopeptidomic sample, PepT3 increases the identification of tumor-specific immunopeptide candidates by 60%. Two-thirds of the newly identified candidates are predicted to bind to the patient's human leukocyte antigen isoforms. To facilitate access of the model and all the results, we have archived all the intermediate files in Zenodo.org with identifier 8231084.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Péptidos / Espectrometría de Masas en Tándem Límite: Humans Idioma: En Año: 2024 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Péptidos / Espectrometría de Masas en Tándem Límite: Humans Idioma: En Año: 2024 Tipo del documento: Article