Your browser doesn't support javascript.
loading
Machine Learning on Large-Scale Proteomics Data Identifies Tissue and Cell-Type Specific Proteins.
Claeys, Tine; Menu, Maxime; Bouwmeester, Robbin; Gevaert, Kris; Martens, Lennart.
Afiliação
  • Claeys T; VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.
  • Menu M; Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium.
  • Bouwmeester R; VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.
  • Gevaert K; Department of Biomolecular Medicine, Ghent University, 9052 Ghent, Belgium.
  • Martens L; VIB-UGent Center for Medical Biotechnology, VIB, 9052 Ghent, Belgium.
J Proteome Res ; 22(4): 1181-1192, 2023 04 07.
Article em En | MEDLINE | ID: mdl-36963412
Using data from 183 public human data sets from PRIDE, a machine learning model was trained to identify tissue and cell-type specific protein patterns. PRIDE projects were searched with ionbot and tissue/cell type annotation was manually added. Data from physiological samples were used to train a Random Forest model on protein abundances to classify samples into tissues and cell types. Subsequently, a one-vs-all classification and feature importance were used to analyze the most discriminating protein abundances per class. Based on protein abundance alone, the model was able to predict tissues with 98% accuracy, and cell types with 99% accuracy. The F-scores describe a clear view on tissue-specific proteins and tissue-specific protein expression patterns. In-depth feature analysis shows slight confusion between physiologically similar tissues, demonstrating the capacity of the algorithm to detect biologically relevant patterns. These results can in turn inform downstream uses, from identification of the tissue of origin of proteins in complex samples such as liquid biopsies, to studying the proteome of tissue-like samples such as organoids and cell lines.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Proteoma / Proteômica Idioma: En Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Bélgica

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Proteoma / Proteômica Idioma: En Ano de publicação: 2023 Tipo de documento: Article País de afiliação: Bélgica