PROSE: phenotype-specific network signatures from individual proteomic samples.

Wong, Bertrand Jern Han; Kong, Weijia; Peng, Hui; Goh, Wilson Wen Bin

Wong, Bertrand Jern Han; Kong, Weijia; Peng, Hui; Goh, Wilson Wen Bin.

Affiliation

Wong BJH; School of Biological Sciences, Nanyang Technological University, Singapore.
Kong W; School of Biological Sciences, Nanyang Technological University, Singapore.
Peng H; Lee Kong Chian School of Medicine, Nanyang Technological University, Singapore.
Goh WWB; School of Computer Science, National University of Singapore, Singapore.

Brief Bioinform ; 24(2)2023 03 19.

Article in En | MEDLINE | ID: mdl-36907650

ABSTRACT

Proteomic studies characterize the protein composition of complex biological samples. Despite recent advancements in mass spectrometry instrumentation and computational tools, low proteome coverage and interpretability remains a challenge. To address this, we developed Proteome Support Vector Enrichment (PROSE), a fast, scalable and lightweight pipeline for scoring proteins based on orthogonal gene co-expression network matrices. PROSE utilizes simple protein lists as input, generating a standard enrichment score for all proteins, including undetected ones. In our benchmark with 7 other candidate prioritization techniques, PROSE shows high accuracy in missing protein prediction, with scores correlating strongly to corresponding gene expression data. As a further proof-of-concept, we applied PROSE to a reanalysis of the Cancer Cell Line Encyclopedia proteomics dataset, where it captures key phenotypic features, including gene dependency. We lastly demonstrated its applicability on a breast cancer clinical dataset, showing clustering by annotated molecular subtype and identification of putative drivers of triple-negative breast cancer. PROSE is available as a user-friendly Python module from https://github.com/bwbio/PROSE.

Subject(s)

Proteome; Proteomics; Proteomics/methods; Proteome/analysis

Key words

candidate prioritization; enrichment scoring; integrated analysis; machine learning; network; proteomics; support vector machine (SVM)

Fulltext

XML

PubMed Links

Search on Google

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Proteome / Proteomics Type of study: Prognostic_studies Language: En Journal: Brief Bioinform Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Type: Article Affiliation country: Singapore

Fulltext

XML

PubMed Links

Search on Google