RESUMO
In the era of open-modification search engines, more posttranslational modifications than ever can be detected by LC-MS/MS-based proteomics. This development can switch proteomics research into a higher gear, as PTMs are key in many cellular pathways important in cell proliferation, migration, metastasis, and aging. However, despite these advances in modification identification, statistical methods for PTM-level quantification and differential analysis have yet to catch up. This absence can partly be explained by statistical challenges inherent to the data, such as the confounding of PTM intensities with its parent protein abundance. Therefore, we have developed msqrob2PTM, a new workflow in the msqrob2 universe capable of differential abundance analysis at the PTM and at the peptidoform level. The latter is important for validating PTMs found as significantly differential. Indeed, as our method can deal with multiple PTMs per peptidoform, there is a possibility that significant PTMs stem from one significant peptidoform carrying another PTM, hinting that it might be the other PTM driving the perceived differential abundance. Our workflows can flag both differential peptidoform abundance (DPA) and differential peptidoform usage (DPU). This enables a distinction between direct assessment of differential abundance of peptidoforms (DPA) and differences in the relative usage of peptidoforms corrected for corresponding protein abundances (DPU). For DPA, we directly model the log2-transformed peptidoform intensities, while for DPU, we correct for parent protein abundance by an intermediate normalization step which calculates the log2-ratio of the peptidoform intensities to their summarized parent protein intensities. We demonstrated the utility and performance of msqrob2PTM by applying it to datasets with known ground truth, as well as to biological PTM-rich datasets. Our results show that msqrob2PTM is on par with, or surpassing the performance of, the current state-of-the-art methods. Moreover, msqrob2PTM is currently unique in providing output at the peptidoform level.
Assuntos
Proteômica , Espectrometria de Massas em Tandem , Proteômica/métodos , Cromatografia Líquida , Processamento de Proteína Pós-Traducional , ProteínasRESUMO
Single-cell proteomics can offer valuable insights into dynamic cellular interactions, but identifying proteins at this level is challenging due to their low abundance. In this chapter, we present a state-of-the-art bioinformatics pipeline for single-cell proteomics that combines the search engine Sage (via SearchGUI), identification rescoring with MS2Rescore, quantification through FlashLFQ, and differential expression analysis using MSqRob2. MS2Rescore leverages LC-MS/MS behavior predictors, such as MS2PIP and DeepLC, to recalibrate scores with Percolator or mokapot. Combining these tools into a unified pipeline, this approach improves the detection of low-abundance peptides, resulting in increased identifications while maintaining stringent FDR thresholds.
Assuntos
Biologia Computacional , Proteômica , Análise de Célula Única , Software , Espectrometria de Massas em Tandem , Análise de Célula Única/métodos , Biologia Computacional/métodos , Proteômica/métodos , Espectrometria de Massas em Tandem/métodos , Humanos , Cromatografia Líquida/métodos , Ferramenta de Busca , Proteoma/análiseRESUMO
The holistic nature of omics studies makes them ideally suited to generate hypotheses on health and disease. Sequencing-based genomics and mass spectrometry (MS)-based proteomics are linked through epigenetic regulation mechanisms. However, epigenomics is currently mainly focused on DNA methylation status using sequencing technologies, while studying histone posttranslational modifications (hPTMs) using MS is lagging, partly because reuse of raw data is impractical. Yet, targeting hPTMs using epidrugs is an established promising research avenue in cancer treatment. Therefore, we here present the most comprehensive MS-based preprocessed hPTM atlas to date, including 21 T-cell acute lymphoblastic leukemia (T-ALL) cell lines. We present the data in an intuitive and browsable single licensed Progenesis QIP project and provide all essential quality metrics, allowing users to assess the quality of the data, edit individual peptides, try novel annotation algorithms and export both peptide and protein data for downstream analyses, exemplified by the PeptidoformViz tool. This data resource sets the stage for generalizing MS-based histone analysis and provides the first reusable histone dataset for epidrug development.