Search | VHL CLAP/WR-PAHO/WHO

1.

The Reactome Pathway Knowledgebase 2024.

Milacic, Marija; Beavers, Deidre; Conley, Patrick; Gong, Chuqiao; Gillespie, Marc; Griss, Johannes; Haw, Robin; Jassal, Bijay; Matthews, Lisa; May, Bruce; Petryszak, Robert; Ragueneau, Eliot; Rothfels, Karen; Sevilla, Cristoffer; Shamovsky, Veronica; Stephan, Ralf; Tiwari, Krishna; Varusai, Thawfeek; Weiser, Joel; Wright, Adam; Wu, Guanming; Stein, Lincoln; Hermjakob, Henning; D'Eustachio, Peter.

Nucleic Acids Res ; 52(D1): D672-D678, 2024 Jan 05.

Article in English | MEDLINE | ID: mdl-37941124

ABSTRACT

The Reactome Knowledgebase (https://reactome.org), an Elixir and GCBR core biological data resource, provides manually curated molecular details of a broad range of normal and disease-related biological processes. Processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Here we review progress towards annotation of the entire human proteome, targeted annotation of disease-causing genetic variants of proteins and of small-molecule drugs in a pathway context, and towards supporting explicit annotation of cell- and tissue-specific pathways. Finally, we briefly discuss issues involved in making Reactome more fully interoperable with other related resources such as the Gene Ontology and maintaining the resulting community resource network.

Subject(s)

Knowledge Bases , Metabolic Networks and Pathways , Signal Transduction , Humans , Metabolic Networks and Pathways/genetics , Proteome/genetics

2.

ReactomeGSA: new features to simplify public data reuse.

Grentner, Alexander; Ragueneau, Eliot; Gong, Chuqiao; Prinz, Adrian; Gansberger, Sabina; Oyarzun, Inigo; Hermjakob, Henning; Griss, Johannes.

Bioinformatics ; 40(6)2024 Jun 03.

Article in English | MEDLINE | ID: mdl-38806182

ABSTRACT

MOTIVATION: ReactomeGSA is part of the Reactome knowledgebase and one of the leading multi-omics pathway analysis platforms. ReactomeGSA provides access to quantitative pathway analysis methods supporting different 'omics data types. Additionally, ReactomeGSA can process different datasets simultaneously, leading to a comparative pathway analysis that can also be performed across different species. RESULTS: We present a major update to the ReactomeGSA analysis platforms that greatly simplifies the reuse and direct integration of public data. In order to increase the number of available datasets, we developed the new grein_loader Python application that can directly fetch experiments from the GREIN resource. This enabled us to support both EMBL-EBI's Expression Atlas and GEO RNA-seq Experiments Interactive Navigator within ReactomeGSA. To further increase the visibility and simplify the reuse of public datasets, we integrated a novel search function into ReactomeGSA that enables users to search for public datasets across both supported resources. Finally, we completely re-developed ReactomeGSA's web-frontend and R/Bioconductor package to support the new search and loading features, and greatly simplify the use of ReactomeGSA. AVAILABILITY AND IMPLEMENTATION: The new ReactomeGSA web frontend is available at https://www.reactome.org/gsa with an built-in, interactive tutorial. The ReactomeGSA R package (https://bioconductor.org/packages/release/bioc/html/ReactomeGSA.html) is available through Bioconductor and shipped with detailed documentation and vignettes. The grein_loader Python application is available through the Python Package Index (pypi). The complete source code for all applications is available on GitHub at https://github.com/grisslab/grein_loader and https://github.com/reactome.

Subject(s)

Software , Humans , Computational Biology/methods , Knowledge Bases

3.

The reactome pathway knowledgebase 2022.

Gillespie, Marc; Jassal, Bijay; Stephan, Ralf; Milacic, Marija; Rothfels, Karen; Senff-Ribeiro, Andrea; Griss, Johannes; Sevilla, Cristoffer; Matthews, Lisa; Gong, Chuqiao; Deng, Chuan; Varusai, Thawfeek; Ragueneau, Eliot; Haider, Yusra; May, Bruce; Shamovsky, Veronica; Weiser, Joel; Brunson, Timothy; Sanati, Nasim; Beckman, Liam; Shao, Xiang; Fabregat, Antonio; Sidiropoulos, Konstantinos; Murillo, Julieth; Viteri, Guilherme; Cook, Justin; Shorser, Solomon; Bader, Gary; Demir, Emek; Sander, Chris; Haw, Robin; Wu, Guanming; Stein, Lincoln; Hermjakob, Henning; D'Eustachio, Peter.

Nucleic Acids Res ; 50(D1): D687-D692, 2022 01 07.

Article in English | MEDLINE | ID: mdl-34788843

ABSTRACT

The Reactome Knowledgebase (https://reactome.org), an Elixir core resource, provides manually curated molecular details across a broad range of physiological and pathological biological processes in humans, including both hereditary and acquired disease processes. The processes are annotated as an ordered network of molecular transformations in a single consistent data model. Reactome thus functions both as a digital archive of manually curated human biological processes and as a tool for discovering functional relationships in data such as gene expression profiles or somatic mutation catalogs from tumor cells. Recent curation work has expanded our annotations of normal and disease-associated signaling processes and of the drugs that target them, in particular infections caused by the SARS-CoV-1 and SARS-CoV-2 coronaviruses and the host response to infection. New tools support better simultaneous analysis of high-throughput data from multiple sources and the placement of understudied ('dark') proteins from analyzed datasets in the context of Reactome's manually curated pathways.

Subject(s)

Antiviral Agents/pharmacology , Knowledge Bases , Proteins/metabolism , COVID-19/metabolism , Data Curation , Genome, Human , Host-Pathogen Interactions , Humans , Proteins/genetics , Signal Transduction , Software

4.

Single-cell RNA sequencing defines disease-specific differences between chronic nodular prurigo and atopic dermatitis.

Alkon, Natalia; Assen, Frank P; Arnoldner, Tamara; Bauer, Wolfgang M; Medjimorec, Marco A; Shaw, Lisa E; Rindler, Katharina; Holzer, Gregor; Weber, Philipp; Weninger, Wolfgang; Freystätter, Christian; Chennareddy, Sumanth; Kinaciyan, Tamar; Farlik, Matthias; Jonak, Constanze; Griss, Johannes; Bangert, Christine; Brunner, Patrick M.

J Allergy Clin Immunol ; 152(2): 420-435, 2023 08.

Article in English | MEDLINE | ID: mdl-37210042

ABSTRACT

BACKGROUND: Chronic nodular prurigo (CNPG) is an inflammatory skin disease that is maintained by a chronic itch-scratch cycle likely rooted in neuroimmunological dysregulation. This condition may be associated with atopy in some patients, and there are now promising therapeutic results from blocking type 2 cytokines such as IL-4, IL-13, and IL-31. OBJECTIVES: This study aimed to improve the understanding of pathomechanisms underlying CNPG as well as molecular relationships between CNPG and atopic dermatitis (AD). METHODS: We profiled skin lesions from patients with CNPG in comparison with AD and healthy control individuals using single-cell RNA sequencing combined with T-cell receptor sequencing. RESULTS: We found type 2 immune skewing in both CNPG and AD, as evidenced by CD4+ helper T cells expressing IL13. However, only AD harbored an additional, oligoclonally expanded CD8A+IL9R+IL13+ cytotoxic T-cell population, and immune activation pathways were highly upregulated in AD, but less so in CNPG. Conversely, CNPG showed signatures of extracellular matrix organization, collagen synthesis, and fibrosis, including a unique population of CXCL14-IL24+ secretory papillary fibroblasts. Besides known itch mediators such as IL31 and oncostatin M, we also detected increased levels of neuromedin B in fibroblasts of CNPG lesions compared with AD and HC, with neuromedin B receptors detectable on some nerve endings. CONCLUSIONS: These data show that CNPG does not harbor the strong disease-specific immune activation pathways that are typically found in AD but is rather characterized by upregulated stromal remodeling mechanisms that might have a direct impact on itch fibers.

Subject(s)

Dermatitis, Atopic , Prurigo , Humans , Prurigo/genetics , Interleukin-13 , Pruritus , Sequence Analysis, RNA

5.

Evaluation of mortality, prognostic parameters, and treatment efficacy in mycosis fungoides.

Porkert, Stefanie; Griss, Johannes; Hudelist-Venz, Mercedes; Steiner, Irene; Valencak, Julia; Weninger, Wolfgang; Brunner, Patrick M; Jonak, Constanze.

J Dtsch Dermatol Ges ; 22(4): 532-550, 2024 Apr.

Article in English | MEDLINE | ID: mdl-38444271

ABSTRACT

BACKGROUND AND OBJECTIVES: Mycosis fungoides (MF), the most common primary cutaneous T-cell lymphoma, is characterized by a variable clinical course, presenting either as indolent disease or showing fatal progression due to extracutaneous involvement. Importantly, the lack of prognostic models and predominantly palliative therapy settings hamper patient care. Here, we aimed to define survival rates, disease prediction accuracy, and treatment impact in MF. PATIENTS AND METHODS: Hundred-forty MF patients were assessed retrospectively. Prognosis and disease progression/survival were analyzed using univariate Cox proportional hazards regression model and Kaplan-Meier estimates. RESULTS: Skin tumors were linked to shorter progression-free, overall survival and a 3.48 increased risk for disease progression when compared to erythroderma. The Cutaneous Lymphoma International Prognostic Index identified patients at risk in early-stage disease only. Moreover, expression of Ki-67 >20%, CD30 >10%, CD20+, and CD7- were associated with a significantly worse outcome independent of disease stage. Only single-agent interferon-α and phototherapy combined with interferon-α or retinoids/bexarotene achieved long-term disease control in MF. CONCLUSIONS: Our data support predictive validity of prognostic factors and models in MF and identified further potential parameters associated with poor survival. Prospective studies on prognostic indices across disease stages and treatment modalities are needed to predict and improve survival.

Subject(s)

Mycosis Fungoides , Skin Neoplasms , Humans , Prognosis , Retrospective Studies , Prospective Studies , Mycosis Fungoides/diagnosis , Mycosis Fungoides/therapy , Treatment Outcome , Interferon-alpha , Disease Progression , Neoplasm Staging

6.

Single-cell analysis reveals innate lymphoid cell lineage infidelity in atopic dermatitis.

Alkon, Natalia; Bauer, Wolfgang M; Krausgruber, Thomas; Goh, Issac; Griss, Johannes; Nguyen, Vy; Reininger, Baerbel; Bangert, Christine; Staud, Clement; Brunner, Patrick M; Bock, Christoph; Haniffa, Muzlifah; Stingl, Georg.

J Allergy Clin Immunol ; 149(2): 624-639, 2022 02.

Article in English | MEDLINE | ID: mdl-34363841

ABSTRACT

BACKGROUND: Although ample knowledge exists about phenotype and function of cutaneous T lymphocytes, much less is known about the lymphocytic components of the skin's innate immune system. OBJECTIVE: To better understand the biologic role of cutaneous innate lymphoid cells (ILCs), we investigated their phenotypic and molecular features under physiologic (normal human skin [NHS]) and pathologic (lesional skin of patients with atopic dermatitis [AD]) conditions. METHODS: Skin punch biopsies and reduction sheets as well as blood specimens were obtained from either patients with AD or healthy individuals. Cell and/or tissue samples were analyzed by flow cytometry, immunohistochemistry, and single-cell RNA sequencing or subjected to in vitro/ex vivo culture. RESULTS: Notwithstanding substantial quantitative differences between NHS and AD skin, we found that the vast majority of cutaneous ILCs belong to the CRTH2+ subset and reside in the upper skin layers. Single-cell RNA sequencing of cutaneous ILC-enriched cell samples confirmed the predominance of biologically heterogeneous group 2 ILCs and, for the first time, demonstrated considerable ILC lineage infidelity (coexpression of genes typical of either type 2 [GATA3 and IL13] or type 3/17 [RORC, IL22, and IL26] immunity within individual cells) in lesional AD skin, and to a much lesser extent, in NHS. Similar events were demonstrated in ILCs from skin explant cultures and in vitro expanded ILCs from the peripheral blood. CONCLUSION: These findings support the concept that instead of being a stable entity with well-defined components, the skin immune system consists of a network of highly flexible cellular players that are capable of adjusting their function to the needs and challenges of the environment.

Subject(s)

Cell Lineage , Lymphocytes/immunology , Single-Cell Analysis/methods , Dermatitis, Atopic/immunology , Flow Cytometry , Humans , Immunity, Innate , Killer Cells, Natural/immunology , RNA-Seq , Skin/immunology

7.

scAnnotatR: framework to accurately classify cell types in single-cell RNA-sequencing data.

Nguyen, Vy; Griss, Johannes.

BMC Bioinformatics ; 23(1): 44, 2022 Jan 17.

Article in English | MEDLINE | ID: mdl-35038984

ABSTRACT

BACKGROUND: Automatic cell type identification is essential to alleviate a key bottleneck in scRNA-seq data analysis. While most existing classification tools show good sensitivity and specificity, they often fail to adequately not-classify cells that are missing in the used reference. Additionally, many tools do not scale to the continuously increasing size of current scRNA-seq datasets. Therefore, additional tools are needed to solve these challenges. RESULTS: scAnnotatR is a novel R package that provides a complete framework to classify cells in scRNA-seq datasets using pre-trained classifiers. It supports both Seurat and Bioconductor's SingleCellExperiment and is thereby compatible with the vast majority of R-based analysis workflows. scAnnotatR uses hierarchically organised SVMs to distinguish a specific cell type versus all others. It shows comparable or even superior accuracy, sensitivity and specificity compared to existing tools while being able to not-classify unknown cell types. Moreover, scAnnotatR is the only of the best performing tools able to process datasets containing more than 600,000 cells. CONCLUSIONS: scAnnotatR is freely available on GitHub ( https://github.com/grisslab/scAnnotatR ) and through Bioconductor (from version 3.14). It is consistently among the best performing tools in terms of classification accuracy while scaling to the largest datasets.

Subject(s)

RNA , Single-Cell Analysis , RNA/genetics , Sequence Analysis, RNA , Exome Sequencing

8.

A Comprehensive Evaluation of Consensus Spectrum Generation Methods in Proteomics.

Luo, Xiyang; Bittremieux, Wout; Griss, Johannes; Deutsch, Eric W; Sachsenberg, Timo; Levitsky, Lev I; Ivanov, Mark V; Bubis, Julia A; Gabriels, Ralf; Webel, Henry; Sanchez, Aniel; Bai, Mingze; Käll, Lukas; Perez-Riverol, Yasset.

J Proteome Res ; 21(6): 1566-1574, 2022 06 03.

Article in English | MEDLINE | ID: mdl-35549218

ABSTRACT

Spectrum clustering is a powerful strategy to minimize redundant mass spectra by grouping them based on similarity, with the aim of forming groups of mass spectra from the same repeatedly measured analytes. Each such group of near-identical spectra can be represented by its so-called consensus spectrum for downstream processing. Although several algorithms for spectrum clustering have been adequately benchmarked and tested, the influence of the consensus spectrum generation step is rarely evaluated. Here, we present an implementation and benchmark of common consensus spectrum algorithms, including spectrum averaging, spectrum binning, the most similar spectrum, and the best-identified spectrum. We have analyzed diverse public data sets using two different clustering algorithms (spectra-cluster and MaRaCluster) to evaluate how the consensus spectrum generation procedure influences downstream peptide identification. The BEST and BIN methods were found the most reliable methods for consensus spectrum generation, including for data sets with post-translational modifications (PTM) such as phosphorylation. All source code and data of the present study are freely available on GitHub at https://github.com/statisticalbiotechnology/representative-spectra-benchmark.

Subject(s)

Proteomics , Tandem Mass Spectrometry , Algorithms , Cluster Analysis , Consensus , Databases, Protein , Proteomics/methods , Software , Tandem Mass Spectrometry/methods

9.

Separating Golgi Proteins from Cis to Trans Reveals Underlying Properties of Cisternal Localization.

Parsons, Harriet T; Stevens, Tim J; McFarlane, Heather E; Vidal-Melgosa, Silvia; Griss, Johannes; Lawrence, Nicola; Butler, Richard; Sousa, Mirta M L; Salemi, Michelle; Willats, William G T; Petzold, Christopher J; Heazlewood, Joshua L; Lilley, Kathryn S.

Plant Cell ; 31(9): 2010-2034, 2019 09.

Article in English | MEDLINE | ID: mdl-31266899

ABSTRACT

The order of enzymatic activity across Golgi cisternae is essential for complex molecule biosynthesis. However, an inability to separate Golgi cisternae has meant that the cisternal distribution of most resident proteins, and their underlying localization mechanisms, are unknown. Here, we exploit differences in surface charge of intact cisternae to perform separation of early to late Golgi subcompartments. We determine protein and glycan abundance profiles across the Golgi; over 390 resident proteins are identified, including 136 new additions, with over 180 cisternal assignments. These assignments provide a means to better understand the functional roles of Golgi proteins and how they operate sequentially. Protein and glycan distributions are validated in vivo using high-resolution microscopy. Results reveal distinct functional compartmentalization among resident Golgi proteins. Analysis of transmembrane proteins shows several sequence-based characteristics relating to pI, hydrophobicity, Ser abundance, and Phe bilayer asymmetry that change across the Golgi. Overall, our results suggest that a continuum of transmembrane features, rather than discrete rules, guide proteins to earlier or later locations within the Golgi stack.

Subject(s)

Golgi Apparatus/metabolism , Plant Proteins/chemistry , Plant Proteins/metabolism , Golgi Apparatus/ultrastructure , Hydrophobic and Hydrophilic Interactions , Intracellular Membranes , Membrane Proteins/chemistry , Membrane Proteins/metabolism , Polysaccharides/chemistry , Polysaccharides/metabolism , Proteome

10.

ReactomeGSA - Efficient Multi-Omics Comparative Pathway Analysis.

Griss, Johannes; Viteri, Guilherme; Sidiropoulos, Konstantinos; Nguyen, Vy; Fabregat, Antonio; Hermjakob, Henning.

Mol Cell Proteomics ; 19(12): 2115-2125, 2020 12.

Article in English | MEDLINE | ID: mdl-32907876

ABSTRACT

Pathway analyses are key methods to analyze 'omics experiments. Nevertheless, integrating data from different 'omics technologies and different species still requires considerable bioinformatics knowledge.Here we present the novel ReactomeGSA resource for comparative pathway analyses of multi-omics datasets. ReactomeGSA can be used through Reactome's existing web interface and the novel ReactomeGSA R Bioconductor package with explicit support for scRNA-seq data. Data from different species is automatically mapped to a common pathway space. Public data from ExpressionAtlas and Single Cell ExpressionAtlas can be directly integrated in the analysis. ReactomeGSA greatly reduces the technical barrier for multi-omics, cross-species, comparative pathway analyses.We used ReactomeGSA to characterize the role of B cells in anti-tumor immunity. We compared B cell rich and poor human cancer samples from five of the Cancer Genome Atlas (TCGA) transcriptomics and two of the Clinical Proteomic Tumor Analysis Consortium (CPTAC) proteomics studies. B cell-rich lung adenocarcinoma samples lacked the otherwise present activation through NFkappaB. This may be linked to the presence of a specific subset of tumor associated IgG+ plasma cells that lack NFkappaB activation in scRNA-seq data from human melanoma. This showcases how ReactomeGSA can derive novel biomedical insights by integrating large multi-omics datasets.

Subject(s)

Databases, Genetic , Proteomics , Software , B-Lymphocytes/immunology , Humans , Internet , User-Computer Interface

11.

STAT3-dependent analysis reveals PDK4 as independent predictor of recurrence in prostate cancer.

Oberhuber, Monika; Pecoraro, Matteo; Rusz, Mate; Oberhuber, Georg; Wieselberg, Maritta; Haslinger, Peter; Gurnhofer, Elisabeth; Schlederer, Michaela; Limberger, Tanja; Lagger, Sabine; Pencik, Jan; Kodajova, Petra; Högler, Sandra; Stockmaier, Georg; Grund-Gröschke, Sandra; Aberger, Fritz; Bolis, Marco; Theurillat, Jean-Philippe; Wiebringhaus, Robert; Weiss, Theresa; Haitel, Andrea; Brehme, Marc; Wadsak, Wolfgang; Griss, Johannes; Mohr, Thomas; Hofer, Alexandra; Jäger, Anton; Pollheimer, Jürgen; Egger, Gerda; Koellensperger, Gunda; Mann, Matthias; Hantusch, Brigitte; Kenner, Lukas.

Mol Syst Biol ; 16(4): e9247, 2020 04.

Article in English | MEDLINE | ID: mdl-32323921

ABSTRACT

Prostate cancer (PCa) has a broad spectrum of clinical behavior; hence, biomarkers are urgently needed for risk stratification. Here, we aim to find potential biomarkers for risk stratification, by utilizing a gene co-expression network of transcriptomics data in addition to laser-microdissected proteomics from human and murine prostate FFPE samples. We show up-regulation of oxidative phosphorylation (OXPHOS) in PCa on the transcriptomic level and up-regulation of the TCA cycle/OXPHOS on the proteomic level, which is inversely correlated to STAT3 expression. We hereby identify gene expression of pyruvate dehydrogenase kinase 4 (PDK4), a key regulator of the TCA cycle, as a promising independent prognostic marker in PCa. PDK4 predicts disease recurrence independent of diagnostic risk factors such as grading, staging, and PSA level. Therefore, low PDK4 is a promising marker for PCa with dismal prognosis.

Subject(s)

Gene Expression Profiling/methods , Neoplasm Recurrence, Local/genetics , Neoplasms, Experimental/pathology , Prostatic Neoplasms/genetics , Proteomics/methods , Pyruvate Dehydrogenase Acetyl-Transferring Kinase/genetics , STAT3 Transcription Factor/genetics , Animals , Biomarkers, Tumor/genetics , Biomarkers, Tumor/metabolism , Gene Expression Regulation, Neoplastic , Humans , Laser Capture Microdissection , Male , Mice , Neoplasm Grading , Neoplasm Recurrence, Local/metabolism , Neoplasm Recurrence, Local/pathology , Neoplasms, Experimental/genetics , Neoplasms, Experimental/metabolism , Oxidative Phosphorylation , Prognosis , Prostatic Neoplasms/metabolism , Prostatic Neoplasms/pathology , Pyruvate Dehydrogenase Acetyl-Transferring Kinase/metabolism , STAT3 Transcription Factor/metabolism , Systems Biology , Young Adult

12.

The PRIDE database and related tools and resources in 2019: improving support for quantification data.

Perez-Riverol, Yasset; Csordas, Attila; Bai, Jingwen; Bernal-Llinares, Manuel; Hewapathirana, Suresh; Kundu, Deepti J; Inuganti, Avinash; Griss, Johannes; Mayer, Gerhard; Eisenacher, Martin; Pérez, Enrique; Uszkoreit, Julian; Pfeuffer, Julianus; Sachsenberg, Timo; Yilmaz, Sule; Tiwary, Shivani; Cox, Jürgen; Audain, Enrique; Walzer, Mathias; Jarnuczak, Andrew F; Ternent, Tobias; Brazma, Alvis; Vizcaíno, Juan Antonio.

Nucleic Acids Res ; 47(D1): D442-D450, 2019 01 08.

Article in English | MEDLINE | ID: mdl-30395289

ABSTRACT

The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.

Subject(s)

Databases, Protein , Mass Spectrometry , Proteomics , Peptides/chemistry , Software

13.

IsoProt: A Complete and Reproducible Workflow To Analyze iTRAQ/TMT Experiments.

Griss, Johannes; Vinterhalter, Goran; Schwämmle, Veit.

J Proteome Res ; 18(4): 1751-1759, 2019 04 05.

Article in English | MEDLINE | ID: mdl-30855969

ABSTRACT

Reproducibility has become a major concern in biomedical research. In proteomics, bioinformatic workflows can quickly consist of multiple software tools each with its own set of parameters. Their usage involves the definition of often hundreds of parameters as well as data operations to ensure tool interoperability. Hence, a manuscript's methods section is often insufficient to completely describe and reproduce a data analysis workflow. Here we present IsoProt: A complete and reproducible bioinformatic workflow deployed on a portable container environment to analyze data from isobarically labeled, quantitative proteomics experiments. The workflow uses only open source tools and provides a user-friendly and interactive browser interface to configure and execute the different operations. Once the workflow is executed, the results including the R code to perform statistical analyses can be downloaded as an HTML document providing a complete record of the performed analyses. IsoProt therefore represents a reproducible bioinformatics workflow that will yield identical results on any computer platform.

Subject(s)

Isotope Labeling , Proteome/analysis , Proteomics/methods , Software , Tandem Mass Spectrometry , Animals , Databases, Factual , Malaria, Cerebral/metabolism , Mice , Proteome/chemistry , Proteome/metabolism , Reproducibility of Results

14.

Spectral Clustering Improves Label-Free Quantification of Low-Abundant Proteins.

Griss, Johannes; Stanek, Florian; Hudecz, Otto; Dürnberger, Gerhard; Perez-Riverol, Yasset; Vizcaíno, Juan Antonio; Mechtler, Karl.

J Proteome Res ; 18(4): 1477-1485, 2019 04 05.

Article in English | MEDLINE | ID: mdl-30859831

ABSTRACT

Label-free quantification has become a common-practice in many mass spectrometry-based proteomics experiments. In recent years, we and others have shown that spectral clustering can considerably improve the analysis of (primarily large-scale) proteomics data sets. Here we show that spectral clustering can be used to infer additional peptide-spectrum matches and improve the quality of label-free quantitative proteomics data in data sets also containing only tens of MS runs. We analyzed four well-known public benchmark data sets that represent different experimental settings using spectral counting and peak intensity based label-free quantification. In both approaches, the additionally inferred peptide-spectrum matches through our spectra-cluster algorithm improved the detectability of low abundant proteins while increasing the accuracy of the derived quantitative data, without increasing the data sets' noise. Additionally, we developed a Proteome Discoverer node for our spectra-cluster algorithm which allows anyone to rebuild our proposed pipeline using the free version of Proteome Discoverer.

Subject(s)

Cluster Analysis , Mass Spectrometry/methods , Proteome/analysis , Proteomics/methods , Algorithms , Databases, Protein , Humans

15.

Recognizing millions of consistently unidentified spectra across hundreds of shotgun proteomics datasets.

Griss, Johannes; Perez-Riverol, Yasset; Lewis, Steve; Tabb, David L; Dianes, José A; Del-Toro, Noemi; Rurik, Marc; Walzer, Mathias W; Kohlbacher, Oliver; Hermjakob, Henning; Wang, Rui; Vizcaíno, Juan Antonio.

Nat Methods ; 13(8): 651-656, 2016 Aug.

Article in English | MEDLINE | ID: mdl-27493588

ABSTRACT

Mass spectrometry (MS) is the main technology used in proteomics approaches. However, on average 75% of spectra analysed in an MS experiment remain unidentified. We propose to use spectrum clustering at a large-scale to shed a light on these unidentified spectra. PRoteomics IDEntifications database (PRIDE) Archive is one of the largest MS proteomics public data repositories worldwide. By clustering all tandem MS spectra publicly available in PRIDE Archive, coming from hundreds of datasets, we were able to consistently characterize three distinct groups of spectra: 1) incorrectly identified spectra, 2) spectra correctly identified but below the set scoring threshold, and 3) truly unidentified spectra. Using a multitude of complementary analysis approaches, we were able to identify less than 20% of the consistently unidentified spectra. The complete spectrum clustering results are available through the new version of the PRIDE Cluster resource (http://www.ebi.ac.uk/pride/cluster). This resource is intended, among other aims, to encourage and simplify further investigation into these unidentified spectra.

16.

Future Prospects of Spectral Clustering Approaches in Proteomics.

Perez-Riverol, Yasset; Vizcaíno, Juan Antonio; Griss, Johannes.

Proteomics ; 18(14): e1700454, 2018 07.

Article in English | MEDLINE | ID: mdl-29882266

ABSTRACT

In this article, current and future applications of spectral clustering are discussed in the context of mass spectrometry-based proteomics approaches. First of all, the main algorithms and tools that can currently be used to perform spectral clustering are introduced. In addition, its main applications and their use in current computational proteomics workflows are explained, including the generation of spectral libraries and spectral archives. Finally, possible future directions for spectral clustering, including its potential use to achieve a deeper coverage of the proteome and the discovery of novel post-translational modifications and single amino acid variants.

Subject(s)

Algorithms , Cluster Analysis , Proteomics/methods , Spectrum Analysis/methods , Databases, Protein , Humans , Proteome/analysis

17.

Response to "Comparison and Evaluation of Clustering Algorithms for Tandem Mass Spectra".

Griss, Johannes; Perez-Riverol, Yasset; The, Matthew; Käll, Lukas; Vizcaíno, Juan Antonio.

J Proteome Res ; 17(5): 1993-1996, 2018 05 04.

Article in English | MEDLINE | ID: mdl-29682973

ABSTRACT

In the recent benchmarking article entitled "Comparison and Evaluation of Clustering Algorithms for Tandem Mass Spectra", Rieder et al. compared several different approaches to cluster MS/MS spectra. While we certainly recognize the value of the manuscript, here, we report some shortcomings detected in the original analyses. For most analyses, the authors clustered only single MS/MS runs. In one of the reported analyses, three MS/MS runs were processed together, which already led to computational performance issues in many of the tested approaches. This fact highlights the difficulties of using many of the tested algorithms on the nowadays produced average proteomics data sets. Second, the authors only processed identified spectra when merging MS runs. Thereby, all unidentified spectra that are of lower quality were already removed from the data set and could not influence the clustering results. Next, we found that the authors did not analyze the effect of chimeric spectra on the clustering results. In our analysis, we found that 3% of the spectra in the used data sets were chimeric, and this had marked effects on the behavior of the different clustering algorithms tested. Finally, the authors' choice to evaluate the MS-Cluster and spectra-cluster algorithms using a precursor tolerance of 5 Da for high-resolution Orbitrap data only was, in our opinion, not adequate to assess the performance of MS/MS clustering approaches.

Subject(s)

Algorithms , Tandem Mass Spectrometry , Benchmarking , Cluster Analysis , Proteomics

18.

Expanding the Use of Spectral Libraries in Proteomics.

Deutsch, Eric W; Perez-Riverol, Yasset; Chalkley, Robert J; Wilhelm, Mathias; Tate, Stephen; Sachsenberg, Timo; Walzer, Mathias; Käll, Lukas; Delanghe, Bernard; Böcker, Sebastian; Schymanski, Emma L; Wilmes, Paul; Dorfer, Viktoria; Kuster, Bernhard; Volders, Pieter-Jan; Jehmlich, Nico; Vissers, Johannes P C; Wolan, Dennis W; Wang, Ana Y; Mendoza, Luis; Shofstahl, Jim; Dowsey, Andrew W; Griss, Johannes; Salek, Reza M; Neumann, Steffen; Binz, Pierre-Alain; Lam, Henry; Vizcaíno, Juan Antonio; Bandeira, Nuno; Röst, Hannes.

J Proteome Res ; 17(12): 4051-4060, 2018 12 07.

Article in English | MEDLINE | ID: mdl-30270626

ABSTRACT

The 2017 Dagstuhl Seminar on Computational Proteomics provided an opportunity for a broad discussion on the current state and future directions of the generation and use of peptide tandem mass spectrometry spectral libraries. Their use in proteomics is growing slowly, but there are multiple challenges in the field that must be addressed to further increase the adoption of spectral libraries and related techniques. The primary bottlenecks are the paucity of high quality and comprehensive libraries and the general difficulty of adopting spectral library searching into existing workflows. There are several existing spectral library formats, but none captures a satisfactory level of metadata; therefore, a logical next improvement is to design a more advanced, Proteomics Standards Initiative-approved spectral library format that can encode all of the desired metadata. The group discussed a series of metadata requirements organized into three designations of completeness or quality, tentatively dubbed bronze, silver, and gold. The metadata can be organized at four different levels of granularity: at the collection (library) level, at the individual entry (peptide ion) level, at the peak (fragment ion) level, and at the peak annotation level. Strategies for encoding mass modifications in a consistent manner and the requirement for encoding high-quality and commonly seen but as-yet-unidentified spectra were discussed. The group also discussed related topics, including strategies for comparing two spectra, techniques for generating representative spectra for a library, approaches for selection of optimal signature ions for targeted workflows, and issues surrounding the merging of two or more libraries into one. We present here a review of this field and the challenges that the community must address in order to accelerate the adoption of spectral libraries in routine analysis of proteomics datasets.

Subject(s)

Databases, Protein/standards , Peptide Library , Proteomics/methods , Animals , Humans , Tandem Mass Spectrometry/methods , Workflow

19.

BioContainers: an open-source and community-driven framework for software standardization.

da Veiga Leprevost, Felipe; Grüning, Björn A; Alves Aflitos, Saulo; Röst, Hannes L; Uszkoreit, Julian; Barsnes, Harald; Vaudel, Marc; Moreno, Pablo; Gatto, Laurent; Weber, Jonas; Bai, Mingze; Jimenez, Rafael C; Sachsenberg, Timo; Pfeuffer, Julianus; Vera Alvarez, Roberto; Griss, Johannes; Nesvizhskii, Alexey I; Perez-Riverol, Yasset.

Bioinformatics ; 33(16): 2580-2582, 2017 Aug 15.

Article in English | MEDLINE | ID: mdl-28379341

ABSTRACT

MOTIVATION: BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters). AVAILABILITY AND IMPLEMENTATION: The software is freely available at github.com/BioContainers/. CONTACT: yperez@ebi.ac.uk.

Subject(s)

Computational Biology/methods , Software , Genomics/methods , Metabolomics/methods , Proteomics/methods

20.

Digital image analysis improves precision of PD-L1 scoring in cutaneous melanoma.

Koelzer, Viktor H; Gisler, Aline; Hanhart, Jonathan C; Griss, Johannes; Wagner, Stephan N; Willi, Niels; Cathomas, Gieri; Sachs, Melanie; Kempf, Werner; Thommen, Daniela S; Mertz, Kirsten D.

Histopathology ; 73(3): 397-406, 2018 Sep.

Article in English | MEDLINE | ID: mdl-29660160

ABSTRACT

AIMS: Immune checkpoint inhibitors have become a successful treatment in metastatic melanoma. The high response rates in a subset of patients suggest that a sensitive companion diagnostic test is required. The predictive value of programmed death ligand 1 (PD-L1) staining in melanoma has been questioned due to inconsistent correlation with clinical outcome. Whether this is due to predictive irrelevance of PD-L1 expression or inaccurate assessment techniques remains unclear. The aim of this study was to develop a standardised digital protocol for the assessment of PD-L1 staining in melanoma and to compare the output data and reproducibility to conventional assessment by expert pathologists. METHODS AND RESULTS: In two cohorts with a total of 69 cutaneous melanomas, a highly significant correlation was found between pathologist-based consensus reading and automated PD-L1 analysis (r = 0.97, P < 0.0001). Digital scoring captured the full diagnostic spectrum of PD-L1 expression at single cell resolution. An average of 150 472 melanoma cells (median 38 668 cells; range = 733-1 078 965) were scored per lesion. Machine learning was used to control for heterogeneity introduced by PD-L1-positive inflammatory cells in the tumour microenvironment. The PD-L1 image analysis protocol showed excellent reproducibility (r = 1.0, P < 0.0001) when carried out on independent workstations and reduced variability in PD-L1 scoring of human observers. When melanomas were grouped by PD-L1 expression status, we found a clear correlation of PD-L1 positivity with CD8-positive T cell infiltration, but not with tumour stage, metastasis or driver mutation status. CONCLUSION: Digital evaluation of PD-L1 reduces scoring variability and may facilitate patient stratification in clinical practice.

Subject(s)

B7-H1 Antigen/biosynthesis , Biomarkers, Tumor/analysis , Image Interpretation, Computer-Assisted/methods , Melanoma/pathology , Skin Neoplasms/pathology , Adult , Aged , Aged, 80 and over , B7-H1 Antigen/analysis , Female , Humans , Male , Middle Aged , Reproducibility of Results , Young Adult , Melanoma, Cutaneous Malignant

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

ABSTRACT

Subject(s)

SEND TO:

SELECTION OF CITATIONS

SEARCH DETAIL