RESUMO
MALDI mass spectrometry imaging (MSI) is emerging as a tool for protein and peptide imaging across tissue sections. Despite extensive study, there does not yet exist a baseline study evaluating the potential capabilities for this technique to detect diverse proteins in tissue sections. In this study, we developed a systematic approach for characterizing MALDI-MSI workflows in terms of limits of detection, coefficients of variation, spatial resolution, and the identification of endogenous tissue proteins. Our goal was to quantify these figures of merit for a number of different proteins and peptides, in order to gain more insight in the feasibility of protein biomarker discovery efforts using this technique. Control proteins and peptides were deposited in serial dilutions on thinly sectioned mouse xenograft tissue. Using our experimental setup, coefficients of variation were <30% on tissue sections and spatial resolution was 200 µm (or greater). Limits of detection for proteins and peptides on tissue were in the micromolar to millimolar range. Protein identification was only possible for proteins present in high abundance in the tissue. These results provide a baseline for the application of MALDI-MSI towards the discovery of new candidate biomarkers and a new benchmarking strategy that can be used for comparing diverse MALDI-MSI workflows.
Assuntos
Biomarcadores , Peptídeos/genética , Proteínas/genética , Espectrometria de Massas por Ionização e Dessorção a Laser Assistida por Matriz/métodos , Animais , Humanos , Limite de Detecção , Camundongos , Peptídeos/isolamento & purificação , Proteínas/isolamento & purificação , Proteômica , Ensaios Antitumorais Modelo de XenoenxertoRESUMO
BACKGROUND: Nasopharyngeal carcinoma (NPC) is a distinct cancer of the head and neck that is highly prevalent in Southeast Asia and North Africa. Though an extensive analysis of environmental and genetic contributors has been performed, very little is known about the proteome of this disease. A proteomic analysis of formalin-fixed paraffin-embedded (FFPE) tissues can provide valuable information on protein expression and molecular patterns for both increasing our understanding of the disease and for biomarker discovery. To date, very few NPC proteomic studies have been performed, and none focused on patients from Morocco and North Africa. METHODS: Label-free Liquid Chromatography-Tandem Mass Spectrometry (LC-MS/MS) was used to perform a proteomic analysis of FFPE tissue samples from a cohort of 41 NPC tumor samples of Morocco and North Africa origins. The LC-MS/MS data from this cohort were analyzed alongside 21 healthy controls using MaxQuant 2.4.2.0. A differential expression analysis was performed using the MSstats package in R. Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) functional annotations were carried out using the DAVID bioinformatic tool. RESULTS: 3341 proteins were identified across our NPC cases, revealing three main clusters and five DEPs with prognostic significance. The sex disparity of NPC was investigated from a proteomic perspective in which 59 DEPs were found between males and females, with significantly enriched terms associated with the immune response and gene expression. Furthermore, 26 DEPs were observed between patients with early and advanced stages of NPC with a significant cluster related to the immune response, implicating up-regulated DEPs such as IGHA, IGKC, and VAT1. Across both datasets, 6532 proteins were quantified between NPC patients and healthy controls. Among them, 1507 differentially expressed proteins (DEPs) were observed. GO and KEGG pathway analyses showed enriched terms of DEPs related to increased cellular activity, cell proliferation, and survival. PI3K and MAPK proteins as well as RAC1 BCL2 and PPIA were found to be overexpressed between cancer tissues and healthy controls. EBV infection was also one of the enriched pathways implicating its latent genes like LMP1 and LMP2 that activate several proteins and signaling pathways including NF-Kappa B, MAPK, and JAK-STAT pathways. CONCLUSION: Our findings unveil the proteomic landscape of NPC for the first time in the Moroccan population. These studies additionally may provide a foundation for identifying potential biomarkers. Further research is still needed to help develop tools for the early diagnosis and treatment of NPC in Moroccan and North African populations.
RESUMO
Benchmark challenges, such as the Critical Assessment of Structure Prediction (CASP) and Dialogue for Reverse Engineering Assessments and Methods (DREAM) have been instrumental in driving the development of bioinformatics methods. Typically, challenges are posted, and then competitors perform a prediction based upon blinded test data. Challengers then submit their answers to a central server where they are scored. Recent efforts to automate these challenges have been enabled by systems in which challengers submit Docker containers, a unit of software that packages up code and all of its dependencies, to be run on the cloud. Despite their incredible value for providing an unbiased test-bed for the bioinformatics community, there remain opportunities to further enhance the potential impact of benchmark challenges. Specifically, current approaches only evaluate end-to-end performance; it is nearly impossible to directly compare methodologies or parameters. Furthermore, the scientific community cannot easily reuse challengers' approaches, due to lack of specifics, ambiguity in tools and parameters as well as problems in sharing and maintenance. Lastly, the intuition behind why particular steps are used is not captured, as the proposed workflows are not explicitly defined, making it cumbersome to understand the flow and utilization of data. Here we introduce an approach to overcome these limitations based upon the WINGS semantic workflow system. Specifically, WINGS enables researchers to submit complete semantic workflows as challenge submissions. By submitting entries as workflows, it then becomes possible to compare not just the results and performance of a challenger, but also the methodology employed. This is particularly important when dozens of challenge entries may use nearly identical tools, but with only subtle changes in parameters (and radical differences in results). WINGS uses a component driven workflow design and offers intelligent parameter and data selection by reasoning about data characteristics. This proves to be especially critical in bioinformatics workflows where using default or incorrect parameter values is prone to drastically altering results. Different challenge entries may be readily compared through the use of abstract workflows, which also facilitate reuse. WINGS is housed on a cloud based setup, which stores data, dependencies and workflows for easy sharing and utility. It also has the ability to scale workflow executions using distributed computing through the Pegasus workflow execution system. We demonstrate the application of this architecture to the DREAM proteogenomic challenge.
Assuntos
Benchmarking/métodos , Semântica , Fluxo de Trabalho , Algoritmos , Biologia Computacional/métodos , Perfilação da Expressão Gênica/estatística & dados numéricos , Genômica , Metadados , Proteínas/genética , Proteínas/metabolismo , Reprodutibilidade dos Testes , Análise de Sequência de RNA/estatística & dados numéricosRESUMO
Currently prostate-specific antigen is used for prostate cancer (PCa) screening, however it lacks the necessary specificity for differentiating PCa from other diseases of the prostate such as benign prostatic hyperplasia (BPH), presenting a clinical need to distinguish these cases at the molecular level. Protein glycosylation plays an important role in a number of cellular processes involved in neoplastic progression and is aberrant in PCa. In this study, we systematically interrogate the alterations in the circulating levels of hundreds of serum proteins and their glycoforms in PCa and BPH samples using multi-lectin affinity chromatography and quantitative mass spectrometry-based proteomics. Specific lectins (AAL, PHA-L and PHA-E) were used to target and chromatographically separate core-fucosylated and highly-branched protein glycoforms for analysis, as differential expression of these glycan types have been previously associated with PCa. Global levels of CD5L, CFP, C8A, BST1, and C7 were significantly increased in the PCa samples. Notable glycoform-specific alterations between BPH and PCa were identified among proteins CD163, C4A, and ATRN in the PHA-L/E fraction and among C4BPB and AZGP1 glycoforms in the AAL fraction. Despite these modest differences, substantial similarities in glycoproteomic profiles were observed between PCa and BPH sera.
Assuntos
Lectinas/metabolismo , Proteínas de Membrana/metabolismo , Próstata/metabolismo , Hiperplasia Prostática/metabolismo , Neoplasias da Próstata/metabolismo , Adulto , Idoso , Cromatografia de Afinidade , Glicosilação , Humanos , Masculino , Pessoa de Meia-Idade , Antígeno Prostático Específico/metabolismo , Proteômica/métodos , Sensibilidade e EspecificidadeRESUMO
Recent advances in proteome informatics have led to an explosion in tools to analyze mass spectrometry data. These tools operate across the analysis pipeline doing everything from assessing quality control to matching peptides to spectra to quantification. Unfortunately, the vast majority of these tools are not able to operate directly on the proprietary formats generated by the diverse mass spectrometers. Consequently, the first step in many protocols is the conversion of data from vendor-specific binary files to open-format files. This protocol details the use of ProteoWizard's msConvert and msConvertGUI software for this conversion, taking format features, coding options, and vendor particularities into account. We specifically describe the various options available when doing conversions and the implications of each option.
Assuntos
Biologia Computacional/métodos , Proteômica , Software , Estatística como Assunto/métodos , Espectrometria de Massas/métodos , Proteômica/métodos , Interface Usuário-Computador , NavegadorRESUMO
BACKGROUND: Quantitative proteomics allows for the discovery and functional investigation of blood-based pre-diagnostic biomarkers for early cancer detection. However, a major limitation of proteomic investigations in biomarker studies remains the biological and technical variability in the analysis of complex clinical samples. Moreover, unlike 'omics analogues such as genomics and transcriptomics, proteomics has yet to achieve reproducibility and long-term stability on a unified technological platform. Few studies have thoroughly investigated protein variability in pre-diagnostic samples of cancer patients across multiple platforms. METHODS: We obtained ten blood plasma "case" samples collected up to 2 years prior to breast cancer diagnosis. Each case sample was paired with a matched control plasma from a full biological sister without breast cancer. We measured protein levels using both mass-spectrometry and antibody-based technologies to: (1) assess the technical considerations in different protein assays when analyzing limited clinical samples, and (2) evaluate the statistical power of potential diagnostic analytes. RESULTS: Although we found inherent technical variation in the three assays used, we detected protein dependent biological signal from the limited samples. The three assay types yielded 32 proteins with statistically significantly (p < 1E-01) altered expression levels between cases and controls, with no proteins retaining statistical significance after false discovery correction. CONCLUSIONS: Technical, practical, and study design considerations are essential to maximize information obtained in limited pre-diagnostic samples of cancer patients. This study provides a framework that estimates biological effect sizes critical for consideration in designing studies for pre-diagnostic blood-based biomarker detection.
RESUMO
Ongoing cancer genome characterization studies continue to elucidate the spectrum of genomic abnormalities that drive many cancers, and in the clinical arena assessment of the driver genetic alterations in patients is playing an increasingly important diagnostic and/or prognostic role for many cancer types. However, the landscape of genomic abnormalities is still unknown for less common cancers, and the influence of specific genotypes on clinical behavior is often still unclear. To address some of these deficiencies, we developed Profile, a prospective cohort study to obtain genomic information on all patients at a large tertiary care medical center for cancer-related care. We enrolled patients with any cancer diagnosis, and, for each patient (unselected for cancer site or type) we applied mass spectrometric genotyping (OncoMap) of 471 common recurrent mutations in 41 cancer-related genes. We report the results of the first 5000 patients, of which 26% exhibited potentially actionable somatic mutations. These observations indicate the utility of genotyping in advancing the field of precision oncology.