RESUMEN
Type I interferons (IFN-I) exert pleiotropic biological effects during viral infections, balancing virus control versus immune-mediated pathologies, and have been successfully employed for the treatment of viral diseases. Humans express 12 IFN-alpha (α) subtypes, which activate downstream signaling cascades and result in distinct patterns of immune responses and differential antiviral responses. Inborn errors in IFN-I immunity and the presence of anti-IFN autoantibodies account for very severe courses of COVID-19; therefore, early administration of IFN-I may be protective against life-threatening disease. Here we comprehensively analyzed the antiviral activity of all IFNα subtypes against severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) to identify the underlying immune signatures and explore their therapeutic potential. Prophylaxis of primary human airway epithelial cells (hAEC) with different IFNα subtypes during SARS-CoV-2 infection uncovered distinct functional classes with high, intermediate, and low antiviral IFNs. In particular, IFNα5 showed superior antiviral activity against SARS-CoV-2 infection in vitro and in SARS-CoV-2-infected mice in vivo. Dose dependency studies further displayed additive effects upon coadministration with the broad antiviral drug remdesivir in cell culture. Transcriptomic analysis of IFN-treated hAEC revealed different transcriptional signatures, uncovering distinct, intersecting, and prototypical genes of individual IFNα subtypes. Global proteomic analyses systematically assessed the abundance of specific antiviral key effector molecules which are involved in IFN-I signaling pathways, negative regulation of viral processes, and immune effector processes for the potent antiviral IFNα5. Taken together, our data provide a systemic, multimodular definition of antiviral host responses mediated by defined IFN-I. This knowledge will support the development of novel therapeutic approaches against SARS-CoV-2.
Asunto(s)
Tratamiento Farmacológico de COVID-19 , Interferón-alfa/farmacología , SARS-CoV-2/efectos de los fármacos , Transcriptoma , Replicación Viral/efectos de los fármacos , Animales , COVID-19/inmunología , COVID-19/virología , Chlorocebus aethiops , Clonación Molecular , Modelos Animales de Enfermedad , Escherichia coli/genética , Escherichia coli/metabolismo , Perfilación de la Expresión Génica , Regulación de la Expresión Génica , Vectores Genéticos/química , Vectores Genéticos/metabolismo , Humanos , Interferón-alfa/genética , Interferón-alfa/inmunología , Ratones , Isoformas de Proteínas/clasificación , Isoformas de Proteínas/genética , Isoformas de Proteínas/inmunología , Isoformas de Proteínas/farmacología , Proteínas Recombinantes/clasificación , Proteínas Recombinantes/genética , Proteínas Recombinantes/inmunología , Proteínas Recombinantes/farmacología , SARS-CoV-2/genética , SARS-CoV-2/inmunología , Transducción de Señal , Células VeroRESUMEN
PURPOSE: There is evidence that lower activity of the RAF/MEK/ERK network is associated with positive outcomes in mild and moderate courses of COVID-19. The effect of this cascade in COVID-19 sepsis is still undetermined. Therefore, we tested the hypothesis that activity of the RAF/MEK/ERK network in COVID-19-induced sepsis is associated with an impact on 30-day survival. METHODS: We used biomaterial from 81 prospectively recruited patients from the multicentric CovidDataNet.NRW-study cohort (German clinical trial registry: DRKS00026184) with their collected medical history, vital signs, laboratory parameters, microbiological findings and patient outcome. ERK activity was measured by evaluating ERK phosphorylation using a Proximity Ligation Assay. RESULTS: An increased ERK activity at 4 days after diagnosis of COVID-19-induced sepsis was associated with a more than threefold increased chance of survival in an adjusted Cox regression model. ERK activity was independent of other confounders such as Charlson Comorbidity Index or SOFA score (HR 0.28, 95% CI 0.10-0.84, p = 0.02). CONCLUSION: High activity of the RAF/MEK/ERK network during the course of COVID-19 sepsis is a protective factor and may indicate recovery of the immune system. Further studies are needed to confirm these results.
RESUMEN
Oligodendrocytes are generated via a two-step mechanism from pluripotent neural stem cells (NSCs): after differentiation of NSCs to oligodendrocyte precursor/NG2 cells (OPCs), they further develop into mature oligodendrocytes. The first step of this differentiation process is only incompletely understood. In this study, we utilized the neurosphere assay to investigate NSC to OPC differentiation in a time course-dependent manner by mass spectrometry-based (phospho-) proteomics. We identify doublecortin-like kinase 1 (Dclk1) as one of the most prominently regulated proteins in both datasets, and show that it undergoes a gradual transition between its short/long isoform during NSC to OPC differentiation. This is regulated by phosphorylation of its SP-rich region, resulting in inhibition of proteolytic Dclk1 long cleavage, and therefore Dclk1 short generation. Through interactome analyses of different Dclk1 isoforms by proximity biotinylation, we characterize their individual putative interaction partners and substrates. All data are available via ProteomeXchange with identifier PXD040652.
Asunto(s)
Células-Madre Neurales , Células Precursoras de Oligodendrocitos , Diferenciación Celular , Quinasas Similares a Doblecortina , Oligodendroglía , Fosforilación , Proteínas Serina-Treonina Quinasas , ProteómicaRESUMEN
BACKGROUND: Stem cell apheresis in the context of autologous stem cell transplantation requires an accurate cluster of differentiantion 34 (CD34+) count determined by flow cytometry as the current gold standard. Since flow cytometry is a personnel and time-intensive diagnostic tool, automated stem cell enumeration may provide a promising alternative. Hence, this study aimed to compare automated hematopoietic progenitor enumeration carried out on a Sysmex XN-20 module compared with conventional flow cytometric measurements. METHODS: One hundred forty-three blood samples from 41 patients were included in this study. Correlation between the two methods was calculated over all samples, depending on leukocyte count and diagnosis. RESULTS: Overall, we found a high degree of correlation (r = 0.884). Furthermore, correlation was not impaired by elevated leukocyte counts (>10 000/µL, r = 0.860 vs <10 000/µL, r = 0.849; >20 000/µL, r = 0.843 vs <20 000/µL, r = 0.875). However, correlation was significantly impaired in patients with multiple myeloma (multiple myeloma r = 0.840 vs nonmyeloma r = 0.934). SUMMARY: Stem cell measurement carried out on the Sysmex XN-20 module provides a significant correlation with flow cytometry and might be implemented in clinical practice. In clinical decision-making, there was discrepancy of under 15% of cases. In multiple myeloma patients, XN-20 should be used with caution.
Asunto(s)
Antígenos CD34 , Citometría de Flujo , Células Madre Hematopoyéticas , Adulto , Femenino , Humanos , Masculino , Antígenos CD34/análisis , Antígenos CD34/sangre , Recuento de Células Sanguíneas/métodos , Recuento de Células Sanguíneas/instrumentación , Citometría de Flujo/métodos , Células Madre Hematopoyéticas/citología , Recuento de Leucocitos/métodos , Mieloma Múltiple/sangre , Mieloma Múltiple/diagnósticoRESUMEN
The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data. PRIDE is one of the founding members of the global ProteomeXchange (PX) consortium and an ELIXIR core data resource. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2019. The number of submitted datasets to PRIDE Archive (the archival component of PRIDE) has reached on average around 500 datasets per month during 2021. In addition to continuous improvements in PRIDE Archive data pipelines and infrastructure, the PRIDE Spectra Archive has been developed to provide direct access to the submitted mass spectra using Universal Spectrum Identifiers. As a key point, the file format MAGE-TAB for proteomics has been developed to enable the improvement of sample metadata annotation. Additionally, the resource PRIDE Peptidome provides access to aggregated peptide/protein evidences across PRIDE Archive. Furthermore, we will describe how PRIDE has increased its efforts to reuse and disseminate high-quality proteomics data into other added-value resources such as UniProt, Ensembl and Expression Atlas.
Asunto(s)
Bases de Datos de Proteínas , Metadatos/estadística & datos numéricos , Anotación de Secuencia Molecular/estadística & datos numéricos , Péptidos/química , Proteínas/química , Programas Informáticos , Secuencia de Aminoácidos , Bibliometría , Conjuntos de Datos como Asunto , Humanos , Almacenamiento y Recuperación de la Información , Internet , Espectrometría de Masas , Péptidos/genética , Péptidos/metabolismo , Proteínas/genética , Proteínas/metabolismo , Proteómica/instrumentación , Proteómica/métodos , Alineación de SecuenciaRESUMEN
Diagnosing urothelial cancer (UCa) via invasive cystoscopy is painful, specifically in men, and can cause infection and bleeding. Because the UCa risk is higher for male patients, urinary non-invasive UCa biomarkers are highly desired to stratify men for invasive cystoscopy. We previously identified multiple DNA methylation sites in urine samples that detect UCa with a high sensitivity and specificity in men. Here, we identified the most relevant markers by employing multiple statistical approaches and machine learning (random forest, boosted trees, LASSO) using a dataset of 251 male UCa patients and 111 controls. Three CpG sites located in ALOX5, TRPS1 and an intergenic region on chromosome 16 have been concordantly selected by all approaches, and their combination in a single decision matrix for clinical use was tested based on their respective thresholds of the individual CpGs. The combination of ALOX5 and TRPS1 yielded the best overall sensitivity (61%) at a pre-set specificity of 95%. This combination exceeded both the diagnostic performance of the most sensitive bioinformatic approach and that of the best single CpG. In summary, we showed that overlap analysis of multiple statistical approaches identifies the most reliable biomarkers for UCa in a male collective. The results may assist in stratifying men for cystoscopy.
Asunto(s)
Líquidos Corporales , Dedos/anomalías , Enfermedades del Cabello , Síndrome de Langer-Giedion , Neoplasias , Nariz/anomalías , Masculino , Humanos , Biomarcadores de Tumor/genética , Metilación de ADN , Aprendizaje Automático , ADN de Neoplasias , Proteínas RepresorasRESUMEN
This article describes some use case studies and self-assessments of FAIR status of de.NBI services to illustrate the challenges and requirements for the definition of the needs of adhering to the FAIR (findable, accessible, interoperable and reusable) data principles in a large distributed bioinformatics infrastructure. We address the challenge of heterogeneity of wet lab technologies, data, metadata, software, computational workflows and the levels of implementation and monitoring of FAIR principles within the different bioinformatics sub-disciplines joint in de.NBI. On the one hand, this broad service landscape and the excellent network of experts are a strong basis for the development of useful research data management plans. On the other hand, the large number of tools and techniques maintained by distributed teams renders FAIR compliance challenging.
Asunto(s)
Manejo de Datos/métodos , Metadatos , Redes Neurales de la Computación , Proteómica/métodos , Programas Informáticos , Genoma Humano , Secuenciación de Nucleótidos de Alto Rendimiento , Humanos , Cooperación Internacional , Fenotipo , Plantas/genética , Proteoma , Autoevaluación (Psicología) , Flujo de TrabajoRESUMEN
Environmental sequence data of microbial communities now makes up the majority of public genomic information. The assignment of a function to sequences from these metagenomic sources is challenging because organisms associated with the data are often uncharacterized and not cultivable. To overcome these challenges, we created a rationally designed expression library of metagenomic proteins covering the sequence space of the thioredoxin superfamily. This library of 100 individual proteins represents more than 22,000 thioredoxins found in the Global Ocean Sampling data set. We screened this library for the functional rescue of Escherichia coli mutants lacking the thioredoxin-type reductase (ΔtrxA), isomerase (ΔdsbC), or oxidase (ΔdsbA). We were able to assign functions to more than a quarter of our representative proteins. The in vivo function of a given representative could not be predicted by phylogenetic relation but did correlate with the predicted isoelectric surface potential of the protein. Selected proteins were then purified, and we determined their activity using a standard insulin reduction assay and measured their redox potential. An unexpected gel shift of protein E5 during the redox potential determination revealed a redox cycle distinct from that of typical thioredoxin-superfamily oxidoreductases. Instead of the intramolecular disulfide bond formation typical for thioredoxins, this protein forms an intermolecular disulfide between the attacking cysteines of two separate subunits during its catalytic cycle. Our functional metagenomic approach proved not only useful to assign in vivo functions to representatives of thousands of proteins but also uncovered a novel reaction mechanism in a seemingly well-known protein superfamily.
Asunto(s)
Monitoreo del Ambiente , Glutarredoxinas/genética , Metagenómica , Tiorredoxinas/genética , Catálisis , Cisteína/química , Escherichia coli/genética , Glutarredoxinas/química , Glutarredoxinas/clasificación , Familia de Multigenes/genética , Océanos y Mares , Oxidación-Reducción , Filogenia , Proteína Disulfuro Isomerasas/química , Proteína Disulfuro Isomerasas/genética , Reductasa de Tiorredoxina-Disulfuro/química , Reductasa de Tiorredoxina-Disulfuro/genética , Tiorredoxinas/química , Tiorredoxinas/clasificaciónRESUMEN
Neuromelanin is a black-brownish pigment, present in so-called neuromelanin granules (NMGs) in the cell bodies of dopaminergic neurons in the substantia nigra (SN) pars compacta. These neurons are lost in neurodegenerative diseases, such as Parkinson's disease and dementia with Lewy bodies. Although it is known that lipids, proteins, and environmental toxins accumulate in NMGs, the function of NMGs has not yet been finally clarified as well as their origin and the synthesis of neuromelanin. We, therefore, isolated NMGs and surrounding SN tissue from control patients by laser microdissection and analyzed the proteomic profile by tandem mass spectrometry. With our improved workflow, we were able to (1) strengthen the regularly reported link between NMGs and lysosomes, (2) detect tyrosine hydroxylase to be highly abundant in NMGs, which may be related to neuromelanin synthesis and (3) indicate a yet undescribed link between stress granules (SGs) and NMGs. Based on our findings, we cautiously hypothesize, that SGs may be the origin of NMGs or form in close proximity to them, potentially due to the oxidative stress caused by neuromelanin-bound metals.
Asunto(s)
Proteómica , Tirosina 3-Monooxigenasa , Humanos , Lisosomas/metabolismo , Melaninas/metabolismo , Proteómica/métodos , Gránulos de Estrés , Sustancia Negra/metabolismo , Tirosina 3-Monooxigenasa/metabolismoRESUMEN
Currently data-dependent acquisition (DDA) is the method of choice for mass spectrometry-based proteomics discovery experiments, but data-independent acquisition (DIA) is steadily becoming more important. One of the most important requirements to perform a DIA analysis is the availability of suitable spectral libraries for peptide identification and quantification. Several studies were performed addressing the evaluation of spectral library performance for protein identification in DIA measurements. But so far only few experiments estimate the effect of these libraries on the quantitative level.In this work we created a gold standard spike-in sample set with known contents and ratios of proteins in a complex protein matrix that allowed a detailed comparison of DIA quantification data obtained with different spectral library approaches. We used in-house generated sample-specific spectral libraries created using varying sample preparation approaches and repeated DDA measurement. In addition, two different search engines were tested for protein identification from DDA data and subsequent library generation. In total, eight different spectral libraries were generated, and the quantification results compared with a library free method, as well as a default DDA analysis. Not only the number of identifications on peptide and protein level in the spectral libraries and the corresponding DIA analysis results was inspected, but also the number of expected and identified differentially abundant protein groups and their ratios.We found, that while libraries of prefractionated samples were generally larger, there was no significant increase in DIA identifications compared with repetitive non-fractionated measurements. Furthermore, we show that the accuracy of the quantification is strongly dependent on the applied spectral library and whether the quantification is based on peptide or protein level. Overall, the reproducibility and accuracy of DIA quantification is superior to DDA in all applied approaches.Data has been deposited to the ProteomeXchange repository with identifiers PXD012986, PXD012987, PXD012988 and PXD014956.
Asunto(s)
Exactitud de los Datos , Biblioteca de Péptidos , Proteoma/análisis , Proteómica/métodos , Animales , Línea Celular , Cromatografía Liquida/métodos , Bases de Datos de Proteínas , Ratones , Mioblastos/metabolismo , Péptidos/análisis , Proteínas/análisis , Reproducibilidad de los Resultados , Sensibilidad y Especificidad , Análisis de Secuencia de Proteína , Programas Informáticos , Espectrometría de Masas en Tándem/métodosRESUMEN
BACKGROUND: The COVID-19 pandemic has taken a toll on health care systems worldwide, which has led to increased mortality of different diseases like myocardial infarction. This is most likely due to three factors. First, an increased workload per nurse ratio, a factor associated with mortality. Second, patients presenting with COVID-19-like symptoms are isolated, which also decreases survival in cases of emergency. And third, patients hesitate to see a doctor or present themselves at a hospital. To assess if this is also true for sepsis patients, we asked whether non-COVID-19 sepsis patients had an increased 30-day mortality during the COVID-19 pandemic. METHODS: This is a post hoc analysis of the SepsisDataNet.NRW study, a multicentric, prospective study that includes septic patients fulfilling the SEPSIS-3 criteria. Within this study, we compared the 30-day mortality and disease severity of patients recruited pre-pandemic (recruited from March 2018 until February 2020) with non-COVID-19 septic patients recruited during the pandemic (recruited from March 2020 till December 2020). RESULTS: Comparing septic patients recruited before the pandemic to those recruited during the pandemic, we found an increased raw 30-day mortality in sepsis-patients recruited during the pandemic (33% vs. 52%, p = 0.004). We also found a significant difference in the severity of disease at recruitment (SOFA score pre-pandemic: 8 (5 - 11) vs. pandemic: 10 (8 - 13); p < 0.001). When adjusted for this, the 30-day mortality rates were not significantly different between the two groups (52% vs. 52% pre-pandemic and pandemic, p = 0.798). CONCLUSIONS: This led us to believe that the higher mortality of non-COVID19 sepsis patients during the pandemic might be attributed to a more severe septic disease at the time of recruitment. We note that patients may experience a delayed admission, as indicated by elevated SOFA scores. This could explain the higher mortality during the pandemic and we found no evidence for a diminished quality of care for critically ill sepsis patients in German intensive care units.
Asunto(s)
COVID-19/prevención & control , Pandemias , Sepsis/mortalidad , Tiempo de Tratamiento/estadística & datos numéricos , Anciano , Femenino , Alemania/epidemiología , Humanos , Masculino , Persona de Mediana Edad , Gravedad del Paciente , Estudios Prospectivos , SARS-CoV-2 , Análisis de SupervivenciaRESUMEN
Chronic obstructive pulmonary disease (COPD) is a major risk factor for the development of lung adenocarcinoma (AC). AC often develops on underlying COPD; thus, the differentiation of both entities by biomarker is challenging. Although survival of AC patients strongly depends on early diagnosis, a biomarker panel for AC detection and differentiation from COPD is still missing. Plasma samples from 176 patients with AC with or without underlying COPD, COPD patients, and hospital controls were analyzed using mass-spectrometry-based proteomics. We performed univariate statistics and additionally evaluated machine learning algorithms regarding the differentiation of AC vs. COPD and AC with COPD vs. COPD. Univariate statistics revealed significantly regulated proteins that were significantly regulated between the patient groups. Furthermore, random forest classification yielded the best performance for differentiation of AC vs. COPD (area under the curve (AUC) 0.935) and AC with COPD vs. COPD (AUC 0.916). The most influential proteins were identified by permutation feature importance and compared to those identified by univariate testing. We demonstrate the great potential of machine learning for differentiation of highly similar disease entities and present a panel of biomarker candidates that should be considered for the development of a future biomarker panel.
Asunto(s)
Adenocarcinoma del Pulmón , Neoplasias Pulmonares , Enfermedad Pulmonar Obstructiva Crónica , Biomarcadores , Humanos , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/patología , Proteómica , Enfermedad Pulmonar Obstructiva Crónica/patologíaRESUMEN
Protein sequence databases play a crucial role in the majority of the currently applied mass-spectrometry-based proteomics workflows. Here UniProtKB serves as one of the major sources, as it combines the information of several smaller databases and enriches the entries with additional biological information. For the identification of peptides in a sample by tandem mass spectra, as generated by data-dependent acquisition, protein sequence databases provide the basis for most spectrum identification search engines. In addition, for targeted proteomics approaches like selected reaction monitoring (SRM) and parallel reaction monitoring (PRM), knowledge of the peptide sequences, their masses, and whether they are unique for a protein is essential. Because most bottom-up proteomics approaches use trypsin to cleave the proteins in a sample, the tryptic peptides contained in a protein database are of great interest. We present a database, called MaCPepDB (mass-centric peptide database), that consists of the complete tryptic digest of the Swiss-Prot and TrEMBL parts of UniProtKB. This database is especially designed to not only allow queries of peptide sequences and return the respective information about connected proteins and thus whether a peptide is unique but also allow queries of specific masses of peptides or precursors of MS/MS spectra. Furthermore, posttranslational modifications can be considered in a query as well as different mass deviations for posttranslational modifications. Hence the database can be used by a sequence query not only to, for example, check in which proteins of the UniProt database a tryptic peptide can be found but also to find possibly interfering peptides in PRM/SRM experiments using the mass query. The complete database contains currently 5â¯939â¯244â¯990 peptides from 185â¯561â¯610 proteins (UniProt version 2020_03), for which a single query usually takes less than 1 s. For easy exploration of the data, a web interface was developed. A REST application programming interface (API) for programmatic and workflow access is also available at https://macpepdb.mpc.rub.de.
Asunto(s)
Péptidos , Espectrometría de Masas en Tándem , Bases de Datos de Proteínas , Proteínas , ProteómicaRESUMEN
Sustainable noncommercial bioinformatics infrastructures are a prerequisite to use and take advantage of the potential of big data analysis for research and economy. Consequently, funders, universities and institutes as well as users ask for a transparent value model for the tools and services offered. In this article, a generally applicable lightweight method is described by which bioinformatics infrastructure projects can estimate the value of tools and services offered without determining exactly the total costs of ownership. Five representative scenarios for value estimation from a rough estimation to a detailed breakdown of costs are presented. To account for the diversity in bioinformatics applications and services, the notion of service-specific 'service provision units' is introduced together with the factors influencing them and the main underlying assumptions for these 'value influencing factors'. Special attention is given on how to handle personnel costs and indirect costs such as electricity. Four examples are presented for the calculation of the value of tools and services provided by the German Network for Bioinformatics Infrastructure (de.NBI): one for tool usage, one for (Web-based) database analyses, one for consulting services and one for bioinformatics training events. Finally, from the discussed values, the costs of direct funding and the costs of payment of services by funded projects are calculated and compared.
Asunto(s)
Biología Computacional/economía , Biología Computacional/métodos , Programas Informáticos/economía , Macrodatos/economía , Biología Computacional/educación , Consultores , Costos y Análisis de Costo , Arquitectura y Construcción de Instituciones de Salud/economía , Humanos , Servicios de Información/economía , Modelos Económicos , Navegador Web/economíaRESUMEN
BACKGROUND: Asthma is an inflammatory disease of the respiratory system, and a major factor of increasing health care costs worldwide. The molecular actors leading to the development of chronic asthma are not fully understood and require further investigation. OBJECTIVE: The aim of this study was to monitor the proteome dynamics during asthma development from early inflammatory to late fibrotic stages. METHODS: A mouse asthma model was used to analyse the lung proteome at four time points during asthma development (0 weeks = control, 5, 8 and 12 weeks of treatment, n = 6 each). The model was analysed using lung function tests, immune cell counting and histology. Furthermore, a multi-fraction mass spectrometry-based proteome analysis was performed to achieve a comprehensive coverage and quantification of the lung proteome. RESULTS: At early stages, the mice showed predominant eosinophilic inflammation of the airways, which disappeared at later stages and was replaced by marked airway hyper-reactivity and fibrosis of the airways. 3325 proteins were quantified with 435 proteins found to be significantly differentially abundant between the experimental groups (ANOVA p-value ≤.05, maximum fold change ≥1.5). We applied hierarchical clustering to identify common protein abundance profiles along the asthma development and analysed these clusters using gene ontology annotation and enrichment analysis. We demonstrate the correlation of protein clusters with the course of asthma development, that is eosinophilic inflammation and fibrotic remodelling of the airways. CONCLUSIONS AND CLINICAL RELEVANCE: Proteome analysis revealed proteins that were previously described to be important during asthma chronification. Moreover, we identified additional proteins previously not described in the context of asthma. We provide a comprehensive data set of a long-term mouse model of asthma that may contribute to a better understanding and allow new insights into the progression and development of chronic asthma. Data are available via ProteomeXchange with identifier PXD011159.
Asunto(s)
Asma , Proteoma , Animales , Modelos Animales de Enfermedad , Ontología de Genes , Humanos , Pulmón , RatonesRESUMEN
The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.
Asunto(s)
Bases de Datos de Proteínas , Espectrometría de Masas , Proteómica , Péptidos/química , Programas InformáticosRESUMEN
Targeted proteomics techniques allow accurate quantitative measurements of analytes in complex matrices with dynamic linear ranges that span up to 4-5 orders of magnitude. Hence, targeted methods are promising for the development of robust protein assays in several sensitive areas, for example, in health care. However, exploiting the full method potential requires reliable determination of the dynamic range along with related quantification limits for each analyte. Here, a software named CalibraCurve that enables an automated batch-mode determination of dynamic linear ranges and quantification limits for both targeted proteomics and similar assays is presented. The software uses a variety of measures to assess the accuracy of the calibration, namely precision and trueness. Two different kinds of customizable graphs are created (calibration curves and response factor plots). The accuracy measures and the graphs offer an intuitive, detailed, and reliable opportunity to assess the quality of the model fit. Thus, CalibraCurve is deemed a highly useful and flexible tool to facilitate the development and control of reliable SRM/MRM-MS-based proteomics assays.
Asunto(s)
Espectrometría de Masas/métodos , Proteómica/métodos , Programas Informáticos , Calibración , HumanosRESUMEN
We evaluated the quantification strategies label-free (LF), stable isotope labeling by amino acids in cell culture (SILAC), and tandem mass tags (TMT) and their performance in quantification of proteins and phosphosites (p-sites) to identify the most powerful approach for monitoring cellular signaling. We analyzed the epidermal growth factor receptor (EGFR) signaling network, which plays an essential role in colorectal cancer, and studied its dynamics within 24 h upon treatment with the EGFR-blocking antibody cetuximab, representing the first cellular adaption toward therapy. LF achieved superior coverage but was outperformed by label-based approaches regarding technical variability, especially for quantification of p-sites. TMT showed the lowest coverage and most missing values. We found that its performance considerably decreases when experimental replicates are distributed over several TMT plexes. SILAC showed the highest precision and outstanding performance for quantification of p-sites, rendering it the method of choice for analyzing cellular signaling in cell culture models. On the protein level, we observed only little regulation upon cetuximab treatment, whereas a great fraction of p-sites was significantly regulated. These dynamics represented an initial downregulation of the MAPK pathway, which was partially rescued as early as 24 h after treatment. We identified upregulation and signaling via ERBB3 as well as calcium and cAMP signaling as possible mechanisms bypassing the blockage of EGFR.
Asunto(s)
Neoplasias Colorrectales , Proteómica , Línea Celular , Neoplasias Colorrectales/tratamiento farmacológico , Receptores ErbB/genética , Humanos , Marcaje IsotópicoRESUMEN
Histopathological differentiation between severe urocystitis with reactive urothelial atypia and carcinoma in situ (CIS) can be difficult, particularly after a treatment that deliberately induces an inflammatory reaction, such as intravesical instillation of Bacillus Calmette-Guèrin. However, precise grading in bladder cancer is critical for therapeutic decision making and thus requires reliable immunohistochemical biomarkers. Herein, an exemplary potential biomarker in bladder cancer was identified by the novel approach of Fourier transform infrared imaging for label-free tissue annotation of tissue thin sections. Identified regions of interest are collected by laser microdissection to provide homogeneous samples for liquid chromatography-tandem mass spectrometry-based proteomic analysis. This approach afforded label-free spatial classification with a high accuracy and without interobserver variability, along with the molecular resolution of the proteomic analysis. Cystitis and invasive high-grade urothelial carcinoma samples were analyzed. Three candidate biomarkers were identified and verified by immunohistochemistry in a small cohort, including low-grade urothelial carcinoma samples. The best-performing candidate AHNAK2 was further evaluated in a much larger independent verification cohort that also included CIS samples. Reactive urothelial atypia and CIS were distinguishable on the basis of the expression of this newly identified and verified immunohistochemical biomarker, with a sensitivity of 97% and a specificity of 69%. AHNAK2 can differentiate between reactive urothelial atypia in the setting of an acute or chronic cystitis and nonmuscle invasive-type CIS.
Asunto(s)
Biomarcadores de Tumor/metabolismo , Proteínas del Citoesqueleto/metabolismo , Proteínas de Neoplasias/metabolismo , Proteómica , Neoplasias de la Vejiga Urinaria , Urotelio , Femenino , Humanos , Inmunohistoquímica , Masculino , Espectroscopía Infrarroja por Transformada de Fourier , Neoplasias de la Vejiga Urinaria/diagnóstico por imagen , Neoplasias de la Vejiga Urinaria/metabolismo , Urotelio/diagnóstico por imagen , Urotelio/metabolismoRESUMEN
Proteomics using LC-MS/MS has become one of the main methods to analyze the proteins in biological samples in high-throughput. But the existing mass-spectrometry instruments are still limited with respect to resolution and measurable mass ranges, which is one of the main reasons why shotgun proteomics is the major approach. Here proteins are digested, which leads to the identification and quantification of peptides instead. While often neglected, the important step of protein inference needs to be conducted to infer from the identified peptides to the actual proteins in the original sample. In this work, we highlight some of the previously published and newly added features of the tool PIA - Protein Inference Algorithms, which helps the user with the protein inference of measured samples. We also highlight the importance of the usage of PSI standard file formats, as PIA is the only current software supporting all available standards used for spectrum identification and protein inference. Additionally, we briefly describe the benefits of working with workflow environments for proteomics analyses and show the new features of the PIA nodes for the KNIME Analytics Platform. Finally, we benchmark PIA against a recently published data set for isoform detection. PIA is open source and available for download on GitHub ( https://github.com/mpc-bioinformatics/pia ) or directly via the community extensions inside the KNIME analytics platform.