Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 119
Filtrar
Más filtros

Bases de datos
País/Región como asunto
Tipo del documento
Intervalo de año de publicación
1.
J Proteome Res ; 23(1): 418-429, 2024 01 05.
Artículo en Inglés | MEDLINE | ID: mdl-38038272

RESUMEN

The inherent diversity of approaches in proteomics research has led to a wide range of software solutions for data analysis. These software solutions encompass multiple tools, each employing different algorithms for various tasks such as peptide-spectrum matching, protein inference, quantification, statistical analysis, and visualization. To enable an unbiased comparison of commonly used bottom-up label-free proteomics workflows, we introduce WOMBAT-P, a versatile platform designed for automated benchmarking and comparison. WOMBAT-P simplifies the processing of public data by utilizing the sample and data relationship format for proteomics (SDRF-Proteomics) as input. This feature streamlines the analysis of annotated local or public ProteomeXchange data sets, promoting efficient comparisons among diverse outputs. Through an evaluation using experimental ground truth data and a realistic biological data set, we uncover significant disparities and a limited overlap in the quantified proteins. WOMBAT-P not only enables rapid execution and seamless comparison of workflows but also provides valuable insights into the capabilities of different software solutions. These benchmarking metrics are a valuable resource for researchers in selecting the most suitable workflow for their specific data sets. The modular architecture of WOMBAT-P promotes extensibility and customization. The software is available at https://github.com/wombat-p/WOMBAT-Pipelines.


Asunto(s)
Benchmarking , Proteómica , Flujo de Trabajo , Programas Informáticos , Proteínas , Análisis de Datos
2.
PLoS Comput Biol ; 19(9): e1011369, 2023 Sep.
Artículo en Inglés | MEDLINE | ID: mdl-37768885

RESUMEN

Research data is accumulating rapidly and with it the challenge of fully reproducible science. As a consequence, implementation of high-quality management of scientific data has become a global priority. The FAIR (Findable, Accesible, Interoperable and Reusable) principles provide practical guidelines for maximizing the value of research data; however, processing data using workflows-systematic executions of a series of computational tools-is equally important for good data management. The FAIR principles have recently been adapted to Research Software (FAIR4RS Principles) to promote the reproducibility and reusability of any type of research software. Here, we propose a set of 10 quick tips, drafted by experienced workflow developers that will help researchers to apply FAIR4RS principles to workflows. The tips have been arranged according to the FAIR acronym, clarifying the purpose of each tip with respect to the FAIR4RS principles. Altogether, these tips can be seen as practical guidelines for workflow developers who aim to contribute to more reproducible and sustainable computational science, aiming to positively impact the open science and FAIR community.

3.
Am J Physiol Lung Cell Mol Physiol ; 324(4): L521-L535, 2023 04 01.
Artículo en Inglés | MEDLINE | ID: mdl-36808722

RESUMEN

Lung fibroblasts are implicated in abnormal tissue repair in chronic obstructive pulmonary disease (COPD). The exact mechanisms are unknown and comprehensive analysis comparing COPD- and control fibroblasts is lacking. The aim of this study is to gain insight into the role of lung fibroblasts in COPD pathology using unbiased proteomic and transcriptomic analysis. Protein and RNA were isolated from cultured parenchymal lung fibroblasts of 17 patients with stage IV COPD and 16 non-COPD controls. Proteins were analyzed using LC-MS/MS and RNA through RNA sequencing. Differential protein and gene expression in COPD was assessed via linear regression, followed by pathway enrichment, correlation analysis, and immunohistological staining in lung tissue. Proteomic and transcriptomic data were compared to investigate the overlap and correlation between both levels of data. We identified 40 differentially expressed (DE) proteins and zero DE genes between COPD and control fibroblasts. The most significant DE proteins were HNRNPA2B1 and FHL1. Thirteen of the 40 proteins were previously associated with COPD, including FHL1 and GSTP1. Six of the 40 proteins were related to telomere maintenance pathways, and were positively correlated with the senescence marker LMNB1. No significant correlation between gene and protein expression was observed for the 40 proteins. We hereby describe 40 DE proteins in COPD fibroblasts including previously described COPD proteins (FHL1, GSTP1) and new COPD research targets like HNRNPA2B1. Lack of overlap and correlation between gene and protein data supports the use of unbiased proteomics analysis and indicates that different types of information are generated with both methods.


Asunto(s)
Proteómica , Enfermedad Pulmonar Obstructiva Crónica , Humanos , Cromatografía Liquida , Espectrometría de Masas en Tándem , Pulmón/metabolismo , Enfermedad Pulmonar Obstructiva Crónica/patología , ARN/metabolismo , Fibroblastos/metabolismo , Proteínas Musculares/metabolismo , Péptidos y Proteínas de Señalización Intracelular/metabolismo , Proteínas con Dominio LIM/metabolismo
4.
Am J Physiol Lung Cell Mol Physiol ; 324(6): L799-L814, 2023 06 01.
Artículo en Inglés | MEDLINE | ID: mdl-37039368

RESUMEN

Extracellular matrix (ECM) remodeling has been associated with chronic lung diseases. However, information about specific age-associated differences in lung ECM is currently limited. In this study, we aimed to identify and localize age-associated ECM differences in human lungs using comprehensive transcriptomic, proteomic, and immunohistochemical analyses. Our previously identified age-associated gene expression signature of the lung was re-analyzed limiting it to an aging signature based on 270 control patients (37-80 years) and focused on the Matrisome core geneset using geneset enrichment analysis. To validate the age-associated transcriptomic differences on protein level, we compared the age-associated ECM genes (false discovery rate, FDR < 0.05) with a profile of age-associated proteins identified from a lung tissue proteomics dataset from nine control patients (49-76 years) (FDR < 0.05). Extensive immunohistochemical analysis was used to localize and semi-quantify the age-associated ECM differences in lung tissues from 62 control patients (18-82 years). Comparative analysis of transcriptomic and proteomic data identified seven ECM proteins with higher expression with age at both gene and protein levels: COL1A1, COL6A1, COL6A2, COL14A1, FBLN2, LTBP4, and LUM. With immunohistochemistry, we demonstrated higher protein levels with age for COL6A2 in whole tissue, parenchyma, airway wall, and blood vessel, for COL14A1 and LUM in bronchial epithelium, and COL1A1 in lung parenchyma. Our study revealed that higher age is associated with lung ECM remodeling, with specific differences occurring in defined regions within the lung. These differences may affect lung structure and physiology with aging and as such may increase susceptibility to developing chronic lung diseases.NEW & NOTEWORTHY We identified seven age-associated extracellular matrix (ECM) proteins, i.e., COL1A1, COL6A1, COL6A2 COL14A1, FBLN2, LTBP4, and LUM with higher transcript and protein levels in human lung tissue with age. Extensive immunohistochemical analysis revealed significant age-associated differences for COL6A2 in whole tissue, parenchyma, airway wall, and vessel, for COL14A1 and LUM in bronchial epithelium, and COL1A1 in parenchyma. Our findings lay a new foundation for the investigation of ECM differences in age-associated chronic lung diseases.


Asunto(s)
Enfermedades Pulmonares , Proteómica , Humanos , Adulto , Persona de Mediana Edad , Anciano , Anciano de 80 o más Años , Adolescente , Adulto Joven , Matriz Extracelular/metabolismo , Proteínas de la Matriz Extracelular/genética , Pulmón/metabolismo , Enfermedades Pulmonares/metabolismo
5.
Bioinformatics ; 38(22): 5049-5054, 2022 11 15.
Artículo en Inglés | MEDLINE | ID: mdl-36179082

RESUMEN

MOTIVATION: Gaussian graphical models (GGMs) are network representations of random variables (as nodes) and their partial correlations (as edges). GGMs overcome the challenges of high-dimensional data analysis by using shrinkage methodologies. Therefore, they have become useful to reconstruct gene regulatory networks from gene-expression profiles. However, it is often ignored that the partial correlations are 'shrunk' and that they cannot be compared/assessed directly. Therefore, accurate (differential) network analyses need to account for the number of variables, the sample size, and also the shrinkage value, otherwise, the analysis and its biological interpretation would turn biased. To date, there are no appropriate methods to account for these factors and address these issues. RESULTS: We derive the statistical properties of the partial correlation obtained with the Ledoit-Wolf shrinkage. Our result provides a toolbox for (differential) network analyses as (i) confidence intervals, (ii) a test for zero partial correlation (null-effects) and (iii) a test to compare partial correlations. Our novel (parametric) methods account for the number of variables, the sample size and the shrinkage values. Additionally, they are computationally fast, simple to implement and require only basic statistical knowledge. Our simulations show that the novel tests perform better than DiffNetFDR-a recently published alternative-in terms of the trade-off between true and false positives. The methods are demonstrated on synthetic data and two gene-expression datasets from Escherichia coli and Mus musculus. AVAILABILITY AND IMPLEMENTATION: The R package with the methods and the R script with the analysis are available in https://github.com/V-Bernal/GeneNetTools. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Redes Reguladoras de Genes , Ratones , Animales , Distribución Normal , Tamaño de la Muestra , Expresión Génica
6.
Drug Metab Dispos ; 51(2): 249-256, 2023 02.
Artículo en Inglés | MEDLINE | ID: mdl-36379709

RESUMEN

Therapeutic proteins (TPs) are known to be heterogeneous due to modifications that occur during the production process and storage. Modifications may also occur in TPs after their administration to patients due to in vivo biotransformation. Ligand binding assays, which are widely used in the bioanalysis of TPs in body fluids, are typically unable to distinguish such modifications. Liquid chromatography coupled to mass spectrometry is being increasingly used to study modifications in TPs, but its use to study in vivo biotransformation has been limited until now. We present a novel approach that combines affinity enrichment using Affimer reagents with ion-exchange chromatography (IEX) to analyze charge variants of the TPs trastuzumab and pertuzumab in plasma of patients undergoing therapy for HER2-positive breast cancer. Affimer reagents were immobilized via engineered Cys tags to maleimide beads, and the TPs were eluted under acidic conditions followed by rapid neutralization. The enriched TPs were analyzed by cation-exchange chromatography (IEX) using pH-gradient elution, resulting in the separation of about 20 charge variants for trastuzumab and about five charge variants for pertuzumab. A comparison between in vitro stressed TPs spiked into plasma, and TPs enriched from patient plasma showed that the observed profiles were highly similar. This indicates that in vitro stress testing in plasma can mimic the situation in patient plasma, as far as the generation of charge variants is concerned. SIGNIFICANCE STATEMENT: This research attempts to elucidate the modifications that occur in therapeutic proteins (TPs) after they have been administered to patients. This is important because there is little knowledge about the fate of TPs in this regard, and certain modifications could affect their efficiency. Our results show that the modifications discovered are most likely due to a chemical process and are not patient specific.


Asunto(s)
Neoplasias de la Mama , Humanos , Femenino , Trastuzumab/uso terapéutico , Neoplasias de la Mama/tratamiento farmacológico , Neoplasias de la Mama/metabolismo , Anticuerpos Monoclonales Humanizados/uso terapéutico , Cromatografía por Intercambio Iónico , Receptor ErbB-2/metabolismo , Protocolos de Quimioterapia Combinada Antineoplásica
7.
Anal Chem ; 94(31): 10893-10906, 2022 08 09.
Artículo en Inglés | MEDLINE | ID: mdl-35880733

RESUMEN

With increasing sensitivity and accuracy in mass spectrometry, the tumor phosphoproteome is getting into reach. However, the selection of quantitation techniques best-suited to the biomedical question and diagnostic requirements remains a trial and error decision as no study has directly compared their performance for tumor tissue phosphoproteomics. We compared label-free quantification (LFQ), spike-in-SILAC (stable isotope labeling by amino acids in cell culture), and tandem mass tag (TMT) isobaric tandem mass tags technology for quantitative phosphosite profiling in tumor tissue. Compared to the classic SILAC method, spike-in-SILAC is not limited to cell culture analysis, making it suitable for quantitative analysis of tumor tissue samples. TMT offered the lowest accuracy and the highest precision and robustness toward different phosphosite abundances and matrices. Spike-in-SILAC offered the best compromise between these features but suffered from a low phosphosite coverage. LFQ offered the lowest precision but the highest number of identifications. Both spike-in-SILAC and LFQ presented susceptibility to matrix effects. Match between run (MBR)-based analysis enhanced the phosphosite coverage across technical replicates in LFQ and spike-in-SILAC but further reduced the precision and robustness of quantification. The choice of quantitative methodology is critical for both study design such as sample size in sample groups and quantified phosphosites and comparison of published cancer phosphoproteomes. Using ovarian cancer tissue as an example, our study builds a resource for the design and analysis of quantitative phosphoproteomic studies in cancer research and diagnostics.


Asunto(s)
Neoplasias Ováricas , Proteómica , Femenino , Humanos , Marcaje Isotópico/métodos , Espectrometría de Masas/métodos , Neoplasias Ováricas/diagnóstico , Proteoma/química , Proteómica/métodos
8.
Respir Res ; 23(1): 15, 2022 Jan 24.
Artículo en Inglés | MEDLINE | ID: mdl-35073932

RESUMEN

BACKGROUND: There is a strong need for biomarkers to better characterize individuals with COPD and to take into account the heterogeneity of COPD. The blood protein sRAGE has been put forward as promising biomarker for COPD in general and emphysema in particular. Here, we measured plasma sRAGE levels using quantitative LC-MS and assessed whether the plasma sRAGE levels associate with (changes in) lung function, radiological emphysema parameters, and radiological subtypes of emphysema. METHODS: Three hundred and twenty-four COPD patients (mean FEV1: 63%predicted) and 185 healthy controls from the COPDGene study were selected. Plasma sRAGE was measured by immunoprecipitation in 96-well plate methodology to enrich sRAGE, followed by targeted quantitative liquid chromatography-mass spectrometry. Spirometry and HRCT scans (inspiration and expiration) with a 5-year follow-up were used; both subjected to high quality control standards. RESULTS: Lower sRAGE values significantly associated with the presence of COPD, the severity of airflow obstruction, the severity of emphysema on HRCT, the heterogeneous distribution of emphysema, centrilobular emphysema, and 5-year progression of emphysema. However, sRAGE values did not associate with airway wall thickness or paraseptal emphysema. CONCLUSIONS: Rather than being a general COPD biomarker, sRAGE is especially a promising biomarker for centrilobular emphysema. Follow-up studies should elucidate whether sRAGE can be used as a biomarker for other COPD phenotypes as well.


Asunto(s)
Pulmón/diagnóstico por imagen , Enfisema Pulmonar/sangre , Receptor para Productos Finales de Glicación Avanzada/sangre , Tomografía Computarizada por Rayos X/métodos , Capacidad Vital/fisiología , Anciano , Biomarcadores/sangre , Femenino , Humanos , Pulmón/fisiopatología , Masculino , Persona de Mediana Edad , Enfisema Pulmonar/diagnóstico , Enfisema Pulmonar/fisiopatología
9.
BMC Bioinformatics ; 22(1): 424, 2021 Sep 07.
Artículo en Inglés | MEDLINE | ID: mdl-34493207

RESUMEN

BACKGROUND: In systems biology, it is important to reconstruct regulatory networks from quantitative molecular profiles. Gaussian graphical models (GGMs) are one of the most popular methods to this end. A GGM consists of nodes (representing the transcripts, metabolites or proteins) inter-connected by edges (reflecting their partial correlations). Learning the edges from quantitative molecular profiles is statistically challenging, as there are usually fewer samples than nodes ('high dimensional problem'). Shrinkage methods address this issue by learning a regularized GGM. However, it remains open to study how the shrinkage affects the final result and its interpretation. RESULTS: We show that the shrinkage biases the partial correlation in a non-linear way. This bias does not only change the magnitudes of the partial correlations but also affects their order. Furthermore, it makes networks obtained from different experiments incomparable and hinders their biological interpretation. We propose a method, referred to as 'un-shrinking' the partial correlation, which corrects for this non-linear bias. Unlike traditional methods, which use a fixed shrinkage value, the new approach provides partial correlations that are closer to the actual (population) values and that are easier to interpret. This is demonstrated on two gene expression datasets from Escherichia coli and Mus musculus. CONCLUSIONS: GGMs are popular undirected graphical models based on partial correlations. The application of GGMs to reconstruct regulatory networks is commonly performed using shrinkage to overcome the 'high-dimensional problem'. Besides it advantages, we have identified that the shrinkage introduces a non-linear bias in the partial correlations. Ignoring this type of effects caused by the shrinkage can obscure the interpretation of the network, and impede the validation of earlier reported results.


Asunto(s)
Biología de Sistemas , Animales , Ratones , Distribución Normal
10.
J Proteome Res ; 20(1): 634-644, 2021 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-32985198

RESUMEN

Liquid chromatography tandem mass spectrometry (LC-MS/MS) has been the most widely used technology for phosphoproteomics studies. As an alternative to database searching and probability-based phosphorylation site localization approaches, spectral library searching has been proved to be effective in the identification of phosphopeptides. However, incompletion of experimental spectral libraries limits the identification capability. Herein, we utilize MS/MS spectrum prediction coupled with spectral matching for site localization of phosphopeptides. In silico MS/MS spectra are generated from peptide sequences by deep learning/machine learning models trained with nonphosphopeptides. Then, mass shift according to phosphorylation sites, phosphoric acid neutral loss, and a "budding" strategy are adopted to adjust the in silico mass spectra. In silico MS/MS spectra can also be generated in one step for phosphopeptides using models trained with phosphopeptides. The method is benchmarked on data sets of synthetic phosphopeptides and is used to process real biological samples. It is demonstrated to be a method requiring only computational resources that supplements the probability-based approaches for phosphorylation site localization of singly and multiply phosphorylated peptides.


Asunto(s)
Fosfopéptidos , Espectrometría de Masas en Tándem , Cromatografía Liquida , Simulación por Computador , Fosfopéptidos/metabolismo , Fosforilación
11.
J Proteome Res ; 20(11): 5218-5221, 2021 11 05.
Artículo en Inglés | MEDLINE | ID: mdl-34669399

RESUMEN

Affinity ligands such as antibodies are widely used in (bio)medical research for purifying proteins from complex biological samples. These ligands are generally immobilized onto solid supports which facilitate the separation of a captured protein from the sample matrix. Adsorptive microtiter plates are commonly used as solid supports prior to immunochemical detection (e.g., immunoassays) but hardly ever prior to liquid chromatography-mass spectrometry (LC-MS-)-based detection. Here, we describe the use of adsorptive microtiter plates for protein enrichment prior to LC-MS detection, and we discuss opportunities and challenges of corresponding workflows, based on examples of targeted (i.e., soluble receptor for advanced glycation end-products (sRAGE) in human serum) and discovery-based workflows (i.e., transcription factor p65 (NF-κB) in lysed murine RAW 264.7 macrophages and peptidyl-prolyl cis-trans isomerase FKBP5 (FKBP5) in lysed human A549 alveolar basal epithelial cells). Thereby, we aim to highlight the potential usefulness of adsorptive microtiter plates in affinity purification workflows prior to LC-MS detection, which could increase their usage in mass spectrometry-based protein research.


Asunto(s)
Flujo de Trabajo , Animales , Cromatografía de Afinidad , Cromatografía Liquida/métodos , Humanos , Espectrometría de Masas/métodos , Ratones , Receptor para Productos Finales de Glicación Avanzada
12.
J Cell Mol Med ; 2021 Jun 18.
Artículo en Inglés | MEDLINE | ID: mdl-34146379

RESUMEN

The extracellular matrix (ECM) is the tissue microenvironment that regulates the characteristics of stromal and systemic cells to control processes such as inflammation and angiogenesis. Despite ongoing anti-inflammatory treatment, low levels of inflammation exist in the airways in asthma, which alters ECM deposition by airway smooth muscle (ASM) cells. The altered ECM causes aberrant behaviour of cells, such as endothelial cells, in the airway tissue. We therefore sought to characterize the composition and angiogenic potential of the ECM deposited by asthmatic and non-asthmatic ASM. After 72 hours under non-stimulated conditions, the ECM deposited by primary human asthmatic ASM cells was equal in total protein, collagen I, III and fibronectin content to that from non-asthmatic ASM cells. Further, the matrices of non-asthmatic and asthmatic ASM cells were equivalent in regulating the growth, activity, attachment and migration of primary human umbilical vein endothelial cells (HUVECs). Under basal conditions, asthmatic and non-asthmatic ASM cells intrinsically deposit an ECM of equivalent composition and angiogenic potential. Previous findings indicate that dysregulation of the airway ECM is driven even by low levels of inflammatory provocation. This study suggests the need for more effective anti-inflammatory therapies in asthma to maintain the airway ECM and regulate ECM-mediated aberrant angiogenesis.

13.
Anal Chem ; 93(32): 11215-11224, 2021 08 17.
Artículo en Inglés | MEDLINE | ID: mdl-34355890

RESUMEN

The accurate processing of complex liquid chromatography coupled to tandem mass spectrometry (LC-MS/MS) data from biological samples is a major challenge for metabolomics, proteomics, and related approaches. Here, we present the pipelines and systems for threshold-avoiding quantification (PASTAQ) LC-MS/MS preprocessing toolset, which allows highly accurate quantification of data-dependent acquisition LC-MS/MS datasets. PASTAQ performs compound quantification using single-stage (MS1) data and implements novel algorithms for high-performance and accurate quantification, retention time alignment, feature detection, and linking annotations from multiple identification engines. PASTAQ offers straightforward parameterization and automatic generation of quality control plots for data and preprocessing assessment. This design results in smaller variance when analyzing replicates of proteomes mixed with known ratios and allows the detection of peptides over a larger dynamic concentration range compared to widely used proteomics preprocessing tools. The performance of the pipeline is also demonstrated in a biological human serum dataset for the identification of gender-related proteins.


Asunto(s)
Proteómica , Espectrometría de Masas en Tándem , Algoritmos , Cromatografía Liquida , Humanos , Péptidos , Proteoma
14.
Anal Chem ; 93(38): 12872-12880, 2021 09 28.
Artículo en Inglés | MEDLINE | ID: mdl-34519498

RESUMEN

Histone acetylation is an important, reversible post-translational protein modification and a hallmark of epigenetic regulation. However, little is known about the dynamics of this process, due to the lack of analytical methods that can capture site-specific acetylation and deacetylation reactions. We present a new approach that combines metabolic and chemical labeling (CoMetChem) using uniformly 13C-labeled glucose and stable isotope-labeled acetic anhydride. Thereby, chemically equivalent, fully acetylated histone species are generated, enabling accurate relative quantification of site-specific lysine acetylation dynamics in tryptic peptides using high-resolution mass spectrometry. We show that CoMetChem enables site-specific quantification of the incorporation or loss of lysine acetylation over time, allowing the determination of reaction rates for acetylation and deacetylation. Thus, the CoMetChem methodology provides a comprehensive description of site-specific acetylation dynamics.


Asunto(s)
Epigénesis Genética , Histonas , Acetilación , Cromatografía Liquida , Histonas/metabolismo , Isótopos , Procesamiento Proteico-Postraduccional , Espectrometría de Masas en Tándem
15.
Rapid Commun Mass Spectrom ; 35(17): e9141, 2021 Sep 15.
Artículo en Inglés | MEDLINE | ID: mdl-34106497

RESUMEN

RATIONALE: The World Antidoping Agency (WADA) Monitoring program concentrates analytical data from the WADA Accredited Laboratories for substances which are not prohibited but whose potential misuse must be known. The WADA List of Monitoring substances is updated annually, where substances may be removed, introduced or transferred to the Prohibited List, depending on the prevalence of their use. Retroactive processing of old sample datafiles has the potential to create information for the prevalence of use of candidate substances for the Monitoring List in previous years. MetAlign is a freeware software with functionality to reduce the size of liquid chromatography (LC)/high-resolution (HR) full-scan (FS) mass spectrometry (MS) datafiles and to perform a fast search for the presence of substances in thousands of reduced datafiles. METHODS: Validation was performed to the search procedure of MetAlign applied to Anti-Doping Lab Qatar (ADLQ)-screened LC/HR-FS-MS reduced datafiles originated from antidoping samples for tramadol (TRA), ecdysterone (ECDY) and the ECDY metabolite 14-desoxyecdysterone (DESECDY) of the WADA Monitoring List. Searching parameters were related to combinations of accurate masses and retention times (RTs). RESULTS: MetAlign search validation criteria were based on the creation of correct identifications, false positives (FPs) and false negatives (FNs). The search for TRA in 7410 ADLQ routine LC/HR-FS-MS datafiles from the years 2017 to 2020 revealed no false identification (FPs and FNs) compared with the ADLQ WADA reports. ECDY and DESECDY were detected by MetAlign search in approximately 5% of the same cohort of antidoping samples. CONCLUSIONS: MetAlign is a powerful tool for the fast retroactive processing of old reduced datafiles collected in screening by LC/HR-FS-MS to reveal the prevalence of use of antidoping substances. The current study proposed the validation scheme of the MetAlign search procedure, to be implemented per individual substance in the WADA Monitoring program, for the elimination of FNs and FPs.


Asunto(s)
Anabolizantes/orina , Cromatografía Liquida/métodos , Doping en los Deportes/métodos , Ecdisterona/orina , Espectrometría de Masas/métodos , Tramadol/orina , Doping en los Deportes/prevención & control , Humanos , Orina/química
16.
Thorax ; 75(2): 180-183, 2020 02.
Artículo en Inglés | MEDLINE | ID: mdl-31937552

RESUMEN

Translation of genomic alterations to protein changes in chronic obstructive pulmonary disease (COPD) is largely unexplored. Using integrated proteomic and RNA sequencing analysis of COPD and control lung tissues, we identified a protein signature in COPD characterised by extracellular matrix changes and a potential regulatory role for SUMO2. Furthermore, we identified 61 differentially expressed novel, non-reference, peptides in COPD compared with control lungs. This included two peptides encoding for a new splice variant of SORBS1, of which the transcript usage was higher in COPD compared with control lungs. These explorative findings and integrative proteogenomic approach open new avenues to further unravel the pathology of COPD.


Asunto(s)
Regulación de la Expresión Génica/genética , Proteínas de Microfilamentos/genética , Isoformas de Proteínas/genética , Proteogenómica/métodos , Enfermedad Pulmonar Obstructiva Crónica/genética , Enfermedad Pulmonar Obstructiva Crónica/fisiopatología , Anciano , Estudios de Casos y Controles , Progresión de la Enfermedad , Femenino , Humanos , Masculino , Persona de Mediana Edad , Valores de Referencia , Medición de Riesgo , Índice de Severidad de la Enfermedad
17.
Anal Chem ; 92(24): 16138-16148, 2020 12 15.
Artículo en Inglés | MEDLINE | ID: mdl-33317272

RESUMEN

Mass spectrometry imaging (MSI) is a technique that provides comprehensive molecular information with high spatial resolution from tissue. Today, there is a strong push toward sharing data sets through public repositories in many research fields where MSI is commonly applied; yet, there is no standardized protocol for analyzing these data sets in a reproducible manner. Shifts in the mass-to-charge ratio (m/z) of molecular peaks present a major obstacle that can make it impossible to distinguish one compound from another. Here, we present a label-free m/z alignment approach that is compatible with multiple instrument types and makes no assumptions on the sample's molecular composition. Our approach, MSIWarp (https://github.com/horvatovichlab/MSIWarp), finds an m/z recalibration function by maximizing a similarity score that considers both the intensity and m/z position of peaks matched between two spectra. MSIWarp requires only centroid spectra to find the recalibration function and is thereby readily applicable to almost any MSI data set. To deal with particularly misaligned or peak-sparse spectra, we provide an option to detect and exclude spurious peak matches with a tailored random sample consensus (RANSAC) procedure. We evaluate our approach with four publicly available data sets from both time-of-flight (TOF) and Orbitrap instruments and demonstrate up to 88% improvement in m/z alignment.

18.
Bioinformatics ; 35(23): 5011-5017, 2019 12 01.
Artículo en Inglés | MEDLINE | ID: mdl-31077287

RESUMEN

MOTIVATION: One of the main goals in systems biology is to learn molecular regulatory networks from quantitative profile data. In particular, Gaussian graphical models (GGMs) are widely used network models in bioinformatics where variables (e.g. transcripts, metabolites or proteins) are represented by nodes, and pairs of nodes are connected with an edge according to their partial correlation. Reconstructing a GGM from data is a challenging task when the sample size is smaller than the number of variables. The main problem consists in finding the inverse of the covariance estimator which is ill-conditioned in this case. Shrinkage-based covariance estimators are a popular approach, producing an invertible 'shrunk' covariance. However, a proper significance test for the 'shrunk' partial correlation (i.e. the GGM edges) is an open challenge as a probability density including the shrinkage is unknown. In this article, we present (i) a geometric reformulation of the shrinkage-based GGM, and (ii) a probability density that naturally includes the shrinkage parameter. RESULTS: Our results show that the inference using this new 'shrunk' probability density is as accurate as Monte Carlo estimation (an unbiased non-parametric method) for any shrinkage value, while being computationally more efficient. We show on synthetic data how the novel test for significance allows an accurate control of the Type I error and outperforms the network reconstruction obtained by the widely used R package GeneNet. This is further highlighted in two gene expression datasets from stress response in Eschericha coli, and the effect of influenza infection in Mus musculus. AVAILABILITY AND IMPLEMENTATION: https://github.com/V-Bernal/GGM-Shrinkage. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Asunto(s)
Algoritmos , Programas Informáticos , Animales , Ratones , Método de Montecarlo , Distribución Normal , Biología de Sistemas
19.
Cell Biol Toxicol ; 36(3): 261-272, 2020 06.
Artículo en Inglés | MEDLINE | ID: mdl-31599373

RESUMEN

In the advanced stages, malignant melanoma (MM) has a very poor prognosis. Due to tremendous efforts in cancer research over the last 10 years, and the introduction of novel therapies such as targeted therapies and immunomodulators, the rather dark horizon of the median survival has dramatically changed from under 1 year to several years. With the advent of proteomics, deep-mining studies can reach low-abundant expression levels. The complexity of the proteome, however, still surpasses the dynamic range capabilities of current analytical techniques. Consequently, many predicted protein products with potential biological functions have not yet been verified in experimental proteomic data. This category of 'missing proteins' (MP) is comprised of all proteins that have been predicted but are currently unverified. As part of the initiative launched in 2016 in the USA, the European Cancer Moonshot Center has performed numerous deep proteomics analyses on samples from MM patients. In this study, nine MPs were clearly identified by mass spectrometry in MM metastases. Some MPs significantly correlated with proteins that possess identical PFAM structural domains; and other MPs were significantly associated with cancer-related proteins. This is the first study to our knowledge, where unknown and novel proteins have been annotated in metastatic melanoma tumour tissue.


Asunto(s)
Melanoma/genética , Metástasis de la Neoplasia/genética , Proteómica/métodos , Adulto , Biomarcadores de Tumor/genética , Femenino , Genoma Humano/genética , Humanos , Masculino , Persona de Mediana Edad , Anotación de Secuencia Molecular/métodos , Anotación de Secuencia Molecular/tendencias , Pronóstico , Proteoma/genética , Proteoma/metabolismo , Neoplasias Cutáneas/genética , Melanoma Cutáneo Maligno
20.
Anal Chem ; 91(18): 11888-11896, 2019 09 17.
Artículo en Inglés | MEDLINE | ID: mdl-31403280

RESUMEN

Mass spectrometry imaging (MSI) has the potential to reveal the localization of thousands of biomolecules such as metabolites and lipids in tissue sections. The increase in both mass and spatial resolution of today's instruments brings on considerable challenges in terms of data processing; accurately extracting meaningful signals from the large data sets generated by MSI without losing information that could be clinically relevant is one of the most fundamental tasks of analysis software. Ion images of the biomolecules are generated by visualizing their intensities in 2-D space using mass spectra collected across the tissue section. The intensities are often calculated by summing each compound's signal between predefined sets of borders (bins) in the m/z dimension. This approach, however, can result in mixed signals from different compounds in the same bin or splitting the signal from one compound between two adjacent bins, leading to low quality ion images. To remedy this problem, we propose a novel data processing approach. Our approach consists of a sensitive peak detection method able to discover both faint and localized signals by utilizing clusterwise kernel density estimates (KDEs) of peak distributions. We show that our method can recall more ground-truth molecules, molecule fragments, and isotopes than existing methods based on binning. Furthermore, it automatically detects previously reported molecular ions of lipids, including those close in m/z, in an experimental data set.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA