Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 662
Filtrar
1.
Proteomics ; 24(16): e2470124, 2024 Aug.
Artículo en Inglés | MEDLINE | ID: mdl-39148217
2.
Methods Mol Biol ; 2817: 177-220, 2024.
Artículo en Inglés | MEDLINE | ID: mdl-38907155

RESUMEN

Mass-spectrometry (MS)-based single-cell proteomics (SCP) explores cellular heterogeneity by focusing on the functional effectors of the cells-proteins. However, extracting meaningful biological information from MS data is far from trivial, especially with single cells. Currently, data analysis workflows are substantially different from one research team to another. Moreover, it is difficult to evaluate pipelines as ground truths are missing. Our team has developed the R/Bioconductor package called scp to provide a standardized framework for SCP data analysis. It relies on the widely used QFeatures and SingleCellExperiment data structures. In addition, we used a design containing cell lines mixed in known proportions to generate controlled variability for data analysis benchmarking. In this chapter, we provide a flexible data analysis protocol for SCP data using the scp package together with comprehensive explanations at each step of the processing. Our main steps are quality control on the feature and cell level, aggregation of the raw data into peptides and proteins, normalization, and batch correction. We validate our workflow using our ground truth data set. We illustrate how to use this modular, standardized framework and highlight some crucial steps.


Asunto(s)
Espectrometría de Masas , Proteómica , Análisis de la Célula Individual , Programas Informáticos , Flujo de Trabajo , Proteómica/métodos , Proteómica/normas , Análisis de la Célula Individual/métodos , Espectrometría de Masas/métodos , Humanos , Biología Computacional/métodos , Proteoma/análisis , Análisis de Datos
3.
J Am Soc Mass Spectrom ; 35(7): 1539-1549, 2024 Jul 03.
Artículo en Inglés | MEDLINE | ID: mdl-38864778

RESUMEN

Ion mobility spectrometry (IMS) is a gas-phase analytical technique that separates ions with different sizes and shapes and is compatible with mass spectrometry (MS) to provide an additional separation dimension. The rapid nature of the IMS separation combined with the high sensitivity of MS-based detection and the ability to derive structural information on analytes in the form of the property collision cross section (CCS) makes IMS particularly well-suited for characterizing complex samples in -omics applications. In such applications, the quality of CCS from IMS measurements is critical to confident annotation of the detected components in the complex -omics samples. However, most IMS instrumentation in mainstream use requires calibration to calculate CCS from measured arrival times, with the most notable exception being drift tube IMS measurements using multifield methods. The strategy for calibrating CCS values, particularly selection of appropriate calibrants, has important implications for CCS accuracy, reproducibility, and transferability between laboratories. The conventional approach to CCS calibration involves explicitly defining calibrants ahead of data acquisition and crucially relies upon availability of reference CCS values. In this work, we present a novel reference-free approach to CCS calibration which leverages trends among putatively identified features and computational CCS prediction to conduct calibrations post-data acquisition and without relying on explicitly defined calibrants. We demonstrated the utility of this reference-free CCS calibration strategy for proteomics application using high-resolution structures for lossless ion manipulations (SLIM)-based IMS-MS. We first validated the accuracy of CCS values using a set of synthetic peptides and then demonstrated using a complex peptide sample from cell lysate.


Asunto(s)
Espectrometría de Movilidad Iónica , Espectrometría de Masas , Proteómica , Espectrometría de Movilidad Iónica/métodos , Proteómica/métodos , Proteómica/normas , Calibración , Espectrometría de Masas/métodos , Péptidos/análisis , Péptidos/química , Reproducibilidad de los Resultados , Humanos
4.
J Proteome Res ; 23(8): 3704-3715, 2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-38943634

RESUMEN

Proteome coverage and accurate protein quantification are both important for evaluating biological systems; however, compromises between quantification, coverage, and mass spectrometry (MS) resources are often necessary. Consequently, experimental parameters that impact coverage and quantification must be adjusted, depending on experimental goals. Among these parameters is offline prefractionation, which is utilized in MS-based proteomics to decrease sample complexity resulting in higher overall proteome coverage upon MS analysis. Prefractionation leads to increases in required MS analysis time, although this is often mitigated by isobaric labeling using tandem-mass tags (TMT), which allow samples to be multiplexed. Here we evaluate common prefractionation schemes, TMT variants, and MS acquisition methods and their impact on protein quantification and coverage. Furthermore, we provide recommendations for experimental design depending on the experimental goals.


Asunto(s)
Proteoma , Proteómica , Espectrometría de Masas en Tándem , Proteómica/métodos , Proteómica/normas , Espectrometría de Masas en Tándem/métodos , Proteoma/análisis , Humanos , Fraccionamiento Químico/métodos , Coloración y Etiquetado/métodos
5.
J Am Soc Mass Spectrom ; 35(8): 1875-1882, 2024 Aug 07.
Artículo en Inglés | MEDLINE | ID: mdl-38918936

RESUMEN

Mass spectrometry is a powerful technique for analyzing molecules in complex biological samples. However, inter- and intralaboratory variability and bias can affect the data due to various factors, including sample handling and preparation, instrument calibration and performance, and data acquisition and processing. To address this issue, the Quality Control (QC) working group of the Human Proteome Organization's Proteomics Standards Initiative has established the standard mzQC file format for reporting and exchanging information relating to data quality. mzQC is based on the JavaScript Object Notation (JSON) format and provides a lightweight yet versatile file format that can be easily implemented in software. Here, we present open-source software libraries to process mzQC data in three programming languages: Python, using pymzqc; R, using rmzqc; and Java, using jmzqc. The libraries follow a common data model and provide shared functionalities, including the (de)serialization and validation of mzQC files. We demonstrate use of the software libraries in a workflow for extracting, analyzing, and visualizing QC metrics from different sources. Additionally, we show how these libraries can be integrated with each other, with existing software tools, and in automated workflows for the QC of mass spectrometry data. All software libraries are available as open source under the MS-Quality-Hub organization on GitHub (https://github.com/MS-Quality-Hub).


Asunto(s)
Espectrometría de Masas , Lenguajes de Programación , Proteómica , Control de Calidad , Programas Informáticos , Espectrometría de Masas/métodos , Espectrometría de Masas/normas , Humanos , Proteómica/métodos , Proteómica/normas , Flujo de Trabajo
6.
J Am Soc Mass Spectrom ; 35(7): 1441-1450, 2024 Jul 03.
Artículo en Inglés | MEDLINE | ID: mdl-38815255

RESUMEN

Currently, glycopeptide quantitation is mainly based on relative quantitation due to absolute quantitation requiring isotope-labeled or standard glycopeptides which may not be commercially available or are very costly and time consuming to synthesize. To address this grand challenge, coulometric mass spectrometry (CMS), based on the combination of electrochemistry (EC) and mass spectrometry (MS), was utilized to quantify electrochemically active glycopeptides without the need of using standard materials. In this study, we studied tyrosine-containing glycopeptides, NYIVGQPSS(ß-GlcNAc)TGNL-OH and NYSVPSS(ß-GlcNAc)TGNL-OH, and successfully quantified them directly with CMS with a discrepancy of less than 5% between the CMS measured amount and the theoretical amount. Taking one step further, we applied this approach to quantify glycopeptides generated from the digestion of NIST mAb, a monoclonal antibody reference material. Through HILIC column separation, five N297 glycopeptides resulting from NIST mAb tryptic digestion were successfully separated and quantified by CMS for an absolute amount without the use of any standard materials. This study indicates the potential utility of CMS for quantitative proteomics research.


Asunto(s)
Glicopéptidos , Espectrometría de Masas , Oxidación-Reducción , Glicopéptidos/análisis , Glicopéptidos/química , Espectrometría de Masas/métodos , Anticuerpos Monoclonales/química , Anticuerpos Monoclonales/análisis , Proteómica/métodos , Proteómica/normas , Tirosina/análisis , Tirosina/química , Técnicas Electroquímicas/métodos
7.
J Clin Oncol ; 42(16): 1961-1974, 2024 Jun 01.
Artículo en Inglés | MEDLINE | ID: mdl-38608213

RESUMEN

Effective diagnosis, prognostication, and management of CNS malignancies traditionally involves invasive brain biopsies that pose significant risk to the patient. Sampling and molecular profiling of cerebrospinal fluid (CSF) is a safer, rapid, and noninvasive alternative that offers a snapshot of the intracranial milieu while overcoming the challenge of sampling error that plagues conventional brain biopsy. Although numerous biomarkers have been identified, translational challenges remain, and standardization of protocols is necessary. Here, we systematically reviewed 141 studies (Medline, SCOPUS, and Biosis databases; between January 2000 and September 29, 2022) that molecularly profiled CSF from adults with brain malignancies including glioma, brain metastasis, and primary and secondary CNS lymphomas. We provide an overview of promising CSF biomarkers, propose CSF reporting guidelines, and discuss the various considerations that go into biomarker discovery, including the influence of blood-brain barrier disruption, cell of origin, and site of CSF acquisition (eg, lumbar and ventricular). We also performed a meta-analysis of proteomic data sets, identifying biomarkers in CNS malignancies and establishing a resource for the research community.


Asunto(s)
Biomarcadores de Tumor , Neoplasias Encefálicas , Humanos , Biomarcadores de Tumor/líquido cefalorraquídeo , Neoplasias Encefálicas/líquido cefalorraquídeo , Proteómica/métodos , Proteómica/normas , Neoplasias del Sistema Nervioso Central/líquido cefalorraquídeo , Neoplasias del Sistema Nervioso Central/diagnóstico
8.
J Proteome Res ; 23(8): 3235-3248, 2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-38412263

RESUMEN

Currently, no consensus exists regarding criteria required to designate a protein within a proteomic data set as a cell surface protein. Most published proteomic studies rely on varied ontology annotations or computational predictions instead of experimental evidence when attributing protein localization. Consequently, standardized approaches for analyzing and reporting cell surface proteome data sets would increase confidence in localization claims and promote data use by other researchers. Recently, we developed Veneer, a web-based bioinformatic tool that analyzes results from cell surface N-glycocapture workflows─the most popular cell surface proteomics method used to date that generates experimental evidence of subcellular location. Veneer assigns protein localization based on defined experimental and bioinformatic evidence. In this study, we updated the criteria and process for assigning protein localization and added new functionality to Veneer. Results of Veneer analysis of 587 cell surface N-glycocapture data sets from 32 published studies demonstrate the importance of applying defined criteria when analyzing cell surface proteomics data sets and exemplify how Veneer can be used to assess experimental quality and facilitate data extraction for informing future biological studies and annotating public repositories.


Asunto(s)
Biología Computacional , Proteómica , Programas Informáticos , Proteómica/métodos , Proteómica/normas , Biología Computacional/métodos , Animales , Proteínas de la Membrana/metabolismo , Proteínas de la Membrana/análisis , Humanos , Anotación de Secuencia Molecular , Glicosilación , Bases de Datos de Proteínas , Proteoma/análisis , Proteoma/metabolismo , Internet
9.
J Proteome Res ; 23(8): 3141-3148, 2024 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-38301217

RESUMEN

We present RawVegetable 2.0, a software tailored for assessing mass spectrometry data quality and fine-tuned for cross-linking mass spectrometry (XL-MS) applications. Building upon the capabilities of its predecessor, RawVegetable 2.0 introduces four main modules, each providing distinct and new functionalities: 1) Pair Finder, which identifies ion doublets characteristic of cleavable cross-linking experiments; 2) Diagnostic Peak Finder, which locates potential reporter ions associated with a specific cross-linker; 3) Precursor Signal Ratio, which computes the ratio between precursor intensity and the total signal in an MS/MS scan; and 4) Xrea, which evaluates spectral quality by analyzing the heterogeneity of peak intensities within a spectrum. These modules collectively streamline the process of optimizing mass spectrometry data acquisition for both Proteomics and XL-MS experiments. RawVegetable 2.0, along with a comprehensive tutorial is freely accessible for academic use at: http://patternlabforproteomics.org/rawvegetable2.


Asunto(s)
Proteómica , Control de Calidad , Programas Informáticos , Espectrometría de Masas en Tándem , Espectrometría de Masas en Tándem/métodos , Espectrometría de Masas en Tándem/normas , Proteómica/métodos , Proteómica/normas
10.
Nat Biotechnol ; 40(5): 692-702, 2022 05.
Artículo en Inglés | MEDLINE | ID: mdl-35102292

RESUMEN

Implementing precision medicine hinges on the integration of omics data, such as proteomics, into the clinical decision-making process, but the quantity and diversity of biomedical data, and the spread of clinically relevant knowledge across multiple biomedical databases and publications, pose a challenge to data integration. Here we present the Clinical Knowledge Graph (CKG), an open-source platform currently comprising close to 20 million nodes and 220 million relationships that represent relevant experimental data, public databases and literature. The graph structure provides a flexible data model that is easily extendable to new nodes and relationships as new databases become available. The CKG incorporates statistical and machine learning algorithms that accelerate the analysis and interpretation of typical proteomics workflows. Using a set of proof-of-concept biomarker studies, we show how the CKG might augment and enrich proteomics data and help inform clinical decision-making.


Asunto(s)
Bases del Conocimiento , Medicina de Precisión/métodos , Proteómica , Algoritmos , Toma de Decisiones Asistida por Computador , Aprendizaje Automático , Reconocimiento de Normas Patrones Automatizadas , Medicina de Precisión/normas , Proteómica/normas , Proteómica/estadística & datos numéricos
11.
Sci Rep ; 12(1): 1186, 2022 01 24.
Artículo en Inglés | MEDLINE | ID: mdl-35075163

RESUMEN

Cancer biomarker discovery is critically dependent on the integrity of biofluid and tissue samples acquired from study participants. Multi-omic profiling of candidate protein, lipid, and metabolite biomarkers is confounded by timing and fasting status of sample collection, participant demographics and treatment exposures of the study population. Contamination by hemoglobin, whether caused by hemolysis during sample preparation or underlying red cell fragility, contributes 0-10 g/L of extraneous protein to plasma, serum, and Buffy coat samples and may interfere with biomarker detection and validation. We analyzed 617 plasma, 701 serum, and 657 buffy coat samples from a 7-year longitudinal multi-omic biomarker discovery program evaluating 400+ participants with or at risk for pancreatic cancer, known as Project Survival. Hemolysis was undetectable in 93.1% of plasma and 95.0% of serum samples, whereas only 37.1% of buffy coat samples were free of contamination by hemoglobin. Regression analysis of multi-omic data demonstrated a statistically significant correlation between hemoglobin concentration and the resulting pattern of analyte detection and concentration. Although hemolysis had the greatest impact on identification and quantitation of the proteome, distinct differentials in metabolomics and lipidomics were also observed and correlated with severity. We conclude that quality control is vital to accurate detection of informative molecular differentials using OMIC technologies and that caution must be exercised to minimize the impact of hemolysis as a factor driving false discovery in large cancer biomarker studies.


Asunto(s)
Biomarcadores/sangre , Hemólisis , Lipidómica/normas , Neoplasias Pancreáticas/sangre , Pancreatitis/sangre , Proteómica/normas , Estudios de Casos y Controles , Femenino , Humanos , Masculino , Espectrometría de Masas , Medicina de Precisión
12.
J Am Soc Mass Spectrom ; 33(1): 17-30, 2022 Jan 05.
Artículo en Inglés | MEDLINE | ID: mdl-34813325

RESUMEN

Global and phosphoproteome profiling has demonstrated great utility for the analysis of clinical specimens. One barrier to the broad clinical application of proteomic profiling is the large amount of biological material required, particularly for phosphoproteomics─currently on the order of 25 mg wet tissue weight. For hematopoietic cancers such as acute myeloid leukemia (AML), the sample requirement is ≥10 million peripheral blood mononuclear cells (PBMCs). Across large study cohorts, this requirement will exceed what is obtainable for many individual patients/time points. For this reason, we were interested in the impact of differential peptide loading across multiplex channels on proteomic data quality. To achieve this, we tested a range of channel loading amounts (approximately the material obtainable from 5E5, 1E6, 2.5E6, 5E6, and 1E7 AML patient cells) to assess proteome coverage, quantification precision, and peptide/phosphopeptide detection in experiments utilizing isobaric tandem mass tag (TMT) labeling. As expected, fewer missing values were observed in TMT channels with higher peptide loading amounts compared to lower loadings. Moreover, channels with a lower loading have greater quantitative variability than channels with higher loadings. A statistical analysis showed that decreased loading amounts result in an increase in the type I error rate. We then examined the impact of differential loading on the detection of known differences between distinct AML cell lines. Similar patterns of increased data missingness and higher quantitative variability were observed as loading was decreased resulting in fewer statistical differences; however, we found good agreement in features identified as differential, demonstrating the value of this approach.


Asunto(s)
Fosfopéptidos , Proteómica/métodos , Proteómica/normas , Espectrometría de Masas en Tándem/métodos , Espectrometría de Masas en Tándem/normas , Células Cultivadas , Cromatografía de Afinidad , Exactitud de los Datos , Humanos , Marcaje Isotópico , Leucocitos Mononucleares/química , Fosfopéptidos/análisis , Fosfopéptidos/química , Fosfopéptidos/aislamiento & purificación
13.
Nucleic Acids Res ; 50(D1): D1535-D1540, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34718696

RESUMEN

Proteome-pI 2.0 is an update of an online database containing predicted isoelectric points and pKa dissociation constants of proteins and peptides. The isoelectric point-the pH at which a particular molecule carries no net electrical charge-is an important parameter for many analytical biochemistry and proteomics techniques. Additionally, it can be obtained directly from the pKa values of individual charged residues of the protein. The Proteome-pI 2.0 database includes data for over 61 million protein sequences from 20 115 proteomes (three to four times more than the previous release). The isoelectric point for proteins is predicted by 21 methods, whereas pKa values are inferred by one method. To facilitate bottom-up proteomics analysis, individual proteomes were digested in silico with the five most commonly used proteases (trypsin, chymotrypsin, trypsin + LysC, LysN, ArgC), and the peptides' isoelectric point and molecular weights were calculated. The database enables the retrieval of virtual 2D-PAGE plots and customized fractions of a proteome based on the isoelectric point and molecular weight. In addition, isoelectric points for proteins in NCBI non-redundant (nr), UniProt, SwissProt, and Protein Data Bank are available in both CSV and FASTA formats. The database can be accessed at http://isoelectricpointdb2.org.


Asunto(s)
Bases de Datos de Proteínas , Punto Isoeléctrico , Péptidos/química , Proteoma/química , Secuencia de Aminoácidos/genética , Biología Computacional , Electroforesis en Gel Bidimensional , Peso Molecular , Proteoma/clasificación , Proteómica/normas
14.
Nucleic Acids Res ; 50(D1): D1491-D1499, 2022 01 07.
Artículo en Inglés | MEDLINE | ID: mdl-34718741

RESUMEN

As a crucial molecular mechanism, post-translational modifications (PTMs) play critical roles in a wide range of biological processes in plants. Recent advances in mass spectrometry-based proteomic technologies have greatly accelerated the profiling and quantification of plant PTM events. Although several databases have been constructed to store plant PTM data, a resource including more plant species and more PTM types with quantitative dynamics still remains to be developed. In this paper, we present an integrative database of quantitative PTMs in plants named qPTMplants (http://qptmplants.omicsbio.info), which hosts 1 242 365 experimentally identified PTM events for 429 821 nonredundant sites on 123 551 proteins under 583 conditions for 23 PTM types in 43 plant species from 293 published studies, with 620 509 quantification events for 136 700 PTM sites on 55 361 proteins under 354 conditions. Moreover, the experimental details, such as conditions, samples, instruments and methods, were manually curated, while a variety of annotations, including the sequence and structural characteristics, were integrated into qPTMplants. Then, various search and browse functions were implemented to access the qPTMplants data in a user-friendly manner. Overall, we anticipate that the qPTMplants database will be a valuable resource for further research on PTMs in plants.


Asunto(s)
Bases de Datos de Proteínas , Plantas/genética , Procesamiento Proteico-Postraduccional/genética , Proteínas/genética , Plantas/clasificación , Proteínas/clasificación , Proteómica/normas
15.
Int J Mol Sci ; 22(21)2021 Nov 08.
Artículo en Inglés | MEDLINE | ID: mdl-34769509

RESUMEN

According to proteomics technology, as impacted by the complexity of sampling in the experimental process, several problems remain with the reproducibility of mass spectrometry experiments, and the peptide identification and quantitative results continue to be random. Predicting the detectability exhibited by peptides can optimize the mentioned results to be more accurate, so such a prediction is of high research significance. This study builds a novel method to predict the detectability of peptides by complying with the capsule network (CapsNet) and the convolutional block attention module (CBAM). First, the residue conical coordinate (RCC), the amino acid composition (AAC), the dipeptide composition (DPC), and the sequence embedding code (SEC) are extracted as the peptide chain features. Subsequently, these features are divided into the biological feature and sequence feature, and separately inputted into the neural network of CapsNet. Moreover, the attention module CBAM is added to the network to assign weights to channels and spaces, as an attempt to enhance the feature learning and improve the network training effect. To verify the effectiveness of the proposed method, it is compared with some other popular methods. As revealed from the experimentally achieved results, the proposed method outperforms those methods in most performance assessments.


Asunto(s)
Aminoácidos/química , Biología Computacional/métodos , Espectrometría de Masas/métodos , Péptidos/análisis , Proteómica/métodos , Algoritmos , Bases de Datos de Proteínas , Humanos , Redes Neurales de la Computación , Péptidos/química , Proteómica/normas , Reproducibilidad de los Resultados
16.
Int J Mol Sci ; 22(17)2021 Sep 06.
Artículo en Inglés | MEDLINE | ID: mdl-34502557

RESUMEN

Analysis of differential abundance in proteomics data sets requires careful application of missing value imputation. Missing abundance values widely vary when performing comparisons across different sample treatments. For example, one would expect a consistent rate of "missing at random" (MAR) across batches of samples and varying rates of "missing not at random" (MNAR) depending on the inherent difference in sample treatments within the study. The missing value imputation strategy must thus be selected that best accounts for both MAR and MNAR simultaneously. Several important issues must be considered when deciding the appropriate missing value imputation strategy: (1) when it is appropriate to impute data; (2) how to choose a method that reflects the combinatorial manner of MAR and MNAR that occurs in an experiment. This paper provides an evaluation of missing value imputation strategies used in proteomics and presents a case for the use of hybrid left-censored missing value imputation approaches that can handle the MNAR problem common to proteomics data.


Asunto(s)
Exactitud de los Datos , Bases de Datos de Proteínas/estadística & datos numéricos , Espectrometría de Masas/métodos , Proteómica/estadística & datos numéricos , Neoplasias de la Mama/metabolismo , Neoplasias de la Mama/patología , Línea Celular Tumoral , Glucosa/metabolismo , Humanos , Proteómica/métodos , Proteómica/normas
17.
PLoS One ; 16(8): e0256167, 2021.
Artículo en Inglés | MEDLINE | ID: mdl-34411146

RESUMEN

Saliva biomarkers are suitable for monitoring the therapeutic response of canine oral melanoma (COM), because saliva directly contacts the tumor, and saliva collection is non-invasive, convenient and cost effective. The present study aimed to investigate novel biomarkers from the salivary proteome of COM treated with surgery and a chemotherapy drug, carboplatin, 1-6 times, using a liquid chromatography-tandem mass spectrometry approach. The expression of a potential salivary biomarker, ubiquitin D (UBD), was observed and verified by western blot analysis. A significantly increased ratio of free UBD (fUBD) to conjugated UBD (cUBD) was shown in the pre-surgery stage (PreS) in OM dogs with short-term survival (STS) (less than 12 months after surgery) compared with that with long-term survival (more than 12 months after surgery). In dogs with STS, the ratio was also shown to be augmented in PreS compared with that after surgery, followed by treatment with carboplatin twice, 4 and 5 times [After treatment (AT)2, AT4 and AT5]. In addition, the expression of fUBD was enhanced in PreS compared with that of AT2 in the STS group. In conclusion, this study revealed that a ratio of fUBD to cUBD in PreS was plausibly shown to be a potential prognostic biomarker for survival in dogs with OM.


Asunto(s)
Melanoma/genética , Neoplasias de la Boca/genética , Proteoma/genética , Glándulas Salivales/metabolismo , Animales , Biomarcadores de Tumor/genética , Cromatografía Liquida , Enfermedades de los Perros/genética , Enfermedades de los Perros/patología , Perros , Regulación Neoplásica de la Expresión Génica/genética , Melanoma/patología , Neoplasias de la Boca/patología , Proteómica/normas , Glándulas Salivales/patología , Proteínas y Péptidos Salivales/genética
18.
Molecules ; 26(16)2021 Aug 06.
Artículo en Inglés | MEDLINE | ID: mdl-34443345

RESUMEN

Protein glycosylation that mediates interactions among viral proteins, host receptors, and immune molecules is an important consideration for predicting viral antigenicity. Viral spike proteins, the proteins responsible for host cell invasion, are especially important to be examined. However, there is a lack of consensus within the field of glycoproteomics regarding identification strategy and false discovery rate (FDR) calculation that impedes our examinations. As a case study in the overlap between software, here as a case study, we examine recently published SARS-CoV-2 glycoprotein datasets with four glycoproteomics identification software with their recommended protocols: GlycReSoft, Byonic, pGlyco2, and MSFragger-Glyco. These software use different Target-Decoy Analysis (TDA) forms to estimate FDR and have different database-oriented search methods with varying degrees of quantification capabilities. Instead of an ideal overlap between software, we observed different sets of identifications with the intersection. When clustering by glycopeptide identifications, we see higher degrees of relatedness within software than within glycosites. Taking the consensus between results yields a conservative and non-informative conclusion as we lose identifications in the desire for caution; these non-consensus identifications are often lower abundance and, therefore, more susceptible to nuanced changes. We conclude that present glycoproteomics softwares are not directly comparable, and that methods are needed to assess their overall results and FDR estimation performance. Once such tools are developed, it will be possible to improve FDR methods and quantify complex glycoproteomes with acceptable confidence, rather than potentially misleading broad strokes.


Asunto(s)
Algoritmos , Glicopéptidos/análisis , Glicoproteínas/análisis , COVID-19/metabolismo , Bases de Datos de Proteínas , Glicopéptidos/química , Glicoproteínas/química , Glicosilación , Humanos , Proteómica/métodos , Proteómica/normas , SARS-CoV-2/metabolismo , Programas Informáticos , Glicoproteína de la Espiga del Coronavirus/análisis , Glicoproteína de la Espiga del Coronavirus/química , Espectrometría de Masas en Tándem/métodos , Proteínas Virales de Fusión/análisis , Proteínas Virales de Fusión/química
19.
Biomolecules ; 11(6)2021 06 03.
Artículo en Inglés | MEDLINE | ID: mdl-34204944

RESUMEN

Proteomics can map extracellular vesicles (EVs), including exosomes, across disease states between organisms and cell types. Due to the diverse origin and cargo of EVs, tailoring methodological and analytical techniques can support the reproducibility of results. Proteomics scans are sensitive to in-sample contaminants, which can be retained during EV isolation procedures. Contaminants can also arise from the biological origin of exosomes, such as the lipid-rich environment in human milk. Human milk (HM) EVs and exosomes are emerging as a research interest in health and disease, though the experimental characterization and functional assays remain varied. Past studies of HM EV proteomes have used data-dependent acquisition methods for protein detection, however, improvements in data independent acquisition could allow for previously undetected EV proteins to be identified by mass spectrometry. Depending on the research question, only a specific population of proteins can be compared and measured using isotope and other labelling techniques. In this review, we summarize published HM EV proteomics protocols and suggest a methodological workflow with the end-goal of effective and reproducible analysis of human milk EV proteomes.


Asunto(s)
Vesículas Extracelulares/química , Proteínas de la Leche/análisis , Leche Humana/química , Proteómica/métodos , Biología Computacional/métodos , Biología Computacional/normas , Exosomas/química , Humanos , Espectrometría de Masas/métodos , Espectrometría de Masas/normas , Proteómica/normas , Reproducibilidad de los Resultados , Ultracentrifugación/métodos , Ultracentrifugación/normas
20.
Nat Commun ; 12(1): 3810, 2021 06 21.
Artículo en Inglés | MEDLINE | ID: mdl-34155216

RESUMEN

To a large extent functional diversity in cells is achieved by the expansion of molecular complexity beyond that of the coding genome. Various processes create multiple distinct but related proteins per coding gene - so-called proteoforms - that expand the functional capacity of a cell. Evaluating proteoforms from classical bottom-up proteomics datasets, where peptides instead of intact proteoforms are measured, has remained difficult. Here we present COPF, a tool for COrrelation-based functional ProteoForm assessment in bottom-up proteomics data. It leverages the concept of peptide correlation analysis to systematically assign peptides to co-varying proteoform groups. We show applications of COPF to protein complex co-fractionation data as well as to more typical protein abundance vs. sample data matrices, demonstrating the systematic detection of assembly- and tissue-specific proteoform groups, respectively, in either dataset. We envision that the presented approach lays the foundation for a systematic assessment of proteoforms and their functional implications directly from bottom-up proteomic datasets.


Asunto(s)
Isoformas de Proteínas/análisis , Proteómica/métodos , Algoritmos , Animales , Benchmarking , Humanos , Ratones , Péptidos/análisis , Péptidos/metabolismo , Isoformas de Proteínas/metabolismo , Proteómica/normas , Espectrometría de Masas en Tándem , Flujo de Trabajo
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA
...