Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 27
Filtrar
1.
J Am Soc Mass Spectrom ; 34(8): 1598-1608, 2023 Aug 02.
Artículo en Inglés | MEDLINE | ID: mdl-37463304

RESUMEN

Mass spectrometry (MS)-based analysis of RNA oligonucleotides (oligos) plays an increasingly important role in the development of RNA therapeutics and epitranscriptomics research. However, MS fragmentation behaviors of RNA oligomers are understood insufficiently. Herein, we characterized the negative-ion-mode fragmentation behaviors of 26 synthetic RNA oligos containing four to eight nucleotides using collision-induced dissociation (CID) on a high-resolution, accurate-mass instrument. We found that in CID spectra acquired under the normalized collision energy (NCE) of 35%, approximately 70% of the total peak intensity was attributed to sequencing ions (a-B, a, b, c, d, w, x, y, z), around 25% of the peak intensity came from precursor ions that experienced complete or partial loss of a nucleobase in the form of either a neutral or an anion, and the remainder were internal ions and anionic nucleobases. The top five sequencing ions were the y, c, w, a-B, and a ions. Furthermore, we observed that CID fragmentation behaviors of RNA oligos were significantly impacted by their precursor charge. Specifically, when the precursors had a charge from 1- to 5-, the fractional intensity of sequencing ions decreased, while that of precursors that underwent either neutral or charged losses of a nucleobase increased. Additionally, we found that RNA oligos containing 3'-U tended to produce precursors with HNCO and/or NCO- losses, which presumably corresponded to isocyanic acid and cyanate anion, respectively. These findings provide valuable insights for better comprehending the mechanism behind RNA fragmentation by MS/MS, thereby facilitating the future automated identification of RNA oligos based on their CID spectra in a more efficient manner.


Asunto(s)
Oligonucleótidos , Espectrometría de Masas en Tándem , Oligonucleótidos/química , Espectrometría de Masas en Tándem/métodos , ARN , Iones/química , Aniones , Espectrometría de Masa por Ionización de Electrospray
2.
J Cardiovasc Pharmacol ; 81(1): 85-92, 2023 01 01.
Artículo en Inglés | MEDLINE | ID: mdl-36027482

RESUMEN

ABSTRACT: Salidroside has anti-inflammatory and antiatherosclerotic effects, and mitochondrial homeostasis imbalance is closely related to cardiovascular disease. The aim of this study was to investigate the effect of salidroside on mitochondrial homeostasis after macrophage polarization and elucidate its possible mechanism against atherosclerosis. RAW264.7 cells were stimulated with 1 µg·mL -1 Lipopolysaccharide and 50 ng·mL -1 IFN-γ establish M1 polarization and were also pretreated with 400 µM salidroside. The relative expression of proinflammatory genes was detected by RT-PCR whereas that of mitochondrial homeostasis-related proteins and nuclear factor kappa-B (NF-κB) was detected by WB. Levels of intracellular reactive oxygen species (ROS), mitochondrial membrane potential, and mass were measured by chemifluorescence whereas that of NF-κB nuclear translocation was detected by immunofluorescence. Compared with the Mφ group, the M1 group demonstrated increased mRNA expression of interleukin-1ß , inductible nitric oxide synthase (iNOS), and tumor necrosis factor-α ; increased protein expression of iNOS, NOD-like receptor protein 3, putative kinase 1 , and NF-κB p65 but decreased protein expression of MFN2, Tom20, and PGC-1α; decreased mitochondrial membrane potential and mass; and increased ROS levels and NF-κB p65 nuclear translocation. Salidroside intervention decreased mRNA expression of interleukin-1ß and tumor necrosis factor-α compared with the M1 group but did not affect that of iNOS. Furthermore, salidroside intervention prevented the changes in protein expression, mitochondrial membrane potential and mass, ROS levels, and NF-κB p65 nuclear translocation observed in the M1 group. In summary, salidroside ultimately inhibits M1 macrophage polarization and maintains mitochondrial homeostasis after macrophage polarization by increasing mitochondrial membrane potential, decreasing ROS levels, inhibiting NF-κB activation, and in turn regulating the expression of proinflammatory factors and mitochondrial homeostasis-associated proteins.


Asunto(s)
FN-kappa B , Factor de Necrosis Tumoral alfa , FN-kappa B/metabolismo , Interleucina-1beta/metabolismo , Factor de Necrosis Tumoral alfa/metabolismo , Especies Reactivas de Oxígeno/metabolismo , Macrófagos , Lipopolisacáridos/farmacología , Óxido Nítrico Sintasa/metabolismo , Homeostasis , ARN Mensajero/metabolismo
3.
Methods Mol Biol ; 2500: 105-129, 2022.
Artículo en Inglés | MEDLINE | ID: mdl-35657590

RESUMEN

The remarkable advancement of top-down proteomics in the past decade is driven by the technological development in separation, mass spectrometry (MS) instrumentation, novel fragmentation, and bioinformatics. However, the accurate identification and quantification of proteoforms, all clearly-defined molecular forms of protein products from a single gene, remain a challenging computational task. This is in part due to the complicated mass spectra from intact proteoforms when compared to those from the digested peptides. Herein, pTop 2.0 is developed to fill in the gap between the large-scale complex top-down MS data and the shortage of high-accuracy bioinformatic tools. Compared with pTop 1.0, the first version, pTop 2.0 concentrates mainly on the identification of the proteoforms with unexpected modifications or a terminal truncation. The quantitation based on isotopic labeling is also a new function, which can be carried out by the convenient and user-friendly "one-key operation," integrated together with the qualitative identifications. The accuracy and running speed of pTop 2.0 is significantly improved on the test data sets. This chapter will introduce the main features, step-by-step running operations, and algorithmic developments of pTop 2.0 in order to push the identification and quantitation of intact proteoforms to a higher-accuracy level in top-down proteomics.


Asunto(s)
Proteoma , Proteómica , Espectrometría de Masas , Proteoma/metabolismo , Proteómica/métodos
4.
Nat Commun ; 10(1): 3911, 2019 09 02.
Artículo en Inglés | MEDLINE | ID: mdl-31477730

RESUMEN

Chemical cross-linking of proteins coupled with mass spectrometry analysis (CXMS) is widely used to study protein-protein interactions (PPI), protein structures, and even protein dynamics. However, structural information provided by CXMS is still limited, partly because most CXMS experiments use lysine-lysine (K-K) cross-linkers. Although superb in selectivity and reactivity, they are ineffective for lysine deficient regions. Herein, we develop aromatic glyoxal cross-linkers (ArGOs) for arginine-arginine (R-R) cross-linking and the lysine-arginine (K-R) cross-linker KArGO. The R-R or K-R cross-links generated by ArGO or KArGO fit well with protein crystal structures and provide information not attainable by K-K cross-links. KArGO, in particular, is highly valuable for CXMS, with robust performance on a variety of samples including a kinase and two multi-protein complexes. In the case of the CNGP complex, KArGO cross-links covered as much of the PPI interface as R-R and K-K cross-links combined and improved the accuracy of Rosetta docking substantially.


Asunto(s)
Arginina/química , Reactivos de Enlaces Cruzados/química , Lisina/química , Espectrometría de Masas/métodos , Proteínas/química , Algoritmos , Arginina/metabolismo , Lisina/metabolismo , Modelos Moleculares , Estructura Molecular , Péptidos/química , Péptidos/metabolismo , Conformación Proteica , Mapas de Interacción de Proteínas , Proteínas/metabolismo
5.
Nat Commun ; 10(1): 3404, 2019 07 30.
Artículo en Inglés | MEDLINE | ID: mdl-31363125

RESUMEN

We describe pLink 2, a search engine with higher speed and reliability for proteome-scale identification of cross-linked peptides. With a two-stage open search strategy facilitated by fragment indexing, pLink 2 is ~40 times faster than pLink 1 and 3~10 times faster than Kojak. Furthermore, using simulated datasets, synthetic datasets, 15N metabolically labeled datasets, and entrapment databases, four analysis methods were designed to evaluate the credibility of ten state-of-the-art search engines. This systematic evaluation shows that pLink 2 outperforms these methods in precision and sensitivity, especially at proteome scales. Lastly, re-analysis of four published proteome-scale cross-linking datasets with pLink 2 required only a fraction of the time used by pLink 1, with up to 27% more cross-linked residue pairs identified. pLink 2 is therefore an efficient and reliable tool for cross-linking mass spectrometry analysis, and the systematic evaluation methods described here will be useful for future software development.


Asunto(s)
Péptidos/química , Proteoma/química , Motor de Búsqueda/métodos , Algoritmos , Animales , Bases de Datos de Proteínas , Humanos , Proteómica , Programas Informáticos
6.
Nat Biotechnol ; 2018 Oct 08.
Artículo en Inglés | MEDLINE | ID: mdl-30295672

RESUMEN

We present a sequence-tag-based search engine, Open-pFind, to identify peptides in an ultra-large search space that includes coeluting peptides, unexpected modifications and digestions. Our method detects peptides with higher precision and speed than seven other search engines. Open-pFind identified 70-85% of the tandem mass spectra in four large-scale datasets and 14,064 proteins, each supported by at least two protein-unique peptides, in a human proteome dataset.

7.
Nat Commun ; 8(1): 438, 2017 09 05.
Artículo en Inglés | MEDLINE | ID: mdl-28874712

RESUMEN

The precise and large-scale identification of intact glycopeptides is a critical step in glycoproteomics. Owing to the complexity of glycosylation, the current overall throughput, data quality and accessibility of intact glycopeptide identification lack behind those in routine proteomic analyses. Here, we propose a workflow for the precise high-throughput identification of intact N-glycopeptides at the proteome scale using stepped-energy fragmentation and a dedicated search engine. pGlyco 2.0 conducts comprehensive quality control including false discovery rate evaluation at all three levels of matches to glycans, peptides and glycopeptides, improving the current level of accuracy of intact glycopeptide identification. The N-glycoproteome of samples metabolically labeled with 15N/13C were analyzed quantitatively and utilized to validate the glycopeptide identification, which could be used as a novel benchmark pipeline to compare different search engines. Finally, we report a large-scale glycoproteome dataset consisting of 10,009 distinct site-specific N-glycans on 1988 glycosylation sites from 955 glycoproteins in five mouse tissues.Protein glycosylation is a heterogeneous post-translational modification that generates greater proteomic diversity that is difficult to analyze. Here the authors describe pGlyco 2.0, a workflow for the precise one step identification of intact N-glycopeptides at the proteome scale.


Asunto(s)
Glicopéptidos/análisis , Proteómica/métodos , Motor de Búsqueda , Espectrometría de Masas en Tándem/métodos , Animales , Isótopos de Carbono , Glicopéptidos/metabolismo , Glicosilación , Ensayos Analíticos de Alto Rendimiento/métodos , Humanos , Masculino , Ratones Endogámicos C57BL , Isótopos de Nitrógeno , Polisacáridos/análisis , Polisacáridos/metabolismo , Procesamiento Proteico-Postraduccional , Control de Calidad , Programas Informáticos , Flujo de Trabajo
8.
J Proteome Res ; 16(2): 645-654, 2017 02 03.
Artículo en Inglés | MEDLINE | ID: mdl-28019094

RESUMEN

De novo peptide sequencing has improved remarkably, but sequencing full-length peptides with unexpected modifications is still a challenging problem. Here we present an open de novo sequencing tool, Open-pNovo, for de novo sequencing of peptides with arbitrary types of modifications. Although the search space increases by ∼300 times, Open-pNovo is close to or even ∼10-times faster than the other three proposed algorithms. Furthermore, considering top-1 candidates on three MS/MS data sets, Open-pNovo can recall over 90% of the results obtained by any one traditional algorithm and report 5-87% more peptides, including 14-250% more modified peptides. On a high-quality simulated data set, ∼85% peptides with arbitrary modifications can be recalled by Open-pNovo, while hardly any results can be recalled by others. In summary, Open-pNovo is an excellent tool for open de novo sequencing and has great potential for discovering unexpected modifications in the real biological applications.


Asunto(s)
Secuencia de Aminoácidos/genética , Péptidos/genética , Procesamiento Proteico-Postraduccional/genética , Algoritmos , Bases de Datos de Proteínas , Análisis de Secuencia de Proteína , Programas Informáticos , Espectrometría de Masas en Tándem
9.
Sci Rep ; 6: 25102, 2016 05 03.
Artículo en Inglés | MEDLINE | ID: mdl-27139140

RESUMEN

Confident characterization of the microheterogeneity of protein glycosylation through identification of intact glycopeptides remains one of the toughest analytical challenges for glycoproteomics. Recently proposed mass spectrometry (MS)-based methods still have some defects such as lack of the false discovery rate (FDR) analysis for the glycan identification and lack of sufficient fragmentation information for the peptide identification. Here we proposed pGlyco, a novel pipeline for the identification of intact glycopeptides by using complementary MS techniques: 1) HCD-MS/MS followed by product-dependent CID-MS/MS was used to provide complementary fragments to identify the glycans, and a novel target-decoy method was developed to estimate the false discovery rate of the glycan identification; 2) data-dependent acquisition of MS3 for some most intense peaks of HCD-MS/MS was used to provide fragments to identify the peptide backbones. By integrating HCD-MS/MS, CID-MS/MS and MS3, intact glycopeptides could be confidently identified. With pGlyco, a standard glycoprotein mixture was analyzed in the Orbitrap Fusion, and 309 non-redundant intact glycopeptides were identified with detailed spectral information of both glycans and peptides.


Asunto(s)
Glicopéptidos/análisis , Espectrometría de Masas/métodos , Flujo de Trabajo
10.
Anal Chem ; 88(6): 3082-90, 2016 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-26844380

RESUMEN

There has been tremendous progress in top-down proteomics (TDP) in the past 5 years, particularly in intact protein separation and high-resolution mass spectrometry. However, bioinformatics to deal with large-scale mass spectra has lagged behind, in both algorithmic research and software development. In this study, we developed pTop 1.0, a novel software tool to significantly improve the accuracy and efficiency of mass spectral data analysis in TDP. The precursor mass offers crucial clues to infer the potential post-translational modifications co-occurring on the protein, the reliability of which relies heavily on its mass accuracy. Concentrating on detecting the precursors more accurately, a machine-learning model incorporating a variety of spectral features was trained online in pTop via a support vector machine (SVM). pTop employs the sequence tags extracted from the MS/MS spectra and a dynamic programming algorithm to accelerate the search speed, especially for those spectra with multiple post-translational modifications. We tested pTop on three publicly available data sets and compared it with ProSight and MS-Align+ in terms of its recall, precision, running time, and so on. The results showed that pTop can, in general, outperform ProSight and MS-Align+. pTop recalled 22% more correct precursors, although it exported 30% fewer precursors than Xtract (in ProSight) from a human histone data set. The running speed of pTop was about 1 to 2 orders of magnitude faster than that of MS-Align+. This algorithmic advancement in pTop, including both accuracy and speed, will inspire the development of other similar software to analyze the mass spectra from the entire proteins.


Asunto(s)
Bases de Datos de Proteínas , Almacenamiento y Recuperación de la Información , Proteínas/análisis , Algoritmos , Aprendizaje Automático , Programas Informáticos
11.
J Proteomics ; 129: 33-41, 2015 Nov 03.
Artículo en Inglés | MEDLINE | ID: mdl-26232248

RESUMEN

Database search is the dominant approach in high-throughput proteomic analysis. However, the interpretation rate of MS/MS spectra is very low in such a restricted mode, which is mainly due to unexpected modifications and irregular digestion types. In this study, we developed a new algorithm called Alioth, to be integrated into the search engine of pFind, for fast and accurate unrestricted database search on high-resolution MS/MS data. An ion index is constructed for both peptide precursors and fragment ions, by which arbitrary digestions and a single site of any modifications and mutations can be searched efficiently. A new re-ranking algorithm is used to distinguish the correct peptide-spectrum matches from random ones. The algorithm is tested on several HCD datasets and the interpretation rate of MS/MS spectra using Alioth is as high as 60%-80%. Peptides from semi- and non-specific digestions, as well as those with unexpected modifications or mutations, can be effectively identified using Alioth and confidently validated using other search engines. The average processing speed of Alioth is 5-10 times faster than some other unrestricted search engines and is comparable to or even faster than the restricted search algorithms tested.This article is part of a Special Issue entitled: Computational Proteomics.

12.
J Proteomics ; 125: 89-97, 2015 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-25979774

RESUMEN

Database search is the dominant approach in high-throughput proteomic analysis. However, the interpretation rate of MS/MS spectra is very low in such a restricted mode, which is mainly due to unexpected modifications and irregular digestion types. In this study, we developed a new algorithm called Alioth, to be integrated into the search engine of pFind, for fast and accurate unrestricted database search on high-resolution MS/MS data. An ion index is constructed for both peptide precursors and fragment ions, by which arbitrary digestions and a single site of any modifications and mutations can be searched efficiently. A new re-ranking algorithm is used to distinguish the correct peptide-spectrum matches from random ones. The algorithm is tested on several HCD datasets and the interpretation rate of MS/MS spectra using Alioth is as high as 60%-80%. Peptides from semi- and non-specific digestions, as well as those with unexpected modifications or mutations, can be effectively identified using Alioth and confidently validated using other search engines. The average processing speed of Alioth is 5-10 times faster than some other unrestricted search engines and is comparable to or even faster than the restricted search algorithms tested.


Asunto(s)
Algoritmos , Bases de Datos de Proteínas , Espectrometría de Masas , Análisis de Secuencia de Proteína/métodos
13.
Curr Protoc Bioinformatics ; 49: 8.21.1-8.21.19, 2015 Mar 09.
Artículo en Inglés | MEDLINE | ID: mdl-25754995

RESUMEN

pLink is a search engine for high-throughput identification of cross-linked peptides from their tandem mass spectra, which is the data-analysis step in chemical cross-linking of proteins coupled with mass spectrometry analysis. pLink has accumulated more than 200 registered users from all over the world since its first release in 2012. After 2 years of continual development, a new version of pLink has been released, which is at least 40 times faster, more versatile, and more user-friendly. Also, the function of the new pLink has been expanded to identifying endogenous protein cross-linking sites such as disulfide bonds and SUMO (Small Ubiquitin-like MOdifier) modification sites. Integrated into the new version are two accessory tools: pLabel, to annotate spectra of cross-linked peptides for visual inspection and publication, and pConfig, to assist users in setting up search parameters. Here, we provide detailed guidance on running a database search for identification of protein cross-links using the 2014 version of pLink.


Asunto(s)
Reactivos de Enlaces Cruzados/química , Péptidos/análisis , Motor de Búsqueda , Guías como Asunto , Internet , Espectrometría de Masas , Interfaz Usuario-Computador
14.
Nat Methods ; 12(4): 329-31, 2015 Apr.
Artículo en Inglés | MEDLINE | ID: mdl-25664544

RESUMEN

We developed a high-throughput mass spectrometry method, pLink-SS (http://pfind.ict.ac.cn/software/pLink/2014/pLink-SS.html), for precise identification of disulfide-linked peptides. Using pLink-SS, we mapped all native disulfide bonds of a monoclonal antibody and ten standard proteins. We performed disulfide proteome analyses and identified 199 disulfide bonds in Escherichia coli and 568 in proteins secreted by human endothelial cells. We discovered many regulatory disulfide bonds involving catalytic or metal-binding cysteine residues.


Asunto(s)
Disulfuros/química , Espectrometría de Masas , Proteoma/química , Proteómica/métodos , Secuencia de Aminoácidos , Escherichia coli/química , Humanos , Modelos Moleculares , Datos de Secuencia Molecular , Biblioteca de Péptidos , Ribonucleasas/química
15.
Anal Chem ; 86(11): 5286-94, 2014 Jun 03.
Artículo en Inglés | MEDLINE | ID: mdl-24799117

RESUMEN

In relative protein abundance determination from peptide intensities recorded in full mass scans, a major complication that affects quantitation accuracy is signal interference from coeluting ions of similar m/z values. Here, we present pQuant, a quantitation software tool that solves this problem. pQuant detects interference signals, identifies for each peptide a pair of least interfered isotopic chromatograms: one for the light and one for the heavy isotope-labeled peptide. On the basis of these isotopic pairs, pQuant calculates the relative heavy/light peptide ratios along with their 99.75% confidence intervals (CIs). From the peptides ratios and their CIs, pQuant estimates the protein ratios and associated CIs by kernel density estimation. We tested pQuant, Census and MaxQuant on data sets obtained from mixtures (at varying mixing ratios from 10:1 to 1:10) of light- and heavy-SILAC labeled HeLa cells or (14)N- and (15)N-labeled Escherichia coli cells. pQuant quantitated more peptides with better accuracy than Census and MaxQuant in all 14 data sets. On the SILAC data sets, the nonquantified "NaN" (not a number) ratios generated by Census, MaxQuant, and pQuant accounted for 2.5-10.7%, 1.8-2.7%, and 0.01-0.5% of all ratios, respectively. On the (14)N/(15)N data sets, which cannot be quantified by MaxQuant, Census and pQuant produced 0.9-10.0% and 0.3-2.9% NaN ratios, respectively. Excluding these NaN results, the standard deviations of the numerical ratios calculated by Census or MaxQuant are 30-100% larger than those by pQuant. These results show that pQuant outperforms Census and MaxQuant in SILAC and (15)N-based quantitation.


Asunto(s)
Péptidos/química , Proteínas/química , Escherichia coli/química , Células HeLa/química , Humanos , Isótopos , Espectrometría de Masas , Isótopos de Nitrógeno , Radioisótopos de Nitrógeno , Programas Informáticos
16.
Mol Cell Proteomics ; 13(1): 360-71, 2014 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-24187338

RESUMEN

The proteome informatics research group of the Association of Biomolecular Resource Facilities conducted a study to assess the community's ability to detect and characterize peptides bearing a range of biologically occurring post-translational modifications when present in a complex peptide background. A data set derived from a mixture of synthetic peptides with biologically occurring modifications combined with a yeast whole cell lysate as background was distributed to a large group of researchers and their results were collectively analyzed. The results from the twenty-four participants, who represented a broad spectrum of experience levels with this type of data analysis, produced several important observations. First, there is significantly more variability in the ability to assess whether a results is significant than there is to determine the correct answer. Second, labile post-translational modifications, particularly tyrosine sulfation, present a challenge for most researchers. Finally, for modification site localization there are many tools being employed, but researchers are currently unsure of the reliability of the results these programs are producing.


Asunto(s)
Péptidos/aislamiento & purificación , Procesamiento Proteico-Postraduccional/genética , Proteoma , Secuencia de Aminoácidos/genética , Mezclas Complejas/química , Mezclas Complejas/genética , Biología Computacional , Humanos , Péptidos/química , Péptidos/metabolismo , Análisis de Secuencia de Proteína
17.
J Proteome Res ; 12(2): 615-25, 2013 Feb 01.
Artículo en Inglés | MEDLINE | ID: mdl-23272783

RESUMEN

De novo peptide sequencing is the only tool for extracting peptide sequences directly from tandem mass spectrometry (MS) data without any protein database. However, neither the accuracy nor the efficiency of de novo sequencing has been satisfactory, mainly due to incomplete fragmentation information in experimental spectra. Recent advancement in MS technology has enabled acquisition of higher energy collisional dissociation (HCD) and electron transfer dissociation (ETD) spectra of the same precursor. These spectra contain complementary fragmentation information and can be collected with high resolution and high mass accuracy. Taking these advantages, we have developed a new algorithm called pNovo+, which greatly improves the accuracy and speed of de novo sequencing. On tryptic peptides, 86% of the topmost candidate sequences deduced by pNovo+ from HCD + ETD spectral pairs matched the database search results, and the success rate reached 95% if the top three candidates were included, which was much higher than using only HCD (87%) or only ETD spectra (57%). On Asp-N, Glu-C, or Elastase digested peptides, 69-87% of the HCD + ETD spectral pairs were correctly identified by pNovo+ among the topmost candidates, or 84-95% among the top three. On average, it takes pNovo+ only 0.018 s to extract the sequence from a spectrum or spectral pair on a common personal computer. This is more than three times as fast as other de novo sequencing programs. The increase of speed is mainly due to pDAG, a component algorithm of pNovo+. pDAG finds the k longest paths in a directed acyclic graph without the antisymmetry restriction. We have verified that the antisymmetry restriction is unnecessary for high resolution, high mass accuracy data. The extensive use of HCD and ETD spectral information and the pDAG algorithm make pNovo+ an excellent de novo sequencing tool.


Asunto(s)
Algoritmos , Péptidos/aislamiento & purificación , Análisis de Secuencia de Proteína/normas , Espectrometría de Masas en Tándem/normas , Secuencia de Aminoácidos , Animales , Bases de Datos de Proteínas , Humanos , Metaloendopeptidasas/química , Datos de Secuencia Molecular , Elastasa Pancreática/química , Péptidos/química , Sensibilidad y Especificidad , Análisis de Secuencia de Proteína/métodos , Serina Endopeptidasas/química , Tripsina/química
18.
Proteomics ; 12(2): 226-35, 2012 Jan.
Artículo en Inglés | MEDLINE | ID: mdl-22106041

RESUMEN

Determining the monoisotopic peak of a precursor is a first step in interpreting mass spectra, which is basic but non-trivial. The reason is that in the isolation window of a precursor, other peaks interfere with the determination of the monoisotopic peak, leading to wrong mass-to-charge ratio or charge state. Here we propose a method, named pParse, to export the most probable monoisotopic peaks for precursors, including co-eluted precursors. We use the relationship between the position of the highest peak and the mass of the first peak to detect candidate clusters. Then, we extract three features to sort the candidate clusters: (i) the sum of the intensity, (ii) the similarity of the experimental and the theoretical isotopic distribution, and (iii) the similarity of elution profiles. We showed that the recall of pParse, MaxQuant, and BioWorks was 98-98.8%, 0.5-17%, and 1.8-36.5% at the same precision, respectively. About 50% of tandem mass spectra are triggered by multiple precursors which are difficult to identify. Then we design a new scoring function to identify the co-eluted precursors. About 26% of all identified peptides were exclusively from co-eluted peptides. Therefore, accurately determining monoisotopic peaks, including co-eluted precursors, can greatly increase peptide identification rate.


Asunto(s)
Péptidos/análisis , Proteómica/métodos , Programas Informáticos , Espectrometría de Masas en Tándem/métodos , Algoritmos , Células HeLa/química , Humanos , Péptidos/química , Precursores de Proteínas/análisis , Precursores de Proteínas/química , Reproducibilidad de los Resultados , Motor de Búsqueda , Sensibilidad y Especificidad , Factores de Tiempo , Levaduras/química
19.
Int J Mol Sci ; 12(4): 2556-71, 2011.
Artículo en Inglés | MEDLINE | ID: mdl-21731458

RESUMEN

Nuclear magnetic resonance/liquid chromatography-mass spectroscopy parallel dynamic spectroscopy (NMR/LC-MS PDS) is a method aimed at the simultaneous structural identification of natural products in complex mixtures. In this study, the method is illustrated with respect to (1)H NMR and rapid resolution liquid chromatography-mass spectroscopy (RRLC-MS) data, acquired from the crude extract of Anoectochilus roxburghii, which was separated into a series of fractions with the concentration of constituent dynamic variation using reversed-phase preparative chromatography. Through fraction ranges and intensity changing profiles in (1)H NMR/RRLC-MS PDS spectrum, (1)H NMR and the extracted ion chromatogram (XIC) signals deriving from the same individual constituent, were correlated due to the signal amplitude co-variation resulting from the concentration variation of constituents in a series of incompletely separated fractions. 1H NMR/RRLC-MS PDS was then successfully used to identify three types of natural products, including eight flavonoids, four organic acids and p-hydroxybenzaldehyde, five of which have not previously been reported in Anoectochilus roxburghii. In addition, two groups of co-eluted compounds were successfully identified. The results prove that this approach should be of benefit in the unequivocal structural determination of a variety of classes of compounds from extremely complex mixtures, such as herbs and biological samples, which will lead to improved efficiency in the identification of new potential lead compounds.


Asunto(s)
Productos Biológicos/química , Orchidaceae/química , Extractos Vegetales/química , Productos Biológicos/aislamiento & purificación , Cromatografía Líquida de Alta Presión , Espectroscopía de Resonancia Magnética , Espectrometría de Masas , Orchidaceae/metabolismo , Extractos Vegetales/aislamiento & purificación
20.
Mol Cell Proteomics ; 10(5): M110.000455, 2011 May.
Artículo en Inglés | MEDLINE | ID: mdl-21321130

RESUMEN

Identification of proteins and their modifications via liquid chromatography-tandem mass spectrometry is an important task for the field of proteomics. However, because of the complexity of tandem mass spectra, the majority of the spectra cannot be identified. The presence of unanticipated protein modifications is among the major reasons for the low spectral identification rate. The conventional database search approach to protein identification has inherent difficulties in comprehensive detection of protein modifications. In recent years, increasing efforts have been devoted to developing unrestrictive approaches to modification identification, but they often suffer from their lack of speed. This paper presents a statistical algorithm named DeltAMT (Delta Accurate Mass and Time) for fast detection of abundant protein modifications from tandem mass spectra with high-accuracy precursor masses. The algorithm is based on the fact that the modified and unmodified versions of a peptide are usually present simultaneously in a sample and their spectra are correlated with each other in precursor masses and retention times. By representing each pair of spectra as a delta mass and time vector, bivariate Gaussian mixture models are used to detect modification-related spectral pairs. Unlike previous approaches to unrestrictive modification identification that mainly rely upon the fragment information and the mass dimension in liquid chromatography-tandem mass spectrometry, the proposed algorithm makes the most of precursor information. Thus, it is highly efficient while being accurate and sensitive. On two published data sets, the algorithm effectively detected various modifications and other interesting events, yielding deep insights into the data. Based on these discoveries, the spectral identification rates were significantly increased and many modified peptides were identified.


Asunto(s)
Algoritmos , Cromatografía Liquida/métodos , Procesamiento Proteico-Postraduccional , Proteoma/química , Espectrometría de Masas en Tándem/métodos , Cromatografía Liquida/normas , Interpretación Estadística de Datos , Bases de Datos de Proteínas , Proteínas Fúngicas/química , Células HeLa , Humanos , Peso Molecular , Estándares de Referencia , Espectrometría de Masas en Tándem/normas
SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA