Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 8 de 8
Filter
Add more filters











Database
Language
Publication year range
1.
J Proteome Res ; 18(9): 3429-3438, 2019 09 06.
Article in English | MEDLINE | ID: mdl-31378069

ABSTRACT

Peptides detected by tandem mass spectrometry (MS/MS) in bottom-up proteomics serve as proxies for the proteins expressed in the sample. Protein inference is a process routinely applied to these peptides to generate a plausible list of candidate protein identifications. The use of multiple proteases for parallel protein digestions expands sequence coverage, provides additional peptide identifications, and increases the probability of identifying peptides that are unique to a single protein, which are all valuable for protein inference. We have developed and implemented a multi-protease protein inference algorithm in MetaMorpheus, a bottom-up search software program, which incorporates the calculation of protease-specific q-values and preserves the association of peptide sequences and their protease of origin. This integrated multi-protease protein inference algorithm provides more accurate results than either the aggregation of results from the separate analysis of the peptide identifications produced by each protease (separate approach) in MetaMorpheus, or results that are obtained using Fido, ProteinProphet, or DTASelect2. MetaMorpheus' integrated multi-protease data analysis decreases the ambiguity of the protein group list, reduces the frequency of erroneous identifications, and increases the number of post-translational modifications identified, while combining multi-protease search and protein inference into a single software program.


Subject(s)
Proteins/isolation & purification , Proteomics , Software , Tandem Mass Spectrometry/methods , Algorithms , Amino Acid Sequence/genetics , Databases, Protein , Peptide Hydrolases/chemistry , Peptide Hydrolases/isolation & purification , Peptides/chemistry , Peptides/isolation & purification , Proteins/chemistry
2.
J Proteome Res ; 18(1): 349-358, 2019 01 04.
Article in English | MEDLINE | ID: mdl-30346791

ABSTRACT

Post-translationally spliced peptides have recently garnered significant interest as potential targets for cancer immunotherapy and as contributors to autoimmune diseases such as type 1 diabetes, yet feasible identification methods for spliced peptides have yet to be developed. Here we present Neo-Fusion, a search program for discovering spliced peptides in tandem mass spectrometry data. Neo-Fusion utilizes two separated ion database searches to identify the two halves of each spliced peptide, and then it infers the full spliced sequence. This strategy allows for the identification of spliced peptides without peptide length constraints, providing a broadly applicable tool suitable for identification of spliced peptides in a variety of systems, such as the HLA-I and HLA-II immunopeptidomes and in vitro digested protein samples obtained from organelles, cells, or tissues of interest. Using simulated spliced peptides to benchmark Neo-Fusion, 25% of all simulated spliced peptides were identified at a measured false-discovery rate of 5% for HLA-I. Neo-Fusion provides the research community with a powerful new tool to aid in the study of the prevalence and biological significance of post-translationally spliced peptides.


Subject(s)
Peptides/analysis , Protein Processing, Post-Translational , Software , Tandem Mass Spectrometry/methods , Amino Acid Sequence , Histocompatibility Antigens Class I/analysis , Humans , Proteolysis
3.
J Proteome Res ; 17(7): 2370-2376, 2018 07 06.
Article in English | MEDLINE | ID: mdl-29793340

ABSTRACT

Protein chemical cross-linking combined with mass spectrometry has become an important technique for the analysis of protein structure and protein-protein interactions. A variety of cross-linkers are well developed, but reliable, rapid, and user-friendly tools for large-scale analysis of cross-linked proteins are still in need. Here we report MetaMorpheusXL, a new search module within the MetaMorpheus software suite that identifies both MS-cleavable and noncleavable cross-linked peptides in MS data. MetaMorpheusXL identifies MS-cleavable cross-linked peptides with an ion-indexing algorithm, which enables an efficient large database search. The identification does not require the presence of signature fragment ions, an advantage compared with similar programs such as XlinkX. One complication associated with the need for signature ions from cleavable cross-linkers such as DSSO (disuccinimidyl sulfoxide) is the requirement for multiple fragmentation types and energy combinations, which is not necessary for MetaMorpheusXL. The ability to perform proteome-wide analysis is another advantage of MetaMorpheusXL compared with programs such as MeroX and DXMSMS. MetaMorpheusXL is also faster than other currently available MS-cleavable cross-link search software programs. It is imbedded in MetaMorpheus, an open-source and freely available software suite that provides a reliable, fast, user-friendly graphical user interface that is readily accessible to researchers.


Subject(s)
Algorithms , Cross-Linking Reagents/chemistry , Peptides/analysis , Tandem Mass Spectrometry/methods , Databases as Topic , Peptides/chemistry , Proteome/analysis , Software
4.
J Proteome Res ; 17(5): 1844-1851, 2018 05 04.
Article in English | MEDLINE | ID: mdl-29578715

ABSTRACT

Correct identification of protein post-translational modifications (PTMs) is crucial to understanding many aspects of protein function in biological processes. G-PTM-D is a recently developed technique for global identification and localization of PTMs. Spectral file calibration prior to applying G-PTM-D, and algorithmic enhancements in the peptide database search significantly increase the accuracy, speed, and scope of PTM identification. We enhance G-PTM-D by using multinotch searches and demonstrate its effectiveness in identification of numerous types of PTMs including high-mass modifications such as glycosylations. The changes described in this work lead to a 20% increase in the number of identified modifications and an order of magnitude decrease in search time. The complete workflow is implemented in MetaMorpheus, a software tool that integrates the database search procedure, identification of coisolated peptides, spectral calibration, and the enhanced G-PTM-D workflow. Multinotch searches are also shown to be useful in contexts other than G-PTM-D by producing superior results when used instead of standard narrow-window and open database searches.


Subject(s)
Algorithms , Databases, Protein , Protein Processing, Post-Translational , Software/standards , Animals , Glycosylation , Humans , Methods , Proteomics/methods , Workflow
5.
J Proteome Res ; 17(3): 1321-1325, 2018 03 02.
Article in English | MEDLINE | ID: mdl-29397739

ABSTRACT

The Consortium for Top-Down Proteomics (CTDP) proposes a standardized notation, ProForma, for writing the sequence of fully characterized proteoforms. ProForma provides a means to communicate any proteoform by writing the amino acid sequence using standard one-letter notation and specifying modifications or unidentified mass shifts within brackets following certain amino acids. The notation is unambiguous, human-readable, and can easily be parsed and written by bioinformatic tools. This system uses seven rules and supports a wide range of possible use cases, ensuring compatibility and reproducibility of proteoform annotations. Standardizing proteoform sequences will simplify storage, comparison, and reanalysis of proteomic studies, and the Consortium welcomes input and contributions from the research community on the continued design and maintenance of this standard.


Subject(s)
Computational Biology/methods , Protein Processing, Post-Translational , Proteome/analysis , Proteomics/methods , Software , Tandem Mass Spectrometry/standards , Amino Acid Sequence , Computational Biology/statistics & numerical data , Databases, Protein/statistics & numerical data , Humans , Information Dissemination , International Cooperation , Molecular Sequence Annotation , Proteome/genetics , Proteome/metabolism , Proteomics/statistics & numerical data , Reproducibility of Results , Tandem Mass Spectrometry/methods
6.
Anal Chem ; 90(2): 1325-1333, 2018 01 16.
Article in English | MEDLINE | ID: mdl-29227670

ABSTRACT

In top-down proteomics, intact proteins are analyzed by tandem mass spectrometry and proteoforms, which are defined forms of a protein with specific sequences of amino acids and localized post-translational modifications, are identified using precursor mass and fragmentation data. Many proteoforms that are detected in the precursor scan (MS1) are not selected for fragmentation by the instrument and therefore remain unidentified in typical top-down proteomic workflows. Our laboratory has developed the open source software program Proteoform Suite to analyze MS1-only intact proteoform data. Here, we have adapted it to provide identifications of proteoform masses in precursor MS1 spectra of top-down data, supplementing the top-down identifications obtained using the MS2 fragmentation data. Proteoform Suite performs mass calibration using high-scoring top-down identifications and identifies additional proteoforms using calibrated, accurate intact masses. Proteoform families, the set of proteoforms from a given gene, are constructed and visualized from proteoforms identified by both top-down and intact-mass analyses. Using this strategy, we constructed proteoform families and identified 1861 proteoforms in yeast lysate, yielding an approximately 40% increase over the original 1291 proteoform identifications observed using traditional top-down analysis alone.


Subject(s)
Mass Spectrometry/methods , Proteome/analysis , Proteomics/methods , Saccharomyces cerevisiae Proteins/analysis , Saccharomyces cerevisiae/chemistry , Software
7.
J Proteome Res ; 17(1): 568-578, 2018 01 05.
Article in English | MEDLINE | ID: mdl-29195273

ABSTRACT

We present an open-source, interactive program named Proteoform Suite that uses proteoform mass and intensity measurements from complex biological samples to identify and quantify proteoforms. It constructs families of proteoforms derived from the same gene, assesses proteoform function using gene ontology (GO) analysis, and enables visualization of quantified proteoform families and their changes. It is applied here to reveal systemic proteoform variations in the yeast response to salt stress.


Subject(s)
Proteomics/methods , Software , Fungal Proteins/analysis , Fungal Proteins/drug effects , Gene Ontology , Mass Spectrometry , Salts/pharmacology , Stress, Physiological/drug effects
8.
J Proteome Res ; 17(1): 386-391, 2018 01 05.
Article in English | MEDLINE | ID: mdl-29083185

ABSTRACT

The rapid and accurate quantification of peptides is a critical element of modern proteomics that has become increasingly challenging as proteomic data sets grow in size and complexity. We present here FlashLFQ, a computer program for high-speed label-free quantification of peptides following a search of bottom-up mass spectrometry data. FlashLFQ is approximately an order of magnitude faster than established label-free quantification methods. The increased speed makes it practical to base quantification upon all of the charge states for a given peptide rather than solely upon the charge state that was selected for MS2 fragmentation. This increases the number of quantified peptides, improves replicate-to-replicate reproducibility, and increases quantitative accuracy. We integrated FlashLFQ into the graphical user interface of the MetaMorpheus search software, allowing it to work together with the global post-translational modification discovery (G-PTM-D) engine to accurately quantify modified peptides. FlashLFQ is also available as a NuGet package, facilitating its integration into other software, and as a standalone command line software program for the quantification of search results from other programs (e.g., MaxQuant).


Subject(s)
Peptides/analysis , Software , Mass Spectrometry , Proteomics/methods , Reproducibility of Results , Time Factors , User-Computer Interface
SELECTION OF CITATIONS
SEARCH DETAIL