RÉSUMÉ
The search engine processor (SEPro) is a tool for filtering, organizing, sharing, and displaying peptide spectrum matches. It employs a novel three-tier Bayesian approach that uses layers of spectrum, peptide, and protein logic to lead the data to converge to a single list of reliable protein identifications. SEPro is integrated into the PatternLab for proteomics environment, where an arsenal of tools for analyzing shotgun proteomic data is provided. By using the semi-labeled decoy approach for benchmarking, we show that SEPro significantly outperforms a commercially available competitor.
Sujet(s)
Algorithmes , Bases de données de protéines , Fragments peptidiques/composition chimique , Protéomique/méthodes , Logiciel , Animaux , Théorème de Bayes , Chromatographie en phase liquide , Systèmes de gestion de bases de données , Souris , Protéines/composition chimique , Protéines/classification , Spectrométrie de masse en tandem , Interface utilisateurRÉSUMÉ
The decoy-database approach is currently the gold standard for assessing the confidence of identifications in shotgun proteomic experiments. Here, we demonstrate that what might appear to be a good result under the decoy-database approach for a given false-discovery rate could be, in fact, the product of overfitting. This problem has been overlooked until now and could lead to obtaining boosted identification numbers whose reliability does not correspond to the expected false-discovery rate. To overcome this, we are introducing a modified version of the method, termed a semi-labeled decoy approach, which enables the statistical determination of an overfitted result.
Sujet(s)
Biologie informatique , Protéomique/normes , Découverte de médicament/normesRÉSUMÉ
SUMMARY: XDIA is a computational strategy for analyzing multiplexed spectra acquired using electron transfer dissociation and collision-activated dissociation; it significantly increases identified spectra (approximately 250%) and unique peptides (approximately 30%) when compared with the data-dependent ETCaD analysis on middle-down, single-phase shotgun proteomic analysis. Increasing identified spectra and peptides improves quantitation statistics confidence and protein coverage, respectively. AVAILABILITY: The software and data produced in this work are freely available for academic use at http://fields.scripps.edu/XDIA CONTACT: paulo@pcarvalho.com SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Sujet(s)
Protéomique/méthodes , Logiciel , Algorithmes , Bases de données factuellesRÉSUMÉ
UNLABELLED: YADA can deisotope and decharge high-resolution mass spectra from large peptide molecules, link the precursor monoisotopic peak information to the corresponding tandem mass spectrum, and account for different co-fragmenting ion species (multiplexed spectra). We describe how YADA enables a pipeline consisting of ProLuCID and DTASelect for analyzing large-scale middle-down proteomics data. AVAILABILITY: http://fields.scripps.edu/yada
Sujet(s)
Biologie informatique/méthodes , Spectrométrie de masse/méthodes , Peptides/composition chimique , Protéomique/méthodes , Logiciel , AlgorithmesRÉSUMÉ
Electron transfer dissociation (ETD) can dissociate highly charged ions. Efficient analysis of ions dissociated with ETD requires accurate determination of charge states for calculation of molecular weight. We created an algorithm to assign the charge state of ions often used for ETD. The program, Charge Prediction Machine (CPM), uses Bayesian decision theory to account for different charge reduction processes encountered in ETD and can also handle multiplex spectra. CPM correctly assigned charge states to 98% of the 13,097 MS2 spectra from a combined data set of four experiments. In a comparison between CPM and a competing program, Charger (ThermoFisher), CPM produced half the mistakes.