Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 8 de 8
1.
iScience ; 27(6): 109944, 2024 Jun 21.
Article En | MEDLINE | ID: mdl-38784018

Maternal-to-zygotic transition (MZT) is central to early embryogenesis. However, its underlying molecular mechanisms are still not well described. Here, we revealed the expression dynamics of 5,000 proteins across four stages of zebrafish embryos during MZT, representing one of the most systematic surveys of proteome landscape of the zebrafish embryos during MZT. Nearly 700 proteins were differentially expressed and were divided into six clusters according to their expression patterns. The proteome expression profiles accurately reflect the main events that happen during the MZT, i.e., zygotic genome activation (ZGA), clearance of maternal mRNAs, and initiation of cellular differentiation and organogenesis. MZT is modulated by many proteins at multiple levels in a collaborative fashion, i.e., transcription factors, histones, histone-modifying enzymes, RNA helicases, and P-body proteins. Significant discrepancies were discovered between zebrafish proteome and transcriptome profiles during the MZT. The proteome dynamics database will be a valuable resource for bettering our understanding of MZT.

2.
bioRxiv ; 2024 Apr 09.
Article En | MEDLINE | ID: mdl-38645171

Top-down mass spectrometry is widely used for proteoform identification, characterization, and quantification owing to its ability to analyze intact proteoforms. In the last decade, top-down proteomics has been dominated by top-down data-dependent acquisition mass spectrometry (TD-DDA-MS), and top-down data-independent acquisition mass spectrometry (TD-DIA-MS) has not been well studied. While TD-DIA-MS produces complex multiplexed tandem mass spectrometry (MS/MS) spectra, which are challenging to confidently identify, it selects more precursor ions for MS/MS analysis and has the potential to increase proteoform identifications compared with TD-DDA-MS. Here we present TopDIA, the first software tool for proteoform identification by TD-DIA-MS. It generates demultiplexed pseudo MS/MS spectra from TD-DIA-MS data and then searches the pseudo MS/MS spectra against a protein sequence database for proteoform identification. We compared the performance of TD-DDA-MS and TD-DIA-MS using Escherichia coli K-12 MG1655 cells and demonstrated that TD-DIA-MS with TopDIA increased proteoform and protein identifications compared with TD-DDA-MS.

3.
Anal Chem ; 95(21): 8189-8196, 2023 05 30.
Article En | MEDLINE | ID: mdl-37196155

Top-down liquid chromatography-mass spectrometry (LC-MS) analyzes intact proteoforms and generates mass spectra containing peaks of proteoforms with various isotopic compositions, charge states, and retention times. An essential step in top-down MS data analysis is proteoform feature detection, which aims to group these peaks into peak sets (features), each containing all peaks of a proteoform. Accurate protein feature detection enhances the accuracy in MS-based proteoform identification and quantification. Here, we present TopFD, a software tool for top-down MS feature detection that integrates algorithms for proteoform feature detection, feature boundary refinement, and machine learning models for proteoform feature evaluation. We performed extensive benchmarking of TopFD, ProMex, FlashDeconv, and Xtract using seven top-down MS data sets and demonstrated that TopFD outperforms other tools in feature accuracy, reproducibility, and feature abundance reproducibility.


Proteome , Proteomics , Proteomics/methods , Reproducibility of Results , Proteome/analysis , Mass Spectrometry , Software
4.
Nucleic Acids Res ; 49(W1): W510-W515, 2021 07 02.
Article En | MEDLINE | ID: mdl-33999207

PERCEPTRON is a next-generation freely available web-based proteoform identification and characterization platform for top-down proteomics (TDP). PERCEPTRON search pipeline brings together algorithms for (i) intact protein mass tuning, (ii) de novo sequence tags-based filtering, (iii) characterization of terminal as well as post-translational modifications, (iv) identification of truncated proteoforms, (v) in silico spectral comparison, and (vi) weight-based candidate protein scoring. High-throughput performance is achieved through the execution of optimized code via multiple threads in parallel, on graphics processing units (GPUs) using NVidia Compute Unified Device Architecture (CUDA) framework. An intuitive graphical web interface allows for setting up of search parameters as well as for visualization of results. The accuracy and performance of the tool have been validated on several TDP datasets and against available TDP software. Specifically, results obtained from searching two published TDP datasets demonstrate that PERCEPTRON outperforms all other tools by up to 135% in terms of reported proteins and 10-fold in terms of runtime. In conclusion, the proposed tool significantly enhances the state-of-the-art in TDP search software and is publicly available at https://perceptron.lums.edu.pk. Users can also create in-house deployments of the tool by building code available on the GitHub repository (http://github.com/BIRL/Perceptron).


Proteomics/methods , Software , Algorithms , Protein Processing, Post-Translational , Workflow
5.
Anal Chem ; 92(11): 7778-7785, 2020 06 02.
Article En | MEDLINE | ID: mdl-32356965

Top-down mass spectrometry has become the main method for intact proteoform identification, characterization, and quantitation. Because of the complexity of top-down mass spectrometry data, spectral deconvolution is an indispensable step in spectral data analysis, which groups spectral peaks into isotopic envelopes and extracts monoisotopic masses of precursor or fragment ions. The performance of spectral deconvolution methods relies heavily on their scoring functions, which distinguish correct envelopes from incorrect ones. A good scoring function increases the accuracy of deconvoluted masses reported from mass spectra. In this paper, we present EnvCNN, a convolutional neural network-based model for evaluating isotopic envelopes. We show that the model outperforms other scoring functions in distinguishing correct envelopes from incorrect ones and that it increases the number of identifications and improves the statistical significance of identifications in top-down spectral interpretation.


Neural Networks, Computer , Ovarian Neoplasms/pathology , Animals , Brain , Databases, Protein , Female , Humans , Machine Learning , Mass Spectrometry , Zebrafish
6.
Sci Rep ; 9(1): 11267, 2019 08 02.
Article En | MEDLINE | ID: mdl-31375721

Top-Down Proteomics (TDP) is an emerging proteomics protocol that involves identification, characterization, and quantitation of intact proteins using high-resolution mass spectrometry. TDP has an edge over other proteomics protocols in that it allows for: (i) accurate measurement of intact protein mass, (ii) high sequence coverage, and (iii) enhanced identification of post-translational modifications (PTMs). However, the complexity of TDP spectra poses a significant impediment to protein search and PTM characterization. Furthermore, limited software support is currently available in the form of search algorithms and pipelines. To address this need, we propose 'SPECTRUM', an open-architecture and open-source toolbox for TDP data analysis. Its salient features include: (i) MS2-based intact protein mass tuning, (ii) de novo peptide sequence tag analysis, (iii) propensity-driven PTM characterization, (iv) blind PTM search, (v) spectral comparison, (vi) identification of truncated proteins, (vii) multifactorial coefficient-weighted scoring, and (viii) intuitive graphical user interfaces to access the aforementioned functionalities and visualization of results. We have validated SPECTRUM using published datasets and benchmarked it against salient TDP tools. SPECTRUM provides significantly enhanced protein identification rates (91% to 177%) over its contemporaries. SPECTRUM has been implemented in MATLAB, and is freely available along with its source code and documentation at https://github.com/BIRL/SPECTRUM/.


Algorithms , Proteomics/methods , Software , Databases, Protein , Datasets as Topic , HeLa Cells , Humans , Molecular Weight , Protein Isoforms/chemistry , Protein Isoforms/isolation & purification , Protein Isoforms/metabolism , Protein Processing, Post-Translational , Proteome/chemistry , Proteome/isolation & purification , Proteome/metabolism , Sequence Analysis, Protein/methods
7.
J Am Soc Mass Spectrom ; 30(12): 2470-2479, 2019 Dec.
Article En | MEDLINE | ID: mdl-31073891

Capillary zone electrophoresis (CZE)-tandem mass spectrometry (MS/MS) has been recognized as an efficient approach for top-down proteomics recently for its high-capacity separation and highly sensitive detection of proteoforms. However, the commonly used collision-based dissociation methods often cannot provide extensive fragmentation of proteoforms for thorough characterization. Activated ion electron transfer dissociation (AI-ETD), that combines infrared photoactivation concurrent with ETD, has shown better performance for proteoform fragmentation than higher energy-collisional dissociation (HCD) and standard ETD. Here, we present the first application of CZE-AI-ETD on an Orbitrap Fusion Lumos mass spectrometer for large-scale top-down proteomics of Escherichia coli (E. coli) cells. CZE-AI-ETD outperformed CZE-ETD regarding proteoform and protein identifications (IDs). CZE-AI-ETD reached comparable proteoform and protein IDs with CZE-HCD. CZE-AI-ETD tended to generate better expectation values (E values) of proteoforms than CZE-HCD and CZE-ETD, indicating a higher quality of MS/MS spectra from AI-ETD respecting the number of sequence-informative fragment ions generated. CZE-AI-ETD showed great reproducibility regarding the proteoform and protein IDs with relative standard deviations less than 4% and 2% (n = 3). Coupling size exclusion chromatography (SEC) to CZE-AI-ETD identified 3028 proteoforms and 387 proteins from E. coli cells with 1% spectrum level and 5% proteoform-level false discovery rates. The data represents the largest top-down proteomics dataset using the AI-ETD method so far. Single-shot CZE-AI-ETD of one SEC fraction identified 957 proteoforms and 253 proteins. N-terminal truncations, signal peptide cleavage, N-terminal methionine removal, and various post-translational modifications including protein N-terminal acetylation, methylation, S-thiolation, disulfide bonds, and lysine succinylation were detected.


Electrophoresis, Capillary/methods , Proteomics/methods , Tandem Mass Spectrometry/methods , Electron Transport , Escherichia coli/chemistry , Escherichia coli Proteins/analysis , Protein Processing, Post-Translational
8.
J Am Soc Mass Spectrom ; 30(8): 1435-1445, 2019 Aug.
Article En | MEDLINE | ID: mdl-30972727

Capillary zone electrophoresis-electrospray ionization-tandem mass spectrometry (CZE-ESI-MS/MS) has attracted attention recently for top-down proteomics because it can achieve highly efficient separation and very sensitive detection of proteins. However, separation window and sample loading volume of CZE need to be boosted for a better proteome coverage using CZE-MS/MS. Here, we present an improved CZE-MS/MS system that achieved a 180-min separation window and a 2-µL sample loading volume for top-down characterization of protein mixtures. The system obtained highly efficient separation of proteins with nearly one million theoretical plates for myoglobin and enabled baseline separation of three different proteoforms of myoglobin. The CZE-MS/MS system identified 797 ± 21 proteoforms and 258 ± 7 proteins (n = 2) from an Escherichia coli (E. coli) proteome sample in a single run with only 250 ng of proteins injected. The system still identified 449 ± 40 proteoforms and 173 ± 6 proteins (n = 2) from the E. coli sample when only 25 ng of proteins were injected per run. Single-shot CZE-MS/MS analyses of zebrafish brain cerebellum (Cb) and optic tectum (Teo) regions identified 1730 ± 196 proteoforms (n = 3) and 2024 ± 255 proteoforms (n = 3), respectively, with only 500-ng proteins loaded per run. Label-free quantitative top-down proteomics of zebrafish brain Cb and Teo regions revealed significant differences between Cb and Teo regarding the proteoform abundance. Over 700 proteoforms from 131 proteins had significantly higher abundance in Cb compared to Teo, and these proteins were highly enriched in several biological processes, including muscle contraction, glycolytic process, and mesenchyme migration. Graphical Abstract.


Proteins/analysis , Proteomics/methods , Animals , Cattle , Electrophoresis, Capillary/methods , Escherichia coli/chemistry , Escherichia coli Proteins/analysis , Horses , Myoglobin/analysis , Proteome/analysis , Spectrometry, Mass, Electrospray Ionization/methods , Tandem Mass Spectrometry/methods
...