RESUMEN
PIWI-interacting RNAs (piRNAs) play critical and conserved roles in transposon silencing and gene regulation in the animal germline. Three distinct piRNA populations are present during mouse spermatogenesis: fetal piRNAs in fetal/perinatal testes, pre-pachytene and pachytene piRNAs in postnatal testes. PNLDC1 is required for piRNA 3' end maturation in multiple species. However, whether PNLDC1 is the bona fide piRNA trimmer and the physiological role of 3' trimming of different piRNA populations in spermatogenesis in mammals remain unclear. Here, by inactivating Pnldc1 exonuclease activity in vitro and in mice, we reveal that the PNLDC1 trimmer activity is essential for spermatogenesis and male fertility. PNLDC1 catalytic activity is required for both fetal and postnatal piRNA 3' end trimming. Despite this, postnatal piRNA trimming but not fetal piRNA trimming is critical for LINE1 transposon silencing. Furthermore, conditional inactivation of Pnldc1 in postnatal germ cells causes LINE1 transposon de-repression and spermatogenic arrest in mice, indicating that germline-specific postnatal piRNA trimming is essential for transposon silencing and germ cell development. Our findings highlight the germ cell-intrinsic role of PNLDC1 and piRNA trimming in mammals to safeguard the germline genome and promote fertility.
Asunto(s)
Silenciador del Gen , Elementos de Nucleótido Esparcido Largo , ARN Interferente Pequeño , Espermatogénesis , Testículo , Animales , Espermatogénesis/genética , Masculino , ARN Interferente Pequeño/genética , ARN Interferente Pequeño/metabolismo , Ratones , Elementos de Nucleótido Esparcido Largo/genética , Testículo/metabolismo , Testículo/crecimiento & desarrollo , Células Germinativas/metabolismo , Células Germinativas/crecimiento & desarrollo , Elementos Transponibles de ADN/genética , Fertilidad/genética , ARN de Interacción con PiwiRESUMEN
Intracellular recognition of lipopolysaccharide (LPS) by mouse caspase-11 or human caspase-4 is a vital event for the activation of the noncanonical inflammasome. Whether negative regulators are involved in intracellular LPS sensing is still elusive. Here we show that adipose triglyceride lipase (ATGL) is a negative regulator of the noncanonical inflammasome. Through screening for genes participating in the noncanonical inflammasome, ATGL is identified as a negative player for intracellular LPS signaling. ATGL binds LPS and catalyzes the removal of the acylated side chains that contain ester bonds. LPS with under-acylated side chains no longer activates the inflammatory caspases. Cells with ATGL deficiency exhibit enhanced immune responses when encountering intracellular LPS, including an elevated secretion of interleukin-1ß, decreased cell viability and increased cell cytotoxicity. Moreover, ATGL-deficient mice show exacerbated responses to endotoxin challenges. Our results uncover that ATGL degrades cytosolic LPS to suppress noncanonical inflammasome activation.
Asunto(s)
Inflamasomas , Lipasa , Lipopolisacáridos , Animales , Humanos , Ratones , Caspasas Iniciadoras/metabolismo , Supervivencia Celular/efectos de los fármacos , Hidrólisis , Inflamasomas/metabolismo , Interleucina-1beta/metabolismo , Lipasa/metabolismo , Lipasa/genética , Lipopolisacáridos/farmacología , Ratones Endogámicos C57BL , Ratones NoqueadosRESUMEN
The 2nd CASMS conference was held virtually through Gather. Town platform from October 17 to 21, 2022, with a total of 363 registrants including an outstanding and diverse group of scientists at the forefront of their research fields from both academia and industry worldwide, especially in the United States and China. The conference offered a 5-day agenda with an exciting scientific program consisting of two plenary lectures, 14 parallel symposia, and 4 special sessions in which a total of 97 invited speakers presented technological innovations and their applications in proteomics & biological mass spectrometry and metabo-lipidomics & pharmaceutical mass spectrometry. In addition, 18 invited speakers/panelists presented at 3 research-focused and 2 career development workshops. Moreover, 144 posters, 54 lightning talks, 5 sponsored workshops, and 14 exhibitions were presented, from which 20 posters and 8 lightning talks received presentation awards. Furthermore, the conference featured 1 MCP lectureship and 5 young investigator awardees for the first time to highlight outstanding mid-career and early-career rising stars in mass spectrometry from our society. The conference provided a unique scientific platform for young scientists (i.e., graduate students, postdocs and junior faculty/investigators) to present their research, meet with prominent scientists, and learn about career development and job opportunities (http://casms.org).
Asunto(s)
Espectrometría de Masas , Sociedades Científicas , Humanos , China , Preparaciones Farmacéuticas , Proteómica , Estados UnidosRESUMEN
Defining the denatured state ensemble (DSE) and disordered proteins is essential to understanding folding, chaperone action, degradation, and translocation. As compared with water-soluble proteins, the DSE of membrane proteins is much less characterized. Here, we measure the DSE of the helical membrane protein GlpG of Escherichia coli (E. coli) in native-like lipid bilayers. The DSE was obtained using our steric trapping method, which couples denaturation of doubly biotinylated GlpG to binding of two streptavidin molecules. The helices and loops are probed using limited proteolysis and mass spectrometry, while the dimensions are determined using our paramagnetic biotin derivative and double electron-electron resonance spectroscopy. These data, along with our Upside simulations, identify the DSE as being highly dynamic, involving the topology changes and unfolding of some of the transmembrane (TM) helices. The DSE is expanded relative to the native state but only to 15 to 75% of the fully expanded condition. The degree of expansion depends on the local protein packing and the lipid composition. E. coli's lipid bilayer promotes the association of TM helices in the DSE and, probably in general, facilitates interhelical interactions. This tendency may be the outcome of a general lipophobic effect of proteins within the cell membranes.
Asunto(s)
Membrana Dobles de Lípidos/química , Membrana Dobles de Lípidos/metabolismo , Proteínas de la Membrana/química , Proteínas de la Membrana/metabolismo , Conformación Proteica , Biotinilación , Membrana Celular , Microscopía por Crioelectrón , Proteínas de Unión al ADN , Endopeptidasas , Escherichia coli , Proteínas de Escherichia coli/química , Modelos Moleculares , Desnaturalización Proteica , Pliegue de Proteína , EstreptavidinaRESUMEN
Characterization of histone proteoforms with various post-translational modifications (PTMs) is critical for a better understanding of functions of histone proteoforms in epigenetic control of gene expression. Mass spectrometry (MS)-based top-down proteomics (TDP) is a valuable approach for delineating histone proteoforms because it can provide us with a bird's-eye view of histone proteoforms carrying diverse combinations of PTMs. Here, we present the first example of coupling capillary zone electrophoresis (CZE), ion mobility spectrometry (IMS), and MS for online multi-dimensional separations of histone proteoforms. Our CZE-high-field asymmetric waveform IMS (FAIMS)-MS/MS platform identified 366 (ProSight PD) and 602 (TopPIC) histone proteoforms from a commercial calf histone sample using a low microgram amount of histone sample as the starting material. CZE-FAIMS-MS/MS improved the number of histone proteoform identifications by about 3 folds compared to CZE-MS/MS alone (without FAIMS). The results indicate that CZE-FAIMS-MS/MS could be a useful tool for comprehensive characterization of histone proteoforms with high sensitivity.
Asunto(s)
Histonas , Espectrometría de Masas en Tándem , Espectrometría de Masas en Tándem/métodos , Espectrometría de Movilidad Iónica , Procesamiento Proteico-Postraduccional , Electroforesis Capilar/métodosRESUMEN
The protein corona, a layer of biomolecules forming around nanoparticles in biological environments, critically influences nanoparticle interactions with biosystems, affecting pharmacokinetics and biological outcomes. Initially, the protein corona presented challenges for nanomedicine and nanotoxicology, such as nutrient depletion in cell cultures and masking of nanoparticle-targeting species. However, recent advancements have highlighted its potential in environmental toxicity, proteomics, and immunology. This viewpoint focuses on leveraging the protein corona to enhance the depth of plasma proteome analysis, addressing challenges posed by the high dynamic range of protein concentrations in plasma. The protein corona simplifies sample preparation, enriches low-abundance proteins, and improves proteome coverage. Innovations include using diverse nanoparticles and spiking small molecules to increase the number of quantified proteins. Reproducibility issues across core facilities necessitate standardized protocols. Moreover, top-down proteomics enables proteoform-specific measurements, providing deeper insights into protein corona composition. Future research should aim at improving top-down proteomics techniques and integrating protein corona studies and proteomics for personalized medicine and advanced diagnostics.
RESUMEN
Mass spectrometry (MS)-based top-down proteomics (TDP) analysis of histone proteoforms provides critical information about combinatorial post-translational modifications (PTMs), which is vital for pursuing a better understanding of epigenetic regulation of gene expression. It requires high-resolution separations of histone proteoforms before MS and tandem MS (MS/MS) analysis. In this work, for the first time, we combined SDS-PAGE-based protein fractionation (passively eluting proteins from polyacrylamide gels as intact species for mass spectrometry, PEPPI-MS) with capillary zone electrophoresis (CZE)-MS/MS for high-resolution characterization of histone proteoforms. We systematically studied the histone proteoform extraction from SDS-PAGE gel and follow-up cleanup as well as CZE-MS/MS, to determine an optimal procedure. The optimal procedure showed reproducible and high-resolution separation and characterization of histone proteoforms. SDS-PAGE separated histone proteins (H1, H2, H3, and H4) based on their molecular weight and CZE provided additional separations of proteoforms of each histone protein based on their electrophoretic mobility, which was affected by PTMs, for example, acetylation and phosphorylation. Using the technique, we identified over 200 histone proteoforms from a commercial calf thymus histone sample with good reproducibility. The orthogonal and high-resolution separations of SDS-PAGE and CZE made our technique attractive for the delineation of histone proteoforms extracted from complex biological systems.
Asunto(s)
Electroforesis Capilar , Electroforesis en Gel de Poliacrilamida , Histonas , Procesamiento Proteico-Postraduccional , Proteómica , Espectrometría de Masas en Tándem , Histonas/análisis , Espectrometría de Masas en Tándem/métodos , Electroforesis Capilar/métodos , Proteómica/métodos , Electroforesis en Gel de Poliacrilamida/métodos , Animales , HumanosRESUMEN
Abnormal accumulation of tau protein in the brain is one pathological hallmark of Alzheimer's disease (AD). Many tau protein post-translational modifications (PTMs) are associated with the development of AD, such as phosphorylation, acetylation, and methylation. Therefore, a complete picture of the PTM landscape of tau is critical for understanding the molecular mechanisms of AD progression. Here, we offered a pilot study of combining two complementary analytical techniques, capillary zone electrophoresis (CZE)-tandem mass spectrometry (MS/MS) and reversed-phase liquid chromatography (RPLC)-MS/MS, for bottom-up proteomics of recombinant human tau-0N3R. We identified 50 phosphorylation sites of tau-0N3R in total, which is about 25% higher than that from RPLC-MS/MS alone. CZE-MS/MS provided more PTM sites (i.e., phosphorylation) and modified peptides of tau-0N3R than RPLC-MS/MS, and its predicted electrophoretic mobility helped improve the confidence of the identified modified peptides. We developed a highly efficient capillary isoelectric focusing (cIEF)-MS technique to offer a bird's-eye view of tau-0N3R proteoforms, with 11 putative tau-0N3R proteoforms carrying up to nine phosphorylation sites and lower pI values from more phosphorylated proteoforms detected. Interestingly, under native-like cIEF-MS conditions, we observed three putative tau-0N3R dimers carrying phosphate groups. The findings demonstrate that CE-MS is a valuable analytical technique for the characterization of tau PTMs, proteoforms, and even oligomerization.
Asunto(s)
Electroforesis Capilar , Procesamiento Proteico-Postraduccional , Espectrometría de Masas en Tándem , Proteínas tau , Proteínas tau/metabolismo , Proteínas tau/química , Proteínas tau/análisis , Electroforesis Capilar/métodos , Humanos , Proyectos Piloto , Espectrometría de Masas en Tándem/métodos , Fosforilación , Enfermedad de Alzheimer/metabolismo , Cromatografía de Fase Inversa/métodos , Proteómica/métodos , Focalización Isoeléctrica/métodos , Proteínas Recombinantes/química , Proteínas Recombinantes/metabolismo , AcetilaciónRESUMEN
Mass spectrometry (MS)-based top-down proteomics (TDP) has revolutionized biological research by measuring intact proteoforms in cells, tissues, and biofluids. Capillary zone electrophoresis-tandem MS (CZE-MS/MS) is a valuable technique for TDP, offering a high peak capacity and sensitivity for proteoform separation and detection. However, the long-term reproducibility of CZE-MS/MS in TDP remains unstudied, which is a crucial aspect for large-scale studies. This work investigated the long-term qualitative and quantitative reproducibility of CZE-MS/MS for TDP for the first time, focusing on a yeast cell lysate. Over 1000 proteoforms were identified per run across 62 runs using one linear polyacrylamide (LPA)-coated separation capillary, highlighting the robustness of the CZE-MS/MS technique. However, substantial decreases in proteoform intensity and identification were observed after some initial runs due to proteoform adsorption onto the capillary inner wall. To address this issue, we developed an efficient capillary cleanup procedure using diluted ammonium hydroxide, achieving high qualitative and quantitative reproducibility for the yeast sample across at least 23 runs. The data underscore the capability of CZE-MS/MS for large-scale quantitative TDP of complex samples, signaling its readiness for deployment in broad biological applications. The MS RAW files were deposited in ProteomeXchange Consortium with the data set identifier of PXD046651.
Asunto(s)
Proteoma , Espectrometría de Masas en Tándem , Espectrometría de Masas en Tándem/métodos , Proteoma/análisis , Saccharomyces cerevisiae/química , Proteómica/métodos , Proyectos Piloto , Reproducibilidad de los Resultados , Electroforesis Capilar/métodos , Proteínas de Unión al ADNRESUMEN
Multilevel proteomics aims to delineate proteins at the peptide (bottom-up proteomics), proteoform (top-down proteomics), and protein complex (native proteomics) levels. Capillary electrophoresis-mass spectrometry (CE-MS) can achieve highly efficient separation and highly sensitive detection of complex mixtures of peptides, proteoforms, and even protein complexes because of its substantial technical progress. CE-MS has become a valuable alternative to the routinely used liquid chromatography-mass spectrometry for multilevel proteomics. This review summarizes the most recent (2019-2021) advances of CE-MS for multilevel proteomics regarding technological progress and biological applications. We also provide brief perspectives on CE-MS for multilevel proteomics at the end, highlighting some future directions and potential challenges.
Asunto(s)
Proteínas , Proteómica , Proteómica/métodos , Espectrometría de Masas/métodos , Proteínas/análisis , Péptidos , Electroforesis Capilar/métodosRESUMEN
We present an approach to estimate the operational distinguishability between an entangled state and any separable state directly from measuring an entanglement witness. We show that this estimation also implies bounds on a variety of other well-known entanglement quantifiers. This approach for entanglement estimation is then extended to both the measurement-device-independent scenario and the fully device-independent scenario, where we obtain nontrivial but suboptimal bounds. The procedure requires no numerical optimization and is easy to compute. It offers ways for experimenters to not only detect, but also quantify, entanglement from the standard entanglement witness procedure.
RESUMEN
Following the highly successful Chinese American Society for Mass Spectrometry (CASMS) conferences in the previous 2 years, the 3rd CASMS Conference was held virtually on August 28-31, 2023, using the Gather.Town platform to bring together scientists in the MS field. The conference offered a 4-day agenda with a scientific program consisting of two plenary lectures, and 14 parallel symposia in which a total of 70 speakers presented technological innovations and their applications in proteomics and biological MS and metabo-lipidomics and pharmaceutical MS. In addition, 16 invited speakers/panelists presented at two research-focused and three career development workshops. Moreover, 86 posters, 12 lightning talks, 3 sponsored workshops, and 11 exhibitions were presented, from which 9 poster awards and 2 lightning talk awards were selected. Furthermore, the conference featured four young investigator awardees to highlight early-career achievements in MS from our society. The conference provided a unique scientific platform for young scientists (i.e. graduate students, postdocs, and junior faculty/investigators) to present their research, meet with prominent scientists, learn about career development, and job opportunities (http://casms.org).
Asunto(s)
Espectrometría de Masas , Lipidómica , Preparaciones Farmacéuticas , Proteómica , Congresos como AsuntoRESUMEN
Native proteomics measures endogenous proteoforms and protein complexes under a near physiological condition using native mass spectrometry (nMS) coupled with liquid-phase separations. Native proteomics should provide the most accurate bird's-eye view of proteome dynamics within cells, which is fundamental for understanding almost all biological processes. nMS has been widely employed to characterize well-purified protein complexes. However, there are only very few trials of utilizing nMS to measure proteoforms and protein complexes in a complex sample (i.e., a whole cell lysate). Here, we pioneer the native proteomics measurement of large proteoforms or protein complexes up to 400â kDa from a complex proteome via online coupling of native capillary zone electrophoresis (nCZE) to an ultra-high mass range (UHMR) Orbitrap mass spectrometer. The nCZE-MS technique enabled the measurement of a 115-kDa standard protein complex while consuming only about 0.1â ng of protein material. nCZE-MS analysis of an E.coli cell lysate detected 72â proteoforms or protein complexes in a mass range of 30-400â kDa in a single run while consuming only 50-ng protein material. The mass distribution of detected proteoforms or protein complexes agreed well with that from mass photometry measurement. This work represents a technical breakthrough in native proteomics for measuring complex proteomes.
RESUMEN
We present a large-scale top-down proteomics (TDP) study of plant leaf and chloroplast proteins, achieving the identification of over 4700 unique proteoforms. Using capillary zone electrophoresis coupled with tandem mass spectrometry analysis of offline size-exclusion chromatography fractions, we identify 3198 proteoforms for total leaf and 1836 proteoforms for chloroplast, with 1024 and 363 proteoforms having post-translational modifications, respectively. The electrophoretic mobility prediction of capillary zone electrophoresis allowed us to validate post-translational modifications that impact the charge state such as acetylation and phosphorylation. Identified modifications included Trp (di)oxidation events on six chloroplast proteins that may represent novel targets of singlet oxygen sensing. Furthermore, our TDP data provides direct experimental evidence of the N- and C-terminal residues of numerous mature proteoforms from chloroplast, mitochondria, endoplasmic reticulum, and other sub-cellular localizations. With this information, we suggest true transit peptide cleavage sites and correct sub-cellular localization signal predictions. This large-scale analysis illustrates the power of top-down proteoform identification of post-translational modifications and intact sequences that can benefit our understanding of both the structure and function of hundreds of plant proteins.
Asunto(s)
Arabidopsis , Proteoma , Proteoma/análisis , Arabidopsis/metabolismo , Proteómica/métodos , Espectrometría de Masas en Tándem/métodos , Procesamiento Proteico-Postraduccional , Proteínas de Unión al ADN/metabolismoRESUMEN
Capillary zone electrophoresis-tandem mass spectrometry (CZE-MS/MS) has emerged as an essential technique for top-down proteomics (TDP), providing superior separation efficiency and high detection sensitivity for proteoform analysis. Here, we aimed to further enhance the performance of CZE-MS/MS for TDP via coupling online gas-phase proteoform fractionation using high-field asymmetric waveform ion mobility spectrometry (FAIMS). When the compensation voltage (CV) of FAIMS was changed from -50 to 30 V, the median mass of identified proteoforms increased from less than 10 kDa to about 30 kDa, suggesting that FAIMS can efficiently fractionate proteoforms by their size. CZE-FAIMS-MS/MS boosted the number of proteoform identifications from a yeast sample by nearly 3-fold relative to CZE-MS/MS alone. It particularly benefited the identification of relatively large proteoforms, improving the number of proteoforms in a mass range of 20-45 kDa by 6-fold compared to CZE-MS/MS alone. FAIMS fractionation gained nearly 20-fold better signal-to-noise ratios of randomly selected proteoforms than no FAIMS. We expect that CZE-FAIMS-MS/MS will be a useful tool for further advancing the sensitivity and coverage of TDP. This work shows the first example of coupling CE with ion mobility spectrometry (IMS) for TDP.
Asunto(s)
Proteómica , Espectrometría de Masas en Tándem , Espectrometría de Masas en Tándem/métodos , Proteómica/métodos , Espectrometría de Movilidad Iónica , Saccharomyces cerevisiae , Electroforesis Capilar/métodos , Proteínas de Unión al ADNRESUMEN
Top-down liquid chromatography-mass spectrometry (LC-MS) analyzes intact proteoforms and generates mass spectra containing peaks of proteoforms with various isotopic compositions, charge states, and retention times. An essential step in top-down MS data analysis is proteoform feature detection, which aims to group these peaks into peak sets (features), each containing all peaks of a proteoform. Accurate protein feature detection enhances the accuracy in MS-based proteoform identification and quantification. Here, we present TopFD, a software tool for top-down MS feature detection that integrates algorithms for proteoform feature detection, feature boundary refinement, and machine learning models for proteoform feature evaluation. We performed extensive benchmarking of TopFD, ProMex, FlashDeconv, and Xtract using seven top-down MS data sets and demonstrated that TopFD outperforms other tools in feature accuracy, reproducibility, and feature abundance reproducibility.
Asunto(s)
Proteoma , Proteómica , Proteómica/métodos , Reproducibilidad de los Resultados , Proteoma/análisis , Espectrometría de Masas , Programas InformáticosRESUMEN
Mass spectrometry (MS)-based top-down characterization of integral membrane proteins (IMPs) is crucial for understanding their functions in biological processes. However, it is technically challenging due to their low solubility in typical MS-compatible buffers. In this work, for the first time, we developed an efficient capillary zone electrophoresis (CZE)-tandem MS (MS/MS) method for the top-down proteomics (TDP) of IMPs enriched from mouse brains. Our technique employs a sample buffer containing 30% (v/v) formic acid and 60% (v/v) methanol for solubilizing IMPs and utilizes a separation buffer of 30% (v/v) acetic acid and 30% (v/v) methanol for maintaining the solubility of IMPs during CZE separation. Single-shot CZE-MS/MS identified 51 IMP proteoforms from the mouse brain sample. Coupling size exclusion chromatography (SEC) to CZE-MS/MS enabled the identification of 276 IMP proteoforms from the mouse brain sample containing 1-4 transmembrane domains. This proof-of-concept work demonstrates the high potential of CZE-MS/MS for the large-scale TDP of IMPs.
Asunto(s)
Metanol , Espectrometría de Masas en Tándem , Animales , Ratones , Proteómica , Electroforesis Capilar , Proteínas de la Membrana , Encéfalo , Proteínas de Unión al ADNRESUMEN
Protein phosphorylation is a vital and common post-translational modification (PTM) in cells, modulating various biological processes and diseases. Comprehensive top-down proteomics of phosphorylated proteoforms (phosphoproteoforms) in cells and tissues is essential for a better understanding of the roles of protein phosphorylation in fundamental biological processes and diseases. Mass spectrometry (MS)-based top-down proteomics of phosphoproteoforms remains challenging due to their relatively low abundance. Herein, we investigated magnetic nanoparticle-based immobilized metal affinity chromatography (IMAC, Ti4+, and Fe3+) for selective enrichment of phosphoproteoforms for MS-based top-down proteomics. The IMAC method achieved reproducible and highly efficient enrichment of phosphoproteoforms from simple and complex protein mixtures. It outperformed one commercial phosphoprotein enrichment kit regarding the capture efficiency and recovery of phosphoproteins. Reversed-phase liquid chromatography (RPLC)-tandem mass spectrometry (MS/MS) analyses of yeast cell lysates after IMAC (Ti4+ or Fe3+) enrichment produced roughly 100% more phosphoproteoform identifications compared to without IMAC enrichment. Importantly, the phosphoproteoforms identified after Ti4+-IMAC or Fe3+-IMAC enrichment correspond to proteins with much lower overall abundance compared to that identified without the IMAC treatment. We also revealed that Ti4+-IMAC and Fe3+-IMAC could enrich different pools of phosphoproteoforms from complex proteomes and the combination of those two methods will be useful for further improving the phosphoproteoform coverage from complex samples. The results clearly demonstrate the value of our magnetic nanoparticle-based Ti4+-IMAC and Fe3+-IMAC for advancing top-down MS characterization of phosphoproteoforms in complex biological systems.
Asunto(s)
Nanopartículas de Magnetita , Espectrometría de Masas en Tándem , Espectrometría de Masas en Tándem/métodos , Proteómica/métodos , Nanopartículas de Magnetita/análisis , Proyectos Piloto , Cromatografía de Afinidad/métodos , Fosfoproteínas , Fosfopéptidos/análisisRESUMEN
Reversed-phase liquid chromatography (RPLC) and capillary zone electrophoresis (CZE) are two primary proteoform separation methods in mass spectrometry (MS)-based top-down proteomics. Proteoform retention time (RT) prediction in RPLC and migration time (MT) prediction in CZE provide additional information for accurate proteoform identification and quantification. While existing methods are mainly focused on peptide RT and MT prediction in bottom-up MS, there is still a lack of methods for proteoform RT and MT prediction in top-down MS. We systematically evaluated eight machine learning models and a transfer learning method for proteoform RT prediction and five models and the transfer learning method for proteoform MT prediction. Experimental results showed that a gated recurrent unit (GRU)-based model with transfer learning achieved a high accuracy (R = 0.978) for proteoform RT prediction and that the GRU-based model and a fully connected neural network model obtained a high accuracy of R = 0.982 and 0.981 for proteoform MT prediction, respectively.
Asunto(s)
Proteómica , Espectrometría de Masas en Tándem , Cromatografía de Fase Inversa , Electroforesis Capilar/métodos , Aprendizaje Automático , Proteoma/análisis , Proteómica/métodos , Espectrometría de Masas en Tándem/métodosRESUMEN
Protein complexes are the functional machines in the cell and are heterogeneous due to protein sequence variations and post-translational modifications (PTMs). Here, we present an automated nondenaturing capillary isoelectric focusing-mass spectrometry (ncIEF-MS) methodology for uncovering the microheterogeneity of intact protein complexes. The method exhibited superior separation resolution for protein complexes than conventional native capillary zone electrophoresis (nCZE-MS). In our study, ncIEF-MS achieved liquid-phase separations and MS characterization of seven different forms of a streptavidin homotetramer with variations of N-terminal methionine removal, acetylation, and formylation and four forms of the carbonic anhydrase-zinc complex arising from variations of PTMs (succinimide, deamidation, etc.). In addition, ncIEF-MS resolved different states of an interchain cysteine-linked antibody-drug conjugate (ADC1) as a new class of anticancer therapeutic agents that bears a distribution of varied drug-to-antibody ratio (DAR) species. More importantly, ncIEF-MS enabled precise measurements of isoelectric points (pIs) of protein complexes, which reflect the surface electrostatic properties of protein complexes. We studied how protein sequence variations/PTMs modulate the pIs of protein complexes and how drug loading affects the pIs of antibodies. We discovered that keeping the N-terminal methionine residue of one subunit of the streptavidin homotetramer decreased its pI by 0.1, adding one acetyl group onto the streptavidin homotetramer reduced its pI by nearly 0.4, incorporating one formyl group onto the streptavidin homotetramer reduced its pI by around 0.3, and loading two more drug molecules on one ADC1 molecule increased its pI by 0.1. The data render the ncIEF-MS method a valuable tool for delineating protein complexes.