RESUMEN
Brown adipose tissue (BAT) is rich in mitochondria and plays important roles in energy expenditure, thermogenesis, and glucose homeostasis. We find that levels of mitochondrial protein succinylation and malonylation are high in BAT and subject to physiological and genetic regulation. BAT-specific deletion of Sirt5, a mitochondrial desuccinylase and demalonylase, results in dramatic increases in global protein succinylation and malonylation. Mass spectrometry-based quantification of succinylation reveals that Sirt5 regulates the key thermogenic protein in BAT, UCP1. Mutation of the two succinylated lysines in UCP1 to acyl-mimetic glutamine and glutamic acid significantly decreases its stability and activity. The reduced function of UCP1 and other proteins in Sirt5KO BAT results in impaired mitochondria respiration, defective mitophagy, and metabolic inflexibility. Thus, succinylation of UCP1 and other mitochondrial proteins plays an important role in BAT and in regulation of energy homeostasis.
Asunto(s)
Metabolismo Energético/genética , Mitocondrias/metabolismo , Obesidad/genética , Sirtuinas/genética , Proteína Desacopladora 1/genética , Tejido Adiposo Pardo/metabolismo , Tejido Adiposo Pardo/patología , Animales , Regulación de la Expresión Génica , Glucosa/metabolismo , Ratones , Ratones Noqueados , Mitocondrias/genética , Proteínas Mitocondriales/genética , Obesidad/metabolismo , Obesidad/patología , Proteómica/métodos , Ácido Succínico/metabolismo , Termogénesis/genética , Proteína Desacopladora 1/metabolismoRESUMEN
Noninvasive detection of protein biomarkers in plasma is crucial for clinical purposes. Liquid chromatography-mass spectrometry (LC-MS) is the gold standard technique for plasma proteome analysis, but despite recent advances, it remains limited by throughput, cost, and coverage. Here, we introduce a new hybrid method that integrates direct infusion shotgun proteome analysis (DISPA) with nanoparticle (NP) protein corona enrichment for high-throughput and efficient plasma proteomic profiling. We realized over 280 protein identifications in 1.4 min collection time, which enables a potential throughput of approximately 1000 samples daily. The identified proteins are involved in valuable pathways, and 44 of the proteins are FDA-approved biomarkers. The robustness and quantitative accuracy of this method were evaluated across multiple NPs and concentrations with a mean coefficient of variation of 17%. Moreover, different protein corona profiles were observed among various NPs based on their distinct surface modifications, and all NP protein profiles exhibited deeper coverage and better quantification than neat plasma. Our streamlined workflow merges coverage and throughput with precise quantification, leveraging both DISPA and NP protein corona enrichment. This underscores the significant potential of DISPA when paired with NP sample preparation techniques for plasma proteome studies.
Asunto(s)
Proteínas Sanguíneas , Nanopartículas , Corona de Proteínas , Proteoma , Proteómica , Humanos , Proteínas Sanguíneas/análisis , Proteínas Sanguíneas/química , Nanopartículas/química , Corona de Proteínas/química , Corona de Proteínas/análisis , Proteoma/análisis , Proteómica/métodos , Cromatografía Liquida/métodos , Espectrometría de Masas/métodos , Biomarcadores/sangreRESUMEN
The coevolution of liquid chromatography (LC) with mass spectrometry (MS) has shaped contemporary proteomics. LC hyphenated to MS now enables quantification of more than 10,000 proteins in a single injection, a number that likely represents most proteins in specific human cells or tissues. Separations by ion mobility spectrometry (IMS) have recently emerged to complement LC and further improve the depth of proteomics. Given the theoretical advantages in speed and robustness of IMS in comparison to LC, we envision that ongoing improvements to IMS paired with MS may eventually make LC obsolete, especially when combined with targeted or simplified analyses, such as rapid clinical proteomics analysis of defined biomarker panels. In this perspective, we describe the need for faster analysis that might drive this transition, the current state of direct infusion proteomics, and discuss some technical challenges that must be overcome to fully complete the transition to entirely gas phase proteomics.
Asunto(s)
Espectrometría de Movilidad Iónica , Proteómica , Proteómica/métodos , Espectrometría de Movilidad Iónica/métodos , Humanos , Cromatografía Liquida/métodos , Espectrometría de Masas/métodos , Ensayos Analíticos de Alto Rendimiento/métodosRESUMEN
BACKGROUND: Descending thoracic aortic aneurysms and dissections can go undetected until severe and catastrophic, and few clinical indices exist to screen for aneurysms or predict risk of dissection. METHODS: This study generated a plasma proteomic dataset from 75 patients with descending type B dissection (Type B) and 62 patients with descending thoracic aortic aneurysm (DTAA). Standard statistical approaches were compared to supervised machine learning (ML) algorithms to distinguish Type B from DTAA cases. Quantitatively similar proteins were clustered based on linkage distance from hierarchical clustering and ML models were trained with uncorrelated protein lists across various linkage distances with hyperparameter optimization using fivefold cross validation. Permutation importance (PI) was used for ranking the most important predictor proteins of ML classification between disease states and the proteins among the top 10 PI protein groups were submitted for pathway analysis. RESULTS: Of the 1,549 peptides and 198 proteins used in this study, no peptides and only one protein, hemopexin (HPX), were significantly different at an adjusted p < 0.01 between Type B and DTAA cases. The highest performing model on the training set (Support Vector Classifier) and its corresponding linkage distance (0.5) were used for evaluation of the test set, yielding a precision-recall area under the curve of 0.7 to classify between Type B from DTAA cases. The five proteins with the highest PI scores were immunoglobulin heavy variable 6-1 (IGHV6-1), lecithin-cholesterol acyltransferase (LCAT), coagulation factor 12 (F12), HPX, and immunoglobulin heavy variable 4-4 (IGHV4-4). All proteins from the top 10 most important groups generated the following significantly enriched pathways in the plasma of Type B versus DTAA patients: complement activation, humoral immune response, and blood coagulation. CONCLUSIONS: We conclude that ML may be useful in differentiating the plasma proteome of highly similar disease states that would otherwise not be distinguishable using statistics, and, in such cases, ML may enable prioritizing important proteins for model prediction.
RESUMEN
BACKGROUND & AIMS: The consumption of sugar and a high-fat diet (HFD) promotes the development of obesity and metabolic dysfunction. Despite their well-known synergy, the mechanisms by which sugar worsens the outcomes associated with a HFD are largely elusive. METHODS: Six-week-old, male, C57Bl/6 J mice were fed either chow or a HFD and were provided with regular, fructose- or glucose-sweetened water. Moreover, cultured AML12 hepatocytes were engineered to overexpress ketohexokinase-C (KHK-C) using a lentivirus vector, while CRISPR-Cas9 was used to knockdown CPT1α. The cell culture experiments were complemented with in vivo studies using mice with hepatic overexpression of KHK-C and in mice with liver-specific CPT1α knockout. We used comprehensive metabolomics, electron microscopy, mitochondrial substrate phenotyping, proteomics and acetylome analysis to investigate underlying mechanisms. RESULTS: Fructose supplementation in mice fed normal chow and fructose or glucose supplementation in mice fed a HFD increase KHK-C, an enzyme that catalyzes the first step of fructolysis. Elevated KHK-C is associated with an increase in lipogenic proteins, such as ACLY, without affecting their mRNA expression. An increase in KHK-C also correlates with acetylation of CPT1α at K508, and lower CPT1α protein in vivo. In vitro, KHK-C overexpression lowers CPT1α and increases triglyceride accumulation. The effects of KHK-C are, in part, replicated by a knockdown of CPT1α. An increase in KHK-C correlates negatively with CPT1α protein levels in mice fed sugar and a HFD, but also in genetically obese db/db and lipodystrophic FIRKO mice. Mechanistically, overexpression of KHK-C in vitro increases global protein acetylation and decreases levels of the major cytoplasmic deacetylase, SIRT2. CONCLUSIONS: KHK-C-induced acetylation is a novel mechanism by which dietary fructose augments lipogenesis and decreases fatty acid oxidation to promote the development of metabolic complications. IMPACT AND IMPLICATIONS: Fructose is a highly lipogenic nutrient whose negative consequences have been largely attributed to increased de novo lipogenesis. Herein, we show that fructose upregulates ketohexokinase, which in turn modifies global protein acetylation, including acetylation of CPT1a, to decrease fatty acid oxidation. Our findings broaden the impact of dietary sugar beyond its lipogenic role and have implications on drug development aimed at reducing the harmful effects attributed to sugar metabolism.
Asunto(s)
Carnitina O-Palmitoiltransferasa , Hígado , Masculino , Ratones , Animales , Carnitina O-Palmitoiltransferasa/genética , Carnitina O-Palmitoiltransferasa/metabolismo , Carnitina O-Palmitoiltransferasa/farmacología , Acetilación , Hígado/metabolismo , Obesidad/metabolismo , Glucosa/metabolismo , Dieta Alta en Grasa/efectos adversos , Ácidos Grasos/metabolismo , Fructosa/metabolismo , Fructoquinasas/genética , Fructoquinasas/metabolismoRESUMEN
Large-scale proteome analysis requires rapid and high-throughput analytical methods. We recently reported a new paradigm in proteome analysis where direct infusion and ion mobility are used instead of liquid chromatography (LC) to achieve rapid and high-throughput proteome analysis. Here, we introduce an improved direct infusion shotgun proteome analysis protocol including label-free quantification (DISPA-LFQ) using CsoDIAq software. With CsoDIAq analysis of DISPA data, we can now identify up to â¼2000 proteins from the HeLa and 293T proteomes, and with DISPA-LFQ, we can quantify â¼1000 proteins from no more than 1 µg of sample within minutes. The identified proteins are involved in numerous valuable pathways including central carbon metabolism, nucleic acid replication and transport, protein synthesis, and endocytosis. Together with a high-throughput sample preparation method in a 96-well plate, we further demonstrate the utility of this technology for performing high-throughput drug analysis in human 293T cells. The total time for data collection from a whole 96-well plate is approximately 8 h. We conclude that the DISPA-LFQ strategy presents a valuable tool for fast identification and quantification of proteins in complex mixtures, which will power a high-throughput proteomic era of drug screening, biomarker discovery, and clinical analysis.
Asunto(s)
Proteoma , Proteómica , Humanos , Proteoma/análisis , Proteómica/métodos , Cromatografía Liquida/métodos , Programas InformáticosRESUMEN
Identification and proteomic characterization of rare cell types within complex organ-derived cell mixtures is best accomplished by label-free quantitative mass spectrometry. High throughput is required to rapidly survey hundreds to thousands of individual cells to adequately represent rare populations. Here we present parallelized nanoflow dual-trap single-column liquid chromatography (nanoDTSC) operating at 15 min of total run time per cell with peptides quantified over 11.5 min using standard commercial components, thus offering an accessible and efficient LC solution to analyze 96 single cells per day. At this throughput, nanoDTSC quantified over 1000 proteins in individual cardiomyocytes and heterogeneous populations of single cells from the aorta.
Asunto(s)
Proteínas , Proteómica , Proteómica/métodos , Cromatografía Liquida/métodos , Proteínas/química , Péptidos/química , Espectrometría de Masas/métodosRESUMEN
Liquid chromatography-mass spectrometry (LC-MS) delivers sensitive peptide analysis for proteomics but requires extensive analysis time, reducing throughput. Here, we demonstrate that gas-phase peptide separation instead of LC enables fast proteome analysis. Using direct infusion-shotgun proteome analysis (DI-SPA) by data-independent acquisition mass spectrometry (DIA-MS), we demonstrate the targeted quantification of over 500 proteins within minutes of MS data collection (~3.5 proteins per second). We show the utility of this technology in performing a complex multifactorial proteomic study of interactions between nutrients, genotype and mitochondrial toxins in a collection of cultured human cells. More than 45,000 quantitative protein measurements from 132 samples were achieved in only ~4.4 h of MS data collection. Enabling fast, unbiased proteome quantification without LC, DI-SPA offers an approach to boost throughput, critical to drug and biomarker discovery studies that require analysis of thousands of proteomes.
Asunto(s)
Cromatografía de Gases y Espectrometría de Masas/métodos , Proteoma/análisis , Proteómica/métodos , Células A549 , Línea Celular Tumoral , Perfilación de la Expresión Génica/métodos , Células HEK293 , Humanos , Células MCF-7RESUMEN
MOTIVATION: Cells respond to environments by regulating gene expression to exploit resources optimally. Recent advances in technologies allow for measuring the abundances of RNA, proteins, lipids and metabolites. These highly complex datasets reflect the states of the different layers in a biological system. Multi-omics is the integration of these disparate methods and data to gain a clearer picture of the biological state. Multi-omic studies of the proteome and metabolome are becoming more common as mass spectrometry technology continues to be democratized. However, knowledge extraction through the integration of these data remains challenging. RESULTS: Connections between molecules in different omic layers were discovered through a combination of machine learning and model interpretation. Discovered connections reflected protein control (ProC) over metabolites. Proteins discovered to control citrate were mapped onto known genetic and metabolic networks, revealing that these protein regulators are novel. Further, clustering the magnitudes of ProC over all metabolites enabled the prediction of five gene functions, each of which was validated experimentally. Two uncharacterized genes, YJR120W and YDL157C, were accurately predicted to modulate mitochondrial translation. Functions for three incompletely characterized genes were also predicted and validated, including SDH9, ISC1 and FMP52. A website enables results exploration and also MIMaL analysis of user-supplied multi-omic data. AVAILABILITY AND IMPLEMENTATION: The website for MIMaL is at https://mimal.app. Code for the website is at https://github.com/qdickinson/mimal-website. Code to implement MIMaL is at https://github.com/jessegmeyerlab/MIMaL. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Aprendizaje Automático , Redes y Vías Metabólicas , Análisis por Conglomerados , ProteomaRESUMEN
Machine learning with multi-layered artificial neural networks, also known as "deep learning," is effective for making biological predictions. However, model interpretation is challenging, especially for sequential input data used with recurrent neural network architectures. Here, we introduce a framework called "Positional SHAP" (PoSHAP) to interpret models trained from biological sequences by utilizing SHapely Additive exPlanations (SHAP) to generate positional model interpretations. We demonstrate this using three long short-term memory (LSTM) regression models that predict peptide properties, including binding affinity to major histocompatibility complexes (MHC), and collisional cross section (CCS) measured by ion mobility spectrometry. Interpretation of these models with PoSHAP reproduced MHC class I (rhesus macaque Mamu-A1*001 and human A*11:01) peptide binding motifs, reflected known properties of peptide CCS, and provided new insights into interpositional dependencies of amino acid interactions. PoSHAP should have widespread utility for interpreting a variety of models trained from biological sequences.
Asunto(s)
Biología Computacional/métodos , Aprendizaje Profundo , Modelos Biológicos , Análisis de Secuencia de Proteína/métodos , Secuencia de Aminoácidos , Animales , Sitios de Unión , Humanos , Macaca mulatta , Péptidos/química , Péptidos/metabolismoRESUMEN
Proteomic analysis on the scale that captures population and biological heterogeneity over hundreds to thousands of samples requires rapid mass spectrometry methods, which maximize instrument utilization (IU) and proteome coverage while maintaining precise and reproducible quantification. To achieve this, a short liquid chromatography gradient paired to rapid mass spectrometry data acquisition can be used to reproducibly quantify a moderate set of analytes. High-throughput profiling at a limited depth is becoming an increasingly utilized strategy for tackling large sample sets but the time spent on loading the sample, flushing the column(s), and re-equilibrating the system reduces the ratio of meaningful data acquired to total operation time and IU. The dual-trap single-column configuration (DTSC) presented here maximizes IU in rapid analysis (15 min per sample) of blood and cell lysates by parallelizing trap column cleaning and sample loading and desalting with the analysis of the previous sample. We achieved 90% IU in low microflow (9.5 µL/min) analysis of blood while reproducibly quantifying 300-400 proteins and over 6000 precursor ions. The same IU was achieved for cell lysates and over 4000 proteins (3000 at CV below 20%) and 40,000 precursor ions were quantified at a rate of 15 min/sample. Thus, DTSC enables high-throughput epidemiological blood-based biomarker cohort studies and cell-based perturbation screening.
Asunto(s)
Proteoma , Proteómica , Biomarcadores , Cromatografía Liquida/métodos , Humanos , Espectrometría de Masas/métodos , Proteoma/análisis , Proteómica/métodosRESUMEN
While much effort has been placed on comprehensive quantitative proteome analysis, certain applications demand the measurement of only a few target proteins from complex systems. Traditional approaches to targeted proteomics rely on nanoliquid chromatography (nLC) and targeted mass spectrometry (MS) methods, e.g., parallel reaction monitoring (PRM). However, the time requirement for nLC can limit the throughput of targeted proteomics. To achieve rapid and high-throughput targeted methods, here we show that nLC separations can be eliminated and replaced with direct infusion shotgun proteome analysis (DISPA) using high-field asymmetric waveform ion mobility spectrometry (FAIMS) with PRM. We demonstrate the application of DISPA-PRM for rapid targeted quantification of bacterial enzymes utilized in the production of biofuels by monitoring temporal expression in 72 metabolically engineered bacterial cultures in less than 2.5 h, with a measured dynamic range >1200-fold. We conclude that DISPA-PRM presents a valuable innovative tool with results comparable to nLC-MS/MS, enabling fast and rapid detection of targeted proteins in complex mixtures.
Asunto(s)
Proteoma , Espectrometría de Masas en Tándem , Espectrometría de Movilidad Iónica , Proteoma/análisis , Proteómica/métodosRESUMEN
Shotgun proteomics techniques infer the presence and quantity of proteins using peptide proxies produced by cleavage of the proteome with a protease. Most protein quantitation strategies assume that multiple peptides derived from a protein will behave quantitatively similar across treatment groups, but this assumption may be false due to (1) heterogeneous proteoforms and (2) technical artifacts. Here we describe a strategy called peptide correlation analysis (PeCorA) that detects quantitative disagreements between peptides mapped to the same protein. PeCorA fits linear models to assess whether a peptide's change across treatment groups differs from all other peptides assigned to the same protein. PeCorA revealed that â¼15% of proteins in a mouse microglia stress data set contain at least one discordant peptide. Inspection of the discordant peptides shows the utility of PeCorA for the direct and indirect detection of regulated post-translational modifications (PTMs) and also for the discovery of poorly quantified peptides. The exclusion of poorly quantified peptides before protein quantity summarization decreased false-positives in a benchmark data set. Finally, PeCorA suggests that the inactive isoform of prothrombin, a coagulation cascade protease, is more abundant in plasma from COVID-19 patients relative to non-COVID-19 controls. PeCorA is freely available as an R package that works with arbitrary tables of quantified peptides.
Asunto(s)
Péptidos/análisis , Proteómica , Animales , COVID-19/sangre , Humanos , Ratones , Microglía , Procesamiento Proteico-Postraduccional , Proteoma , Protrombina/análisisRESUMEN
Direct infusion shotgun proteome analysis (DISPA) is a new paradigm for expedited mass spectrometry-based proteomics, but the original data analysis workflow was onerous. Here, we introduce CsoDIAq, a user-friendly software package for the identification and quantification of peptides and proteins from DISPA data. In addition to establishing a complete and automated analysis workflow with a graphical user interface, CsoDIAq introduces algorithmic concepts to spectrum-spectrum matching to improve peptide identification speed and sensitivity. These include spectra pooling to reduce search time complexity and a new spectrum-spectrum match score called match count and cosine, which improves target discrimination in a target-decoy analysis. Fragment mass tolerance correction also increased the number of peptide identifications. Finally, we adapt CsoDIAq to standard LC-MS DIA and show that it outperforms other spectrum-spectrum matching software.
Asunto(s)
Proteoma , Programas Informáticos , Algoritmos , Cromatografía Liquida , Bases de Datos de Proteínas , ProteómicaRESUMEN
Empirical testing of chemicals for drug efficacy costs many billions of dollars every year. The ability to predict the action of molecules in silico would greatly increase the speed and decrease the cost of prioritizing drug leads. Here, we asked whether drug function, defined as MeSH "therapeutic use" classes, can be predicted from only a chemical structure. We evaluated two chemical-structure-derived drug classification methods, chemical images with convolutional neural networks and molecular fingerprints with random forests, both of which outperformed previous predictions that used drug-induced transcriptomic changes as chemical representations. This suggests that the structure of a chemical contains at least as much information about its therapeutic use as the transcriptional cellular response to that chemical. Furthermore, because training data based on chemical structure is not limited to a small set of molecules for which transcriptomic measurements are available, our strategy can leverage more training data to significantly improve predictive accuracy to 83-88%. Finally, we explore use of these models for prediction of side effects and drug-repurposing opportunities and demonstrate the effectiveness of this modeling strategy for multilabel classification.
Asunto(s)
Descubrimiento de Drogas/métodos , Simulación por Computador , Reposicionamiento de Medicamentos , Estructura Molecular , Redes Neurales de la Computación , Relación Estructura-ActividadRESUMEN
Progressive loss of proteostasis is a hallmark of aging that is marked by declines in various components of proteostasis machinery, including: autophagy, ubiquitin-mediated degradation, protein synthesis, and others. While declines in proteostasis have historically been observed as changes in these processes, or as bulk changes in the proteome, recent advances in proteomic methodologies have enabled the comprehensive measurement of turnover directly at the level of individual proteins in vivo. These methods, which utilize a combination of stable-isotope labeling, mass spectrometry, and specialized software analysis, have now been applied to various studies of aging and longevity. Here we review the role of proteostasis in aging and longevity, with a focus on the proteomic methods available to conduct protein turnover in aging models and the insights these studies have provided thus far.
Asunto(s)
Envejecimiento , Longevidad , Proteínas/metabolismo , Proteostasis , Animales , HumanosRESUMEN
Protein posttranslational modifications (PTMs) are of increasing interest in biomedical research, yet studies rarely examine more than one PTM. One barrier to multi-PTM studies is the time cost for both sample preparation and data acquisition, which scale linearly with the number of modifications. The most prohibitive requirement is often the need for large amounts of sample, which must be increased proportionally with the number of PTM enrichment steps. Here, a streamlined, quantitative label-free proteomic workflow-"one-pot" PTM enrichment-that enables comprehensive identification and quantification of peptides containing acetylated and succinylated lysine residues from a single sample containing as little as 1 mg mitochondria protein is described. Coupled with a label-free, data-independent acquisition (DIA), 2235 acetylated and 2173 succinylated peptides with the one-pot method are identified and quantified and peak areas are shown to be highly correlated between the one-pot and traditional single-PTM enrichments. The 'one-pot' method makes possible detection of multiple PTMs occurring on the same peptide, and it is shown that it can be used to make unique biological insights into PTM crosstalk. Compared to single-PTM enrichments, the one-pot workflow has equivalent reproducibility and enables direct assessment of PTM crosstalk from biological samples in less time from less tissue.
Asunto(s)
Cromatografía de Afinidad/métodos , Lisina/química , Mitocondrias Hepáticas/metabolismo , Proteínas Mitocondriales/metabolismo , Procesamiento Proteico-Postraduccional , Proteoma/análisis , Ácido Succínico/química , Acetilación , Animales , Ratones , Proteínas Mitocondriales/química , Espectrometría de Masas en TándemRESUMEN
INTRODUCTION: While selected/multiple-reaction monitoring (SRM or MRM) is considered the gold standard for quantitative protein measurement, emerging data-independent acquisition (DIA) using high-resolution scans have opened a new dimension of high-throughput, comprehensive quantitative proteomics. These newer methodologies are particularly well suited for discovery of biomarker candidates from human disease samples, and for investigating and understanding human disease pathways. Areas covered: This article reviews the current state of targeted and untargeted DIA mass spectrometry-based proteomic workflows, including SRM, parallel-reaction monitoring (PRM) and untargeted DIA (e.g., SWATH). Corresponding bioinformatics strategies, as well as application in biological and clinical studies are presented. Expert commentary: Nascent application of highly-multiplexed untargeted DIA, such as SWATH, for accurate protein quantification from clinically relevant and disease-related samples shows great potential to comprehensively investigate biomarker candidates and understand disease.
Asunto(s)
Técnicas de Diagnóstico Molecular/métodos , Proteómica/métodos , Biomarcadores/química , Humanos , Técnicas de Diagnóstico Molecular/normas , Proteómica/normasRESUMEN
Bottom-up proteomics studies traditionally involve proteome digestion with a single protease, trypsin. However, trypsin alone does not generate peptides that encompass the entire proteome. Alternative proteases have been explored, but most have specificity for charged amino acid side chains. Therefore, additional proteases that improve proteome coverage through cleavage at sequences complementary to trypsin's may increase proteome coverage. We demonstrate the novel application of two proteases for bottom-up proteomics: wild type α-lytic protease (WaLP) and an active site mutant of WaLP, M190A α-lytic protease (MaLP). We assess several relevant factors, including MS/MS fragmentation, peptide length, peptide yield, and protease specificity. When data from separate digestions with trypsin, LysC, WaLP, and MaLP were combined, proteome coverage was increased by 101% relative to that achieved with trypsin digestion alone. To demonstrate how the gained sequence coverage can yield additional post-translational modification information, we show the identification of a number of novel phosphorylation sites in the Schizosaccharomyces pombe proteome and include an illustrative example from the protein MPD2 wherein two novel sites are identified, one in a tryptic peptide too short to identify and the other in a sequence devoid of tryptic sites. The specificity of WaLP and MaLP for aliphatic amino acid side chains was particularly valuable for coverage of membrane protein sequences, which increased 350% when the data from trypsin, LysC, WaLP, and MaLP were combined.