Pesquisa | Biblioteca Virtual em Saúde

1.

Integrated Proteomics Analysis of Baseline Protein Expression in Pig Tissues.

Wang, Shengbo; Collins, Andrew; Prakash, Ananth; Fexova, Silvie; Papatheodorou, Irene; Jones, Andrew R; Vizcaíno, Juan Antonio.

J Proteome Res ; 2024 May 08.

Artigo em Inglês | MEDLINE | ID: mdl-38717300

RESUMO

The availability of an increasingly large amount of public proteomics data sets presents an opportunity for performing combined analyses to generate comprehensive organism-wide protein expression maps across different organisms and biological conditions. Sus scrofa, a domestic pig, is a model organism relevant for food production and for human biomedical research. Here, we reanalyzed 14 public proteomics data sets from the PRIDE database coming from pig tissues to assess baseline (without any biological perturbation) protein abundance in 14 organs, encompassing a total of 20 healthy tissues from 128 samples. The analysis involved the quantification of protein abundance in 599 mass spectrometry runs. We compared protein expression patterns among different pig organs and examined the distribution of proteins across these organs. Then, we studied how protein abundances were compared across different data sets and studied the tissue specificity of the detected proteins. Of particular interest, we conducted a comparative analysis of protein expression between pig and human tissues, revealing a high degree of correlation in protein expression among orthologs, particularly in brain, kidney, heart, and liver samples. We have integrated the protein expression results into the Expression Atlas resource for easy access and visualization of the protein expression data individually or alongside gene expression data.

2.

Dynamic Changes in the Thylakoid Proteome of Cyanobacteria during Light-Regulated Thylakoid Membrane Development.

Huang, Fang; Grauslys, Arturas; Huokko, Tuomas; Caamaño Gutiérrez, Eva; Jones, Andrew R; Liu, Lu-Ning.

Plants (Basel) ; 12(23)2023 Nov 25.

Artigo em Inglês | MEDLINE | ID: mdl-38068604

RESUMO

Cyanobacteria were among the oldest organisms to undertake oxygenic photosynthesis and have an essential impact on the atmosphere and carbon/nitrogen cycles on the planet. The thylakoid membrane of cyanobacteria represents an intricate compartment that houses a variety of multi-component (pigment-)protein complexes, assembly factors, and regulators, as well as transporters involved in photosynthetic light reactions, and respiratory electron transport. How these protein components are incorporated into membranes during thylakoid formation and how individual complexes are regulated to construct the functional machinery remains elusive. Here, we carried out an in-depth statistical analysis of the thylakoid proteome data obtained during light-induced thylakoid membrane biogenesis in the model cyanobacterium Synechococcus elongatus PCC 7942. A total of 1581 proteins were experimentally quantified, among which 457 proteins demonstrated statistically significant variations in abundance at distinct thylakoid biogenesis stages. Gene Ontology and KEGG enrichment analysis revealed that predominantly photosystems, light-harvesting antennae, ABC transporters, and pathway enzymes involved in oxidative stress responses and protein folding exhibited notable alternations in abundance between high light and growth light. Moreover, through cluster analysis the 1581 proteins were categorized into six distinct clusters that have significantly different trajectories of the change in their abundance during thylakoid development. Our study provides insights into the physiological regulation for the membrane integration of protein components and functionally linked complexes during the cyanobacterial TM biogenesis process. The findings and analytical methodologies developed in this study may be valuable for studying the global responses of TM biogenesis and photosynthetic acclimation in plants and algae.

3.

A meta-analysis of rice phosphoproteomics data to understand variation in cell signalling across the rice pan-genome.

Ramsbottom, Kerry A; Prakash, Ananth; Riverol, Yasset Perez; Camacho, Oscar Martin; Sun, Zhi; Kundu, Deepti J; Bowler-Barnett, Emily; Martin, Maria; Fan, Jun; Chebotarov, Dmytro; McNally, Kenneth L; Deutsch, Eric W; Vizcaíno, Juan Antonio; Jones, Andrew R.

bioRxiv ; 2023 Nov 17.

Artigo em Inglês | MEDLINE | ID: mdl-38014076

RESUMO

Phosphorylation is the most studied post-translational modification, and has multiple biological functions. In this study, we have re-analysed publicly available mass spectrometry proteomics datasets enriched for phosphopeptides from Asian rice (Oryza sativa). In total we identified 15,522 phosphosites on serine, threonine and tyrosine residues on rice proteins. We identified sequence motifs for phosphosites, and link motifs to enrichment of different biological processes, indicating different downstream regulation likely caused by different kinase groups. We cross-referenced phosphosites against the rice 3,000 genomes, to identify single amino acid variations (SAAVs) within or proximal to phosphosites that could cause loss of a site in a given rice variety. The data was clustered to identify groups of sites with similar patterns across rice family groups, for example those highly conserved in Japonica, but mostly absent in Aus type rice varieties - known to have different responses to drought. These resources can assist rice researchers to discover alleles with significantly different functional effects across rice varieties. The data has been loaded into UniProt Knowledge-Base - enabling researchers to visualise sites alongside other data on rice proteins e.g. structural models from AlphaFold2, PeptideAtlas and the PRIDE database - enabling visualisation of source evidence, including scores and supporting mass spectra.

4.

Custom Workflow for the Confident Identification of Sulfotyrosine-Containing Peptides and Their Discrimination from Phosphopeptides.

Daly, Leonard A; Byrne, Dominic P; Perkins, Simon; Brownridge, Philip J; McDonnell, Euan; Jones, Andrew R; Eyers, Patrick A; Eyers, Claire E.

J Proteome Res ; 22(12): 3754-3772, 2023 12 01.

Artigo em Inglês | MEDLINE | ID: mdl-37939282

RESUMO

Protein tyrosine sulfation (sY) is a post-translational modification (PTM) catalyzed by Golgi-resident tyrosyl protein sulfo transferases (TPSTs). Information on sY in humans is currently limited to â¼50 proteins, with only a handful having verified sites of sulfation. As such, the contribution of sulfation to the regulation of biological processes remains poorly defined. Mass spectrometry (MS)-based proteomics is the method of choice for PTM analysis but has yet to be applied for systematic investigation of the "sulfome", primarily due to issues associated with discrimination of sY-containing from phosphotyrosine (pY)-containing peptides. In this study, we developed an MS-based workflow for sY-peptide characterization, incorporating optimized Zr4+ immobilized metal-ion affinity chromatography (IMAC) and TiO2 enrichment strategies. Extensive characterization of a panel of sY- and pY-peptides using an array of fragmentation regimes (CID, HCD, EThcD, ETciD, UVPD) highlighted differences in the generation of site-determining product ions and allowed us to develop a strategy for differentiating sulfated peptides from nominally isobaric phosphopeptides based on low collision energy-induced neutral loss. Application of our "sulfomics" workflow to a HEK-293 cell extracellular secretome facilitated identification of 21 new sulfotyrosine-containing proteins, several of which we validate enzymatically, and reveals new interplay between enzymes relevant to both protein and glycan sulfation.

Assuntos

Fosfopeptídeos , Tirosina , Humanos , Fosfopeptídeos/análise , Células HEK293 , Fluxo de Trabalho , Tirosina/metabolismo , Proteínas , Fosfotirosina

5.

GET_PANGENES: calling pangenes from plant genome alignments confirms presence-absence variation.

Contreras-Moreira, Bruno; Saraf, Shradha; Naamati, Guy; Casas, Ana M; Amberkar, Sandeep S; Flicek, Paul; Jones, Andrew R; Dyer, Sarah.

Genome Biol ; 24(1): 223, 2023 10 05.

Artigo em Inglês | MEDLINE | ID: mdl-37798615

RESUMO

Crop pangenomes made from individual cultivar assemblies promise easy access to conserved genes, but genome content variability and inconsistent identifiers hamper their exploration. To address this, we define pangenes, which summarize a species coding potential and link back to original annotations. The protocol get_pangenes performs whole genome alignments (WGA) to call syntenic gene models based on coordinate overlaps. A benchmark with small and large plant genomes shows that pangenes recapitulate phylogeny-based orthologies and produce complete soft-core gene sets. Moreover, WGAs support lift-over and help confirm gene presence-absence variation. Source code and documentation: https://github.com/Ensembl/plant-scripts .

Assuntos

Genoma de Planta , Software

6.

A systems approach reveals species differences in hepatic stress response capacity.

Russomanno, Giusy; Sison-Young, Rowena; Livoti, Lucia A; Coghlan, Hannah; Jenkins, Rosalind E; Kunnen, Steven J; Fisher, Ciarán P; Reddyhoff, Dennis; Gardner, Iain; Rehman, Adeeb H; Fenwick, Stephen W; Jones, Andrew R; Vermeil De Conchard, Guy; Simonin, Gilles; Bertheux, Helene; Weaver, Richard J; Johnson, Robert L; Liguori, Michael J; Clausznitzer, Diana; Stevens, James L; Goldring, Christopher E; Copple, Ian M.

Toxicol Sci ; 196(1): 112-125, 2023 10 30.

Artigo em Inglês | MEDLINE | ID: mdl-37647630

RESUMO

To minimize the occurrence of unexpected toxicities in early phase preclinical studies of new drugs, it is vital to understand fundamental similarities and differences between preclinical species and humans. Species differences in sensitivity to acetaminophen (APAP) liver injury have been related to differences in the fraction of the drug that is bioactivated to the reactive metabolite N-acetyl-p-benzoquinoneimine (NAPQI). We have used physiologically based pharmacokinetic modeling to identify oral doses of APAP (300 and 1000 mg/kg in mice and rats, respectively) yielding similar hepatic burdens of NAPQI to enable the comparison of temporal liver tissue responses under conditions of equivalent chemical insult. Despite pharmacokinetic and biochemical verification of the equivalent NAPQI insult, serum biomarker and tissue histopathology analyses revealed that mice still exhibited a greater degree of liver injury than rats. Transcriptomic and proteomic analyses highlighted the stronger activation of stress response pathways (including the Nrf2 oxidative stress response and autophagy) in the livers of rats, indicative of a more robust transcriptional adaptation to the equivalent insult. Components of these pathways were also found to be expressed at a higher basal level in the livers of rats compared with both mice and humans. Our findings exemplify a systems approach to understanding differential species sensitivity to hepatotoxicity. Multiomics analysis indicated that rats possess a greater basal and adaptive capacity for hepatic stress responses than mice and humans, with important implications for species selection and human translation in the safety testing of new drug candidates associated with reactive metabolite formation.

Assuntos

Acetaminofen , Doença Hepática Induzida por Substâncias e Drogas , Ratos , Camundongos , Humanos , Animais , Acetaminofen/toxicidade , Acetaminofen/metabolismo , Proteômica , Especificidade da Espécie , Doença Hepática Induzida por Substâncias e Drogas/metabolismo , Fígado/metabolismo , Estresse Oxidativo , Análise de Sistemas

7.

Assessing Multiple Evidence Streams to Decide on Confidence for Identification of Post-Translational Modifications, within and Across Data Sets.

Camacho, Oscar M; Ramsbottom, Kerry A; Collins, Andrew; Jones, Andrew R.

J Proteome Res ; 22(6): 1828-1842, 2023 06 02.

Artigo em Inglês | MEDLINE | ID: mdl-37099386

RESUMO

Phosphorylation is a post-translational modification of great interest to researchers due to its relevance in many biological processes. LC-MS/MS techniques have enabled high-throughput data acquisition, with studies claiming identification and localization of thousands of phosphosites. The identification and localization of phosphosites emerge from different analytical pipelines and scoring algorithms, with uncertainty embedded throughout the pipeline. For many pipelines and algorithms, arbitrary thresholding is used, but little is known about the actual global false localization rate in these studies. Recently, it has been suggested to use decoy amino acids to estimate global false localization rates of phosphosites, among the peptide-spectrum matches reported. Here, we describe a simple pipeline aiming to maximize the information extracted from these studies by objectively collapsing from peptide-spectrum match to the peptidoform-site level, as well as combining findings from multiple studies while maintaining track of false localization rates. We show that the approach is more effective than current processes that use a simpler mechanism for handling phosphosite identification redundancy within and across studies. In our case study using eight rice phosphoproteomics data sets, 6368 unique sites were confidently identified using our decoy approach compared to 4687 using traditional thresholding in which false localization rates are unknown.

Assuntos

Proteômica , Rios , Cromatografia Líquida , Proteômica/métodos , Espectrometria de Massas em Tandem , Processamento de Proteína Pós-Traducional , Peptídeos/química , Algoritmos , Bases de Dados de Proteínas

8.

TriTrypDB: An integrated functional genomics resource for kinetoplastida.

Shanmugasundram, Achchuthan; Starns, David; Böhme, Ulrike; Amos, Beatrice; Wilkinson, Paul A; Harb, Omar S; Warrenfeltz, Susanne; Kissinger, Jessica C; McDowell, Mary Ann; Roos, David S; Crouch, Kathryn; Jones, Andrew R.

PLoS Negl Trop Dis ; 17(1): e0011058, 2023 01.

Artigo em Inglês | MEDLINE | ID: mdl-36656904

RESUMO

Parasitic diseases caused by kinetoplastid parasites are a burden to public health throughout tropical and subtropical regions of the world. TriTrypDB (https://tritrypdb.org) is a free online resource for data mining of genomic and functional data from these kinetoplastid parasites and is part of the VEuPathDB Bioinformatics Resource Center (https://veupathdb.org). As of release 59, TriTrypDB hosts 83 kinetoplastid genomes, nine of which, including Trypanosoma brucei brucei TREU927, Trypanosoma cruzi CL Brener and Leishmania major Friedlin, undergo manual curation by integrating information from scientific publications, high-throughput assays and user submitted comments. TriTrypDB also integrates transcriptomic, proteomic, epigenomic, population-level and isolate data, functional information from genome-wide RNAi knock-down and fluorescent tagging, and results from automated bioinformatics analysis pipelines. TriTrypDB offers a user-friendly web interface embedded with a genome browser, search strategy system and bioinformatics tools to support custom in silico experiments that leverage integrated data. A Galaxy workspace enables users to analyze their private data (e.g., RNA-sequencing, variant calling, etc.) and explore their results privately in the context of publicly available information in the database. The recent addition of an annotation platform based on Apollo enables users to provide both functional and structural changes that will appear as 'community annotations' immediately and, pending curatorial review, will be integrated into the official genome annotation.

Assuntos

Kinetoplastida , Software , Interface Usuário-Computador , Proteômica , Genômica/métodos , Biologia Computacional/métodos , Bases de Dados Genéticas , Internet

9.

Proteomics Standards Initiative at Twenty Years: Current Activities and Future Work.

Deutsch, Eric W; Vizcaíno, Juan Antonio; Jones, Andrew R; Binz, Pierre-Alain; Lam, Henry; Klein, Joshua; Bittremieux, Wout; Perez-Riverol, Yasset; Tabb, David L; Walzer, Mathias; Ricard-Blum, Sylvie; Hermjakob, Henning; Neumann, Steffen; Mak, Tytus D; Kawano, Shin; Mendoza, Luis; Van Den Bossche, Tim; Gabriels, Ralf; Bandeira, Nuno; Carver, Jeremy; Pullman, Benjamin; Sun, Zhi; Hoffmann, Nils; Shofstahl, Jim; Zhu, Yunping; Licata, Luana; Quaglia, Federica; Tosatto, Silvio C E; Orchard, Sandra E.

J Proteome Res ; 22(2): 287-301, 2023 02 03.

Artigo em Inglês | MEDLINE | ID: mdl-36626722

RESUMO

The Human Proteome Organization (HUPO) Proteomics Standards Initiative (PSI) has been successfully developing guidelines, data formats, and controlled vocabularies (CVs) for the proteomics community and other fields supported by mass spectrometry since its inception 20 years ago. Here we describe the general operation of the PSI, including its leadership, working groups, yearly workshops, and the document process by which proposals are thoroughly and publicly reviewed in order to be ratified as PSI standards. We briefly describe the current state of the many existing PSI standards, some of which remain the same as when originally developed, some of which have undergone subsequent revisions, and some of which have become obsolete. Then the set of proposals currently being developed are described, with an open call to the community for participation in the forging of the next generation of standards. Finally, we describe some synergies and collaborations with other organizations and look to the future in how the PSI will continue to promote the open sharing of data and thus accelerate the progress of the field of proteomics.

Assuntos

Proteoma , Proteômica , Humanos , Padrões de Referência , Vocabulário Controlado , Espectrometria de Massas , Bases de Dados de Proteínas

10.

Integrated View of Baseline Protein Expression in Human Tissues.

Prakash, Ananth; García-Seisdedos, David; Wang, Shengbo; Kundu, Deepti Jaiswal; Collins, Andrew; George, Nancy; Moreno, Pablo; Papatheodorou, Irene; Jones, Andrew R; Vizcaíno, Juan Antonio.

J Proteome Res ; 22(3): 729-742, 2023 03 03.

Artigo em Inglês | MEDLINE | ID: mdl-36577097

RESUMO

The availability of proteomics datasets in the public domain, and in the PRIDE database, in particular, has increased dramatically in recent years. This unprecedented large-scale availability of data provides an opportunity for combined analyses of datasets to get organism-wide protein abundance data in a consistent manner. We have reanalyzed 24 public proteomics datasets from healthy human individuals to assess baseline protein abundance in 31 organs. We defined tissue as a distinct functional or structural region within an organ. Overall, the aggregated dataset contains 67 healthy tissues, corresponding to 3,119 mass spectrometry runs covering 498 samples from 489 individuals. We compared protein abundances between different organs and studied the distribution of proteins across these organs. We also compared the results with data generated in analogous studies. Additionally, we performed gene ontology and pathway-enrichment analyses to identify organ-specific enriched biological processes and pathways. As a key point, we have integrated the protein abundance results into the resource Expression Atlas, where they can be accessed and visualized either individually or together with gene expression data coming from transcriptomics datasets. We believe this is a good mechanism to make proteomics data more accessible for life scientists.

Assuntos

Proteoma , Proteômica , Humanos , Proteoma/análise , Proteômica/métodos , Perfilação da Expressão Gênica , Bases de Dados Factuais , Espectrometria de Massas/métodos , Bases de Dados de Proteínas

11.

Is DIA proteomics data FAIR? Current data sharing practices, available bioinformatics infrastructure and recommendations for the future.

Jones, Andrew R; Deutsch, Eric W; Vizcaíno, Juan Antonio.

Proteomics ; 23(7-8): e2200014, 2023 04.

Artigo em Inglês | MEDLINE | ID: mdl-36074795

RESUMO

Data independent acquisition (DIA) proteomics techniques have matured enormously in recent years, thanks to multiple technical developments in, for example, instrumentation and data analysis approaches. However, there are many improvements that are still possible for DIA data in the area of the FAIR (Findability, Accessibility, Interoperability and Reusability) data principles. These include more tailored data sharing practices and open data standards since public databases and data standards for proteomics were mostly designed with DDA data in mind. Here we first describe the current state of the art in the context of FAIR data for proteomics in general, and for DIA approaches in particular. For improving the current situation for DIA data, we make the following recommendations for the future: (i) development of an open data standard for spectral libraries; (ii) make mandatory the availability of the spectral libraries used in DIA experiments in ProteomeXchange resources; (iii) improve the support for DIA data in the data standards developed by the Proteomics Standards Initiative; and (iv) improve the support for DIA datasets in ProteomeXchange resources, including more tailored metadata requirements.

Assuntos

Proteoma , Proteômica , Proteômica/métodos , Espectrometria de Massas/métodos , Biologia Computacional/métodos

12.

The INSR/AKT/mTOR pathway regulates the pace of myogenesis in a syndecan-3-dependent manner.

Jones, Fiona K; Phillips, Alexander M; Jones, Andrew R; Pisconti, Addolorata.

Matrix Biol ; 113: 61-82, 2022 Nov.

Artigo em Inglês | MEDLINE | ID: mdl-36152781

RESUMO

Muscle stem cells (MuSCs) are indispensable for muscle regeneration. A multitude of extracellular stimuli direct MuSC fate decisions from quiescent progenitors to differentiated myocytes. The activity of these signals is modulated by coreceptors such as syndecan-3 (SDC3). We investigated the global landscape of SDC3-mediated regulation of myogenesis using a phosphoproteomics approach which revealed, with the precision level of individual phosphosites, the large-scale extent of SDC3-mediated regulation of signal transduction in MuSCs. We then focused on INSR/AKT/mTOR as a key pathway regulated by SDC3 during myogenesis and mechanistically dissected SDC3-mediated inhibition of insulin receptor signaling in MuSCs. SDC3 interacts with INSR ultimately limiting signal transduction via AKT/mTOR. Both knockdown of INSR and inhibition of AKT restore Sdc3-/- MuSC differentiation to wild type levels. Since SDC3 is rapidly downregulated at the onset of differentiation, our study suggests that SDC3 acts a timekeeper to restrain proliferating MuSC response and prevent premature differentiation.

Assuntos

Músculo Esquelético , Proteínas Proto-Oncogênicas c-akt , Proteínas Proto-Oncogênicas c-akt/genética , Proteínas Proto-Oncogênicas c-akt/metabolismo , Sindecana-3/genética , Sindecana-3/metabolismo , Células Cultivadas , Músculo Esquelético/metabolismo , Desenvolvimento Muscular/genética , Serina-Treonina Quinases TOR/genética , Serina-Treonina Quinases TOR/metabolismo , Diferenciação Celular

13.

Understanding SUMO-mediated adaptive responses in plants to improve crop productivity.

Clark, Lisa; Sue-Ob, Kawinnat; Mukkawar, Vaishnavi; Jones, Andrew R; Sadanandom, Ari.

Essays Biochem ; 66(2): 155-168, 2022 08 05.

Artigo em Inglês | MEDLINE | ID: mdl-35920279

RESUMO

The response to abiotic and biotic stresses in plants and crops is considered a multifaceted process. Due to their sessile nature, plants have evolved unique mechanisms to ensure that developmental plasticity remains during their life cycle. Among these mechanisms, post-translational modifications (PTMs) are crucial components of adaptive responses in plants and transduce environmental stimuli into cellular signalling through the modulation of proteins. SUMOylation is an emerging PTM that has received recent attention due to its dynamic role in protein modification and has quickly been considered a significant component of adaptive mechanisms in plants during stress with great potential for agricultural improvement programs. In the present review, we outline the concept that small ubiquitin-like modifier (SUMO)-mediated response in plants and crops to abiotic and biotic stresses is a multifaceted process with each component of the SUMO cycle facilitating tolerance to several different environmental stresses. We also highlight the clear increase in SUMO genes in crops when compared with Arabidopsis thaliana. The SUMO system is understudied in crops, given the importance of SUMO for stress responses, and for some SUMO genes, the apparent expansion provides new avenues to discover SUMO-conjugated targets that could regulate beneficial agronomical traits.

Assuntos

Arabidopsis , Ubiquitina , Arabidopsis/genética , Arabidopsis/metabolismo , Produtos Agrícolas/genética , Produtos Agrícolas/metabolismo , Estresse Fisiológico , Sumoilação , Ubiquitina/metabolismo

14.

Integrated view and comparative analysis of baseline protein expression in mouse and rat tissues.

Wang, Shengbo; García-Seisdedos, David; Prakash, Ananth; Kundu, Deepti Jaiswal; Collins, Andrew; George, Nancy; Fexova, Silvie; Moreno, Pablo; Papatheodorou, Irene; Jones, Andrew R; Vizcaíno, Juan Antonio.

PLoS Comput Biol ; 18(6): e1010174, 2022 06.

Artigo em Inglês | MEDLINE | ID: mdl-35714157

RESUMO

The increasingly large amount of proteomics data in the public domain enables, among other applications, the combined analyses of datasets to create comparative protein expression maps covering different organisms and different biological conditions. Here we have reanalysed public proteomics datasets from mouse and rat tissues (14 and 9 datasets, respectively), to assess baseline protein abundance. Overall, the aggregated dataset contained 23 individual datasets, including a total of 211 samples coming from 34 different tissues across 14 organs, comprising 9 mouse and 3 rat strains, respectively. In all cases, we studied the distribution of canonical proteins between the different organs. The number of canonical proteins per dataset ranged from 273 (tendon) and 9,715 (liver) in mouse, and from 101 (tendon) and 6,130 (kidney) in rat. Then, we studied how protein abundances compared across different datasets and organs for both species. As a key point we carried out a comparative analysis of protein expression between mouse, rat and human tissues. We observed a high level of correlation of protein expression among orthologs between all three species in brain, kidney, heart and liver samples, whereas the correlation of protein expression was generally slightly lower between organs within the same species. Protein expression results have been integrated into the resource Expression Atlas for widespread dissemination.

Assuntos

Proteínas , Proteômica , Animais , Encéfalo/metabolismo , Camundongos , Proteínas/metabolismo , Ratos

15.

Profiling the Human Phosphoproteome to Estimate the True Extent of Protein Phosphorylation.

Kalyuzhnyy, Anton; Eyers, Patrick A; Eyers, Claire E; Bowler-Barnett, Emily; Martin, Maria J; Sun, Zhi; Deutsch, Eric W; Jones, Andrew R.

J Proteome Res ; 21(6): 1510-1524, 2022 06 03.

Artigo em Inglês | MEDLINE | ID: mdl-35532924

RESUMO

Public phosphorylation databases such as PhosphoSitePlus (PSP) and PeptideAtlas (PA) compile results from published papers or openly available mass spectrometry (MS) data. However, there is no database-level control for false discovery of sites, likely leading to the overestimation of true phosphosites. By profiling the human phosphoproteome, we estimate the false discovery rate (FDR) of phosphosites and predict a more realistic count of true identifications. We rank sites into phosphorylation likelihood sets and analyze them in terms of conservation across 100 species, sequence properties, and functional annotations. We demonstrate significant differences between the sets and develop a method for independent phosphosite FDR estimation. Remarkably, we report estimated FDRs of 84, 98, and 82% within sets of phosphoserine (pSer), phosphothreonine (pThr), and phosphotyrosine (pTyr) sites, respectively, that are supported by only a single piece of identification evidenceâthe majority of sites in PSP. We estimate that around 62â¯000 Ser, 8000 Thr, and 12â¯000 Tyr phosphosites in the human proteome are likely to be true, which is lower than most published estimates. Furthermore, our analysis estimates that 86â¯000 Ser, 50â¯000 Thr, and 26â¯000 Tyr phosphosites are likely false-positive identifications, highlighting the significant potential of false-positive data to be present in phosphorylation databases.

Assuntos

Fosfopeptídeos , Proteoma , Humanos , Espectrometria de Massas/métodos , Fosfopeptídeos/metabolismo , Fosfoproteínas/análise , Fosforilação , Proteoma/análise

16.

Method for Independent Estimation of the False Localization Rate for Phosphoproteomics.

Ramsbottom, Kerry A; Prakash, Ananth; Riverol, Yasset Perez; Camacho, Oscar Martin; Martin, Maria-Jesus; Vizcaíno, Juan Antonio; Deutsch, Eric W; Jones, Andrew R.

J Proteome Res ; 21(7): 1603-1615, 2022 07 01.

Artigo em Inglês | MEDLINE | ID: mdl-35640880

RESUMO

Phosphoproteomic methods are commonly employed to identify and quantify phosphorylation sites on proteins. In recent years, various tools have been developed, incorporating scores or statistics related to whether a given phosphosite has been correctly identified or to estimate the global false localization rate (FLR) within a given data set for all sites reported. These scores have generally been calibrated using synthetic datasets, and their statistical reliability on real datasets is largely unknown, potentially leading to studies reporting incorrectly localized phosphosites, due to inadequate statistical control. In this work, we develop the concept of scoring modifications on a decoy amino acid, that is, one that cannot be modified, to allow for independent estimation of global FLR. We test a variety of amino acids, on both synthetic and real data sets, demonstrating that the selection can make a substantial difference to the estimated global FLR. We conclude that while several different amino acids might be appropriate, the most reliable FLR results were achieved using alanine and leucine as decoys. We propose the use of a decoy amino acid to control false reporting in the literature and in public databases that re-distribute the data. Data are available via ProteomeXchange with identifier PXD028840.

Assuntos

Aminoácidos , Espectrometria de Massas em Tandem , Bases de Dados de Proteínas , Reprodutibilidade dos Testes , Espectrometria de Massas em Tandem/métodos

17.

Evidence Against Carbonization of the Thin-Film Filters of the Extreme Ultraviolet Variability Experiment onboard the Solar Dynamics Observatory.

Tarrio, Charles; Berg, Robert F; Lucatorto, Thomas B; Eparvier, Francis G; Jones, Andrew R; Templeman, Brian; Woodraska, Donald L; Dominique, Marie.

Sol Phys ; 296(3)2021.

Artigo em Inglês | MEDLINE | ID: mdl-34803188

RESUMO

In spite of strict limits on outgassing from organic materials, some spacecraft instruments making long-term measurements of solar extreme ultraviolet (EUV) radiation still suffer significant degradation. While such measures have reduced the rate of degradation, they have not completely eliminated it in some cases. For example, in five years, the aluminum filters used in the Extreme Ultraviolet Variability Experiment (EVE) instruments onboard the Solar Dynamics Observatory (SDO) suffered losses exceeding 40% at 30.4 nm. Comparing those losses with the negligible losses of nearby zirconium filters on the same instruments indicated that the problem was not due to carbonization on the Sun-facing side of the filter. To investigate whether the loss was due to carbon deposition on the downstream face of the Al filter, we exposed the backsides of Al and Zr filters to EUV in the presence of a volatile organic solvent in the laboratory and concluded that this could not be the cause. Given that the residual gas composition in the SDO spacecraft likely has water vapor as well as organics, these findings suggest that the transmission loss in the Al filter originated with oxidation caused by UV-activated adsorbed water.

18.

Characterising proteolysis during SARS-CoV-2 infection identifies viral cleavage sites and cellular targets with therapeutic potential.

Meyer, Bjoern; Chiaravalli, Jeanne; Gellenoncourt, Stacy; Brownridge, Philip; Bryne, Dominic P; Daly, Leonard A; Grauslys, Arturas; Walter, Marius; Agou, Fabrice; Chakrabarti, Lisa A; Craik, Charles S; Eyers, Claire E; Eyers, Patrick A; Gambin, Yann; Jones, Andrew R; Sierecki, Emma; Verdin, Eric; Vignuzzi, Marco; Emmott, Edward.

Nat Commun ; 12(1): 5553, 2021 09 21.

Artigo em Inglês | MEDLINE | ID: mdl-34548480

RESUMO

SARS-CoV-2 is the causative agent behind the COVID-19 pandemic, responsible for over 170 million infections, and over 3.7 million deaths worldwide. Efforts to test, treat and vaccinate against this pathogen all benefit from an improved understanding of the basic biology of SARS-CoV-2. Both viral and cellular proteases play a crucial role in SARS-CoV-2 replication. Here, we study proteolytic cleavage of viral and cellular proteins in two cell line models of SARS-CoV-2 replication using mass spectrometry to identify protein neo-N-termini generated through protease activity. We identify previously unknown cleavage sites in multiple viral proteins, including major antigens S and N: the main targets for vaccine and antibody testing efforts. We discover significant increases in cellular cleavage events consistent with cleavage by SARS-CoV-2 main protease, and identify 14 potential high-confidence substrates of the main and papain-like proteases. We show that siRNA depletion of these cellular proteins inhibits SARS-CoV-2 replication, and that drugs targeting two of these proteins: the tyrosine kinase SRC and Ser/Thr kinase MYLK, show a dose-dependent reduction in SARS-CoV-2 titres. Overall, our study provides a powerful resource to understand proteolysis in the context of viral infection, and to inform the development of targeted strategies to inhibit SARS-CoV-2 and treat COVID-19.

Assuntos

Antivirais/farmacologia , COVID-19/metabolismo , Inibidores de Proteases/farmacologia , SARS-CoV-2/efeitos dos fármacos , Animais , Linhagem Celular , Dipeptídeos/farmacologia , Humanos , Mutação , Quinase de Cadeia Leve de Miosina/antagonistas & inibidores , Quinase de Cadeia Leve de Miosina/genética , Quinase de Cadeia Leve de Miosina/metabolismo , Proteólise , Proteômica , RNA Interferente Pequeno/farmacologia , SARS-CoV-2/genética , Proteases Virais/metabolismo , Proteínas Virais/genética , Proteínas Virais/metabolismo , Internalização do Vírus/efeitos dos fármacos , Replicação Viral/efeitos dos fármacos , Quinases da Família src/antagonistas & inibidores , Quinases da Família src/genética , Quinases da Família src/metabolismo , Tratamento Farmacológico da COVID-19

19.

MHCVision: estimation of global and local false discovery rate for MHC class I peptide binding prediction.

Pearngam, Phorutai; Sriswasdi, Sira; Pisitkun, Trairak; Jones, Andrew R.

Bioinformatics ; 37(21): 3830-3838, 2021 11 05.

Artigo em Inglês | MEDLINE | ID: mdl-34196671

RESUMO

MOTIVATION: MHC-peptide binding prediction has been widely used for understanding the immune response of individuals or populations, each carrying different MHC molecules as well as for the development of immunotherapeutics. The results from MHC-peptide binding prediction tools are mostly reported as a predicted binding affinity (IC50) and the percentile rank score, and global thresholds e.g. IC50 value < 500 nM or percentile rank < 2% are generally recommended for distinguishing binding peptides from non-binding peptides. However, it is difficult to evaluate statistically the probability of an individual peptide binding prediction to be true or false solely considering predicted scores. Therefore, statistics describing the overall global false discovery rate (FDR) and local FDR, also called posterior error probability (PEP) are required to give statistical context to the natively produced scores. RESULT: We have developed an algorithm and code implementation, called MHCVision, for estimation of FDR and PEP values for the predicted results of MHC-peptide binding prediction from the NetMHCpan tool. MHCVision performs parameter estimation using a modified expectation maximization framework for a two-component beta mixture model, representing the distribution of true and false scores of the predicted dataset. We can then estimate the PEP of an individual peptide's predicted score, and conversely the probability that it is true. We demonstrate that the use of global FDR and PEP estimation can provide a better trade-off between sensitivity and precision over using currently recommended thresholds from tools. AVAILABILITY AND IMPLEMENTATION: https://github.com/PGB-LIV/MHCVision. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.

Assuntos

Algoritmos , Peptídeos , Humanos , Ligação Proteica , Peptídeos/química , Probabilidade

20.

A Proteome-Wide Immunoinformatics Tool to Accelerate T-Cell Epitope Discovery and Vaccine Design in the Context of Emerging Infectious Diseases: An Ethnicity-Oriented Approach.

Oyarzun, Patricio; Kashyap, Manju; Fica, Victor; Salas-Burgos, Alexis; Gonzalez-Galarza, Faviel F; McCabe, Antony; Jones, Andrew R; Middleton, Derek; Kobe, Bostjan.

Front Immunol ; 12: 598778, 2021.

Artigo em Inglês | MEDLINE | ID: mdl-33717077

RESUMO

Emerging infectious diseases (EIDs) caused by viruses are increasing in frequency, causing a high disease burden and mortality world-wide. The COVID-19 pandemic caused by the novel SARS-like coronavirus (SARS-CoV-2) underscores the need to innovate and accelerate the development of effective vaccination strategies against EIDs. Human leukocyte antigen (HLA) molecules play a central role in the immune system by determining the peptide repertoire displayed to the T-cell compartment. Genetic polymorphisms of the HLA system thus confer a strong variability in vaccine-induced immune responses and may complicate the selection of vaccine candidates, because the distribution and frequencies of HLA alleles are highly variable among different ethnic groups. Herein, we build on the emerging paradigm of rational epitope-based vaccine design, by describing an immunoinformatics tool (Predivac-3.0) for proteome-wide T-cell epitope discovery that accounts for ethnic-level variations in immune responsiveness. Predivac-3.0 implements both CD8+ and CD4+ T-cell epitope predictions based on HLA allele frequencies retrieved from the Allele Frequency Net Database. The tool was thoroughly assessed, proving comparable performances (AUC ~0.9) against four state-of-the-art pan-specific immunoinformatics methods capable of population-level analysis (NetMHCPan-4.0, Pickpocket, PSSMHCPan and SMM), as well as a strong accuracy on proteome-wide T-cell epitope predictions for HIV-specific immune responses in the Japanese population. The utility of the method was investigated for the COVID-19 pandemic, by performing in silico T-cell epitope mapping of the SARS-CoV-2 spike glycoprotein according to the ethnic context of the countries where the ChAdOx1 vaccine is currently initiating phase III clinical trials. Potentially immunodominant CD8+ and CD4+ T-cell epitopes and population coverages were predicted for each population (the Epitope Discovery mode), along with optimized sets of broadly recognized (promiscuous) T-cell epitopes maximizing coverage in the target populations (the Epitope Optimization mode). Population-specific epitope-rich regions (T-cell epitope clusters) were further predicted in protein antigens based on combined criteria of epitope density and population coverage. Overall, we conclude that Predivac-3.0 holds potential to contribute in the understanding of ethnic-level variations of vaccine-induced immune responsiveness and to guide the development of epitope-based next-generation vaccines against emerging pathogens, whose geographic distributions and populations in need of vaccinations are often well-defined for regional epidemics.

Assuntos

Linfócitos T CD4-Positivos/imunologia , Linfócitos T CD8-Positivos/imunologia , COVID-19/imunologia , Epitopos de Linfócito T/metabolismo , Etnicidade , Antígenos HLA/metabolismo , Proteômica/métodos , SARS-CoV-2/fisiologia , Glicoproteína da Espícula de Coronavírus/metabolismo , COVID-19/epidemiologia , Vacinas contra COVID-19 , Doenças Transmissíveis Emergentes , Epitopos de Linfócito T/genética , Antígenos HLA/genética , Humanos , Imunogenicidade da Vacina , Aplicações da Informática Médica , Pandemias/prevenção & controle , Polimorfismo Genético , Ligação Proteica , Software , Glicoproteína da Espícula de Coronavírus/genética

RESUMO

RESUMO

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

RESUMO

Assuntos

RESUMO

Assuntos

RESUMO

Assuntos

ENVIAR RESULTADO:

SELEÇÃO DE REFERÊNCIAS

DETALHE DA PESQUISA