RESUMO
The inherent diversity of approaches in proteomics research has led to a wide range of software solutions for data analysis. These software solutions encompass multiple tools, each employing different algorithms for various tasks such as peptide-spectrum matching, protein inference, quantification, statistical analysis, and visualization. To enable an unbiased comparison of commonly used bottom-up label-free proteomics workflows, we introduce WOMBAT-P, a versatile platform designed for automated benchmarking and comparison. WOMBAT-P simplifies the processing of public data by utilizing the sample and data relationship format for proteomics (SDRF-Proteomics) as input. This feature streamlines the analysis of annotated local or public ProteomeXchange data sets, promoting efficient comparisons among diverse outputs. Through an evaluation using experimental ground truth data and a realistic biological data set, we uncover significant disparities and a limited overlap in the quantified proteins. WOMBAT-P not only enables rapid execution and seamless comparison of workflows but also provides valuable insights into the capabilities of different software solutions. These benchmarking metrics are a valuable resource for researchers in selecting the most suitable workflow for their specific data sets. The modular architecture of WOMBAT-P promotes extensibility and customization. The software is available at https://github.com/wombat-p/WOMBAT-Pipelines.
Assuntos
Benchmarking , Proteômica , Fluxo de Trabalho , Software , Proteínas , Análise de DadosRESUMO
In recent years machine learning has made extensive progress in modeling many aspects of mass spectrometry data. We brought together proteomics data generators, repository managers, and machine learning experts in a workshop with the goals to evaluate and explore machine learning applications for realistic modeling of data from multidimensional mass spectrometry-based proteomics analysis of any sample or organism. Following this sample-to-data roadmap helped identify knowledge gaps and define needs. Being able to generate bespoke and realistic synthetic data has legitimate and important uses in system suitability, method development, and algorithm benchmarking, while also posing critical ethical questions. The interdisciplinary nature of the workshop informed discussions of what is currently possible and future opportunities and challenges. In the following perspective we summarize these discussions in the hope of conveying our excitement about the potential of machine learning in proteomics and to inspire future research.
Assuntos
Aprendizado de Máquina , Proteômica , Proteômica/métodos , Algoritmos , Espectrometria de MassasRESUMO
This article describes some use case studies and self-assessments of FAIR status of de.NBI services to illustrate the challenges and requirements for the definition of the needs of adhering to the FAIR (findable, accessible, interoperable and reusable) data principles in a large distributed bioinformatics infrastructure. We address the challenge of heterogeneity of wet lab technologies, data, metadata, software, computational workflows and the levels of implementation and monitoring of FAIR principles within the different bioinformatics sub-disciplines joint in de.NBI. On the one hand, this broad service landscape and the excellent network of experts are a strong basis for the development of useful research data management plans. On the other hand, the large number of tools and techniques maintained by distributed teams renders FAIR compliance challenging.
Assuntos
Gerenciamento de Dados/métodos , Metadados , Redes Neurais de Computação , Proteômica/métodos , Software , Genoma Humano , Sequenciamento de Nucleotídeos em Larga Escala , Humanos , Cooperação Internacional , Fenótipo , Plantas/genética , Proteoma , Autoavaliação (Psicologia) , Fluxo de TrabalhoRESUMO
AIMS: Target skeletal muscle fibres - defined by different concentric areas in oxidative enzyme staining - can occur in patients with neurogenic muscular atrophy. Here, we used our established hypothesis-free proteomic approach with the aim of deciphering the protein composition of targets. We also searched for potential novel interactions between target proteins. METHODS: Targets and control areas were laser microdissected from skeletal muscle sections of 20 patients with neurogenic muscular atrophy. Samples were analysed by a highly sensitive mass spectrometry approach, enabling relative protein quantification. The results were validated by immunofluorescence studies. Protein interactions were investigated by yeast two-hybrid assays, coimmunoprecipitation experiments and bimolecular fluorescence complementation. RESULTS: More than 1000 proteins were identified. Among these, 55 proteins were significantly over-represented and 40 proteins were significantly under-represented in targets compared to intraindividual control samples. The majority of over-represented proteins were associated with the myofibrillar Z-disc and actin dynamics, followed by myosin and myosin-associated proteins, proteins involved in protein biosynthesis and chaperones. Under-represented proteins were mainly mitochondrial proteins. Functional studies revealed that the LIM domain of the over-represented protein LIMCH1 interacts with isoform A of Xin actin-binding repeat-containing protein 1 (XinA). CONCLUSIONS: In particular, proteins involved in myofibrillogenesis are over-represented in target structures, which indicate an ongoing process of sarcomere assembly and/or remodelling within this specific area of the muscle fibres. We speculate that target structures are the result of reinnervation processes in which filamin C-associated myofibrillogenesis is tightly regulated by the BAG3-associated protein quality system.
Assuntos
Doenças do Sistema Nervoso Periférico , Humanos , Doenças do Sistema Nervoso Periférico/metabolismo , Actinas/análise , Actinas/metabolismo , Proteômica , Proteínas Musculares/metabolismo , Fibras Musculares Esqueléticas/química , Fibras Musculares Esqueléticas/metabolismo , Músculo Esquelético/metabolismo , Atrofia Muscular/metabolismo , Proteínas Adaptadoras de Transdução de Sinal/metabolismo , Proteínas Reguladoras de Apoptose/análise , Proteínas Reguladoras de Apoptose/metabolismoRESUMO
Currently data-dependent acquisition (DDA) is the method of choice for mass spectrometry-based proteomics discovery experiments, but data-independent acquisition (DIA) is steadily becoming more important. One of the most important requirements to perform a DIA analysis is the availability of suitable spectral libraries for peptide identification and quantification. Several studies were performed addressing the evaluation of spectral library performance for protein identification in DIA measurements. But so far only few experiments estimate the effect of these libraries on the quantitative level.In this work we created a gold standard spike-in sample set with known contents and ratios of proteins in a complex protein matrix that allowed a detailed comparison of DIA quantification data obtained with different spectral library approaches. We used in-house generated sample-specific spectral libraries created using varying sample preparation approaches and repeated DDA measurement. In addition, two different search engines were tested for protein identification from DDA data and subsequent library generation. In total, eight different spectral libraries were generated, and the quantification results compared with a library free method, as well as a default DDA analysis. Not only the number of identifications on peptide and protein level in the spectral libraries and the corresponding DIA analysis results was inspected, but also the number of expected and identified differentially abundant protein groups and their ratios.We found, that while libraries of prefractionated samples were generally larger, there was no significant increase in DIA identifications compared with repetitive non-fractionated measurements. Furthermore, we show that the accuracy of the quantification is strongly dependent on the applied spectral library and whether the quantification is based on peptide or protein level. Overall, the reproducibility and accuracy of DIA quantification is superior to DDA in all applied approaches.Data has been deposited to the ProteomeXchange repository with identifiers PXD012986, PXD012987, PXD012988 and PXD014956.
Assuntos
Confiabilidade dos Dados , Biblioteca de Peptídeos , Proteoma/análise , Proteômica/métodos , Animais , Linhagem Celular , Cromatografia Líquida/métodos , Bases de Dados de Proteínas , Camundongos , Mioblastos/metabolismo , Peptídeos/análise , Proteínas/análise , Reprodutibilidade dos Testes , Sensibilidade e Especificidade , Análise de Sequência de Proteína , Software , Espectrometria de Massas em Tandem/métodosRESUMO
Chronic obstructive pulmonary disease (COPD) is a major risk factor for the development of lung adenocarcinoma (AC). AC often develops on underlying COPD; thus, the differentiation of both entities by biomarker is challenging. Although survival of AC patients strongly depends on early diagnosis, a biomarker panel for AC detection and differentiation from COPD is still missing. Plasma samples from 176 patients with AC with or without underlying COPD, COPD patients, and hospital controls were analyzed using mass-spectrometry-based proteomics. We performed univariate statistics and additionally evaluated machine learning algorithms regarding the differentiation of AC vs. COPD and AC with COPD vs. COPD. Univariate statistics revealed significantly regulated proteins that were significantly regulated between the patient groups. Furthermore, random forest classification yielded the best performance for differentiation of AC vs. COPD (area under the curve (AUC) 0.935) and AC with COPD vs. COPD (AUC 0.916). The most influential proteins were identified by permutation feature importance and compared to those identified by univariate testing. We demonstrate the great potential of machine learning for differentiation of highly similar disease entities and present a panel of biomarker candidates that should be considered for the development of a future biomarker panel.
Assuntos
Adenocarcinoma de Pulmão , Neoplasias Pulmonares , Doença Pulmonar Obstrutiva Crônica , Biomarcadores , Humanos , Neoplasias Pulmonares/diagnóstico , Neoplasias Pulmonares/patologia , Proteômica , Doença Pulmonar Obstrutiva Crônica/patologiaRESUMO
Protein sequence databases play a crucial role in the majority of the currently applied mass-spectrometry-based proteomics workflows. Here UniProtKB serves as one of the major sources, as it combines the information of several smaller databases and enriches the entries with additional biological information. For the identification of peptides in a sample by tandem mass spectra, as generated by data-dependent acquisition, protein sequence databases provide the basis for most spectrum identification search engines. In addition, for targeted proteomics approaches like selected reaction monitoring (SRM) and parallel reaction monitoring (PRM), knowledge of the peptide sequences, their masses, and whether they are unique for a protein is essential. Because most bottom-up proteomics approaches use trypsin to cleave the proteins in a sample, the tryptic peptides contained in a protein database are of great interest. We present a database, called MaCPepDB (mass-centric peptide database), that consists of the complete tryptic digest of the Swiss-Prot and TrEMBL parts of UniProtKB. This database is especially designed to not only allow queries of peptide sequences and return the respective information about connected proteins and thus whether a peptide is unique but also allow queries of specific masses of peptides or precursors of MS/MS spectra. Furthermore, posttranslational modifications can be considered in a query as well as different mass deviations for posttranslational modifications. Hence the database can be used by a sequence query not only to, for example, check in which proteins of the UniProt database a tryptic peptide can be found but also to find possibly interfering peptides in PRM/SRM experiments using the mass query. The complete database contains currently 5â¯939â¯244â¯990 peptides from 185â¯561â¯610 proteins (UniProt version 2020_03), for which a single query usually takes less than 1 s. For easy exploration of the data, a web interface was developed. A REST application programming interface (API) for programmatic and workflow access is also available at https://macpepdb.mpc.rub.de.
Assuntos
Peptídeos , Espectrometria de Massas em Tandem , Bases de Dados de Proteínas , Proteínas , ProteômicaRESUMO
The European Bioinformatics Community for Mass Spectrometry (EuBIC-MS; eubic-ms.org) was founded in 2014 to unite European computational mass spectrometry researchers and proteomics bioinformaticians working in academia and industry. EuBIC-MS maintains educational resources (proteomics-academy.org) and organises workshops at national and international conferences on proteomics and mass spectrometry. Furthermore, EuBIC-MS is actively involved in several community initiatives such as the Human Proteome Organization's Proteomics Standards Initiative (HUPO-PSI). Apart from these collaborations, EuBIC-MS has organised two Winter Schools and two Developers' Meetings that have contributed to the strengthening of the European mass spectrometry network and fostered international collaboration in this field, even beyond Europe. Moreover, EuBIC-MS is currently actively developing a community-driven standard dedicated to mass spectrometry data annotation (SDRF-Proteomics) that will facilitate data reuse and collaboration. This manuscript highlights what EuBIC-MS is, what it does, and what it already has achieved. A warm invitation is extended to new researchers at all career stages to join the EuBIC-MS community on its Slack channel (eubic.slack.com).
RESUMO
The PRoteomics IDEntifications (PRIDE) database (https://www.ebi.ac.uk/pride/) is the world's largest data repository of mass spectrometry-based proteomics data, and is one of the founding members of the global ProteomeXchange (PX) consortium. In this manuscript, we summarize the developments in PRIDE resources and related tools since the previous update manuscript was published in Nucleic Acids Research in 2016. In the last 3 years, public data sharing through PRIDE (as part of PX) has definitely become the norm in the field. In parallel, data re-use of public proteomics data has increased enormously, with multiple applications. We first describe the new architecture of PRIDE Archive, the archival component of PRIDE. PRIDE Archive and the related data submission framework have been further developed to support the increase in submitted data volumes and additional data types. A new scalable and fault tolerant storage backend, Application Programming Interface and web interface have been implemented, as a part of an ongoing process. Additionally, we emphasize the improved support for quantitative proteomics data through the mzTab format. At last, we outline key statistics on the current data contents and volume of downloads, and how PRIDE data are starting to be disseminated to added-value resources including Ensembl, UniProt and Expression Atlas.
Assuntos
Bases de Dados de Proteínas , Espectrometria de Massas , Proteômica , Peptídeos/química , SoftwareRESUMO
Proteomics using LC-MS/MS has become one of the main methods to analyze the proteins in biological samples in high-throughput. But the existing mass-spectrometry instruments are still limited with respect to resolution and measurable mass ranges, which is one of the main reasons why shotgun proteomics is the major approach. Here proteins are digested, which leads to the identification and quantification of peptides instead. While often neglected, the important step of protein inference needs to be conducted to infer from the identified peptides to the actual proteins in the original sample. In this work, we highlight some of the previously published and newly added features of the tool PIA - Protein Inference Algorithms, which helps the user with the protein inference of measured samples. We also highlight the importance of the usage of PSI standard file formats, as PIA is the only current software supporting all available standards used for spectrum identification and protein inference. Additionally, we briefly describe the benefits of working with workflow environments for proteomics analyses and show the new features of the PIA nodes for the KNIME Analytics Platform. Finally, we benchmark PIA against a recently published data set for isoform detection. PIA is open source and available for download on GitHub ( https://github.com/mpc-bioinformatics/pia ) or directly via the community extensions inside the KNIME analytics platform.
Assuntos
Biologia Computacional/métodos , Peptídeos/análise , Proteômica/métodos , Software , Fluxo de Trabalho , Algoritmos , Benchmarking , Cromatografia Líquida , Isoformas de Proteínas , Espectrometria de Massas em TandemRESUMO
The first stable version of the Proteomics Standards Initiative mzIdentML open data standard (version 1.1) was published in 2012-capturing the outputs of peptide and protein identification software. In the intervening years, the standard has become well-supported in both commercial and open software, as well as a submission and download format for public repositories. Here we report a new release of mzIdentML (version 1.2) that is required to keep pace with emerging practice in proteome informatics. New features have been added to support: (1) scores associated with localization of modifications on peptides; (2) statistics performed at the level of peptides; (3) identification of cross-linked peptides; and (4) support for proteogenomics approaches. In addition, there is now improved support for the encoding of de novo sequencing of peptides, spectral library searches, and protein inference. As a key point, the underlying XML schema has only undergone very minor modifications to simplify as much as possible the transition from version 1.1 to version 1.2 for implementers, but there have been several notable updates to the format specification, implementation guidelines, controlled vocabularies and validation software. mzIdentML 1.2 can be described as backwards compatible, in that reading software designed for mzIdentML 1.1 should function in most cases without adaptation. We anticipate that these developments will provide a continued stable base for software teams working to implement the standard. All the related documentation is accessible at http://www.psidev.info/mzidentml.
Assuntos
Biologia Computacional/normas , Proteômica/normas , Bases de Dados de Proteínas , SoftwareRESUMO
Cerebrospinal fluid (CSF) is in direct contact with the brain and serves as a valuable specimen to examine diseases of the central nervous system through analyzing its components. These include the analysis of metabolites, cells as well as proteins. For identifying new suitable diagnostic protein biomarkers bottom-up data-dependent acquisition (DDA) mass spectrometry-based approaches are most popular. Drawbacks of this method are stochastic and irreproducible precursor ion selection. Recently, data-independent acquisition (DIA) emerged as an alternative method. It overcomes several limitations of DDA, since it combines the benefits of DDA and targeted methods like selected reaction monitoring (SRM). We established a DIA method for in-depth proteome analysis of CSF. For this, four spectral libraries were generated with samples from native CSF ( n = 5), CSF fractionation (15 in total) and substantia nigra fractionation (54 in total) and applied to three CSF DIA replicates. The DDA and DIA methods for CSF were conducted with the same nanoLC parameters using a 180 min gradient. Compared to a conventional DDA method, our DIA approach increased the number of identified protein groups from 648 identifications in DDA to 1574 in DIA using a comprehensive spectral library generated with DDA measurements from five native CSF and 54 substantia nigra fractions. We also could show that a sample specific spectral library generated from native CSF only increased the identification reproducibility from three DIA replicates to 90% (77% with a DDA method). Moreover, by utilizing a substantia nigra specific spectral library for CSF DIA, over 60 brain-originated proteins could be identified compared to only 11 with DDA. In conclusion, the here presented optimized DIA method substantially outperforms DDA and could develop into a powerful tool for biomarker discovery in CSF. Data are available via ProteomeXchange with the identifiers PXD010698, PXD010708, PXD010690, PXD010705, and PXD009624.
Assuntos
Hidrocefalia/líquido cefalorraquidiano , Espectrometria de Massas/métodos , Proteoma/metabolismo , Proteômica/métodos , Biomarcadores/líquido cefalorraquidiano , Biomarcadores/metabolismo , Humanos , Reprodutibilidade dos Testes , Substância Negra/metabolismoRESUMO
MOTIVATION: BioContainers (biocontainers.pro) is an open-source and community-driven framework which provides platform independent executable environments for bioinformatics software. BioContainers allows labs of all sizes to easily install bioinformatics software, maintain multiple versions of the same software and combine tools into powerful analysis pipelines. BioContainers is based on popular open-source projects Docker and rkt frameworks, that allow software to be installed and executed under an isolated and controlled environment. Also, it provides infrastructure and basic guidelines to create, manage and distribute bioinformatics containers with a special focus on omics technologies. These containers can be integrated into more comprehensive bioinformatics pipelines and different architectures (local desktop, cloud environments or HPC clusters). AVAILABILITY AND IMPLEMENTATION: The software is freely available at github.com/BioContainers/. CONTACT: yperez@ebi.ac.uk.
Assuntos
Biologia Computacional/métodos , Software , Genômica/métodos , Metabolômica/métodos , Proteômica/métodosRESUMO
OBJECTIVE: Sporadic inclusion body myositis (sIBM) pathogenesis is unknown; however, rimmed vacuoles (RVs) are a constant feature. We propose to identify proteins that accumulate within RVs. METHODS: RVs and intact myofibers were laser microdissected from skeletal muscle of 18 sIBM patients and analyzed by a sensitive mass spectrometry approach using label-free spectral count-based relative protein quantification. Whole exome sequencing was performed on 62 sIBM patients. Immunofluorescence was performed on patient and mouse skeletal muscle. RESULTS: A total of 213 proteins were enriched by >1.5 -fold in RVs compared to controls and included proteins previously reported to accumulate in sIBM tissue or when mutated cause myopathies with RVs. Proteins associated with protein folding and autophagy were the largest group represented. One autophagic adaptor protein not previously identified in sIBM was FYCO1. Rare missense coding FYCO1 variants were present in 11.3% of sIBM patients compared with 2.6% of controls (p = 0.003). FYCO1 colocalized at RVs with autophagic proteins such as MAP1LC3 and SQSTM1 in sIBM and other RV myopathies. One FYCO1 variant protein had reduced colocalization with MAP1LC3 when expressed in mouse muscle. INTERPRETATION: This study used an unbiased proteomic approach to identify RV proteins in sIBM that included a novel protein involved in sIBM pathogenesis. FYCO1 accumulates at RVs, and rare missense variants in FYCO1 are overrepresented in sIBM patients. These FYCO1 variants may impair autophagic function, leading to RV formation in sIBM patient muscle. FYCO1 functionally connects autophagic and endocytic pathways, supporting the hypothesis that impaired endolysosomal degradation underlies the pathogenesis of sIBM. Ann Neurol 2017;81:227-239.
Assuntos
Proteínas de Ligação a DNA/metabolismo , Músculo Esquelético/metabolismo , Miosite de Corpos de Inclusão/metabolismo , Proteômica/métodos , Fatores de Transcrição/metabolismo , Vacúolos/metabolismo , Idoso , Idoso de 80 Anos ou mais , Alelos , Animais , Proteínas de Ligação a DNA/genética , Feminino , Humanos , Masculino , Camundongos , Proteínas Associadas aos Microtúbulos , Pessoa de Meia-Idade , Miosite de Corpos de Inclusão/genética , Risco , Fatores de Transcrição/genéticaRESUMO
The original PRIDE Inspector tool was developed as an open source standalone tool to enable the visualization and validation of mass-spectrometry (MS)-based proteomics data before data submission or already publicly available in the Proteomics Identifications (PRIDE) database. The initial implementation of the tool focused on visualizing PRIDE data by supporting the PRIDE XML format and a direct access to private (password protected) and public experiments in PRIDE.The ProteomeXchange (PX) Consortium has been set up to enable a better integration of existing public proteomics repositories, maximizing its benefit to the scientific community through the implementation of standard submission and dissemination pipelines. Within the Consortium, PRIDE is focused on supporting submissions of tandem MS data. The increasing use and popularity of the new Proteomics Standards Initiative (PSI) data standards such as mzIdentML and mzTab, and the diversity of workflows supported by the PX resources, prompted us to design and implement a new suite of algorithms and libraries that would build upon the success of the original PRIDE Inspector and would enable users to visualize and validate PX "complete" submissions. The PRIDE Inspector Toolsuite supports the handling and visualization of different experimental output files, ranging from spectra (mzML, mzXML, and the most popular peak lists formats) and peptide and protein identification results (mzIdentML, PRIDE XML, mzTab) to quantification data (mzTab, PRIDE XML), using a modular and extensible set of open-source, cross-platform libraries. We believe that the PRIDE Inspector Toolsuite represents a milestone in the visualization and quality assessment of proteomics data. It is freely available at http://github.com/PRIDE-Toolsuite/.
Assuntos
Biologia Computacional/métodos , Bases de Dados de Proteínas , Proteoma/metabolismo , Proteômica/métodos , Software , Internet , Reprodutibilidade dos Testes , Espectrometria de Massas em TandemRESUMO
UNLABELLED: The ms-data-core-api is a free, open-source library for developing computational proteomics tools and pipelines. The Application Programming Interface, written in Java, enables rapid tool creation by providing a robust, pluggable programming interface and common data model. The data model is based on controlled vocabularies/ontologies and captures the whole range of data types included in common proteomics experimental workflows, going from spectra to peptide/protein identifications to quantitative results. The library contains readers for three of the most used Proteomics Standards Initiative standard file formats: mzML, mzIdentML, and mzTab. In addition to mzML, it also supports other common mass spectra data formats: dta, ms2, mgf, pkl, apl (text-based), mzXML and mzData (XML-based). Also, it can be used to read PRIDE XML, the original format used by the PRIDE database, one of the world-leading proteomics resources. Finally, we present a set of algorithms and tools whose implementation illustrates the simplicity of developing applications using the library. AVAILABILITY AND IMPLEMENTATION: The software is freely available at https://github.com/PRIDE-Utilities/ms-data-core-api. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online CONTACT: juan@ebi.ac.uk.
Assuntos
Algoritmos , Biologia Computacional/métodos , Bases de Dados de Proteínas , Espectrometria de Massas/métodos , Proteínas/análise , Proteômica/métodos , Software , Humanos , Fragmentos de Peptídeos/análise , Fluxo de TrabalhoRESUMO
FE65 is a cytosolic adapter protein and an important binding partner of amyloid precursor protein. Dependent on Thr668 phosphorylation in amyloid precursor protein, which influences amyloidogenic amyloid precursor protein processing, FE65 undergoes nuclear translocation, thereby transmitting a signal from the cell membrane to the nucleus. As this translocation may be relevant in Alzheimer disease, and as FE65 consists of three protein-protein interaction domains able to bind and affect a variety of other proteins and downstream signaling pathways, the identification of the FE65 interactome is of central interest in Alzheimer disease research. In this study, we identified 121 proteins as new potential FE65 interacting proteins in a pulldown/mass spectrometry approach using human post-mortem brain samples as protein pools for recombinantly expressed FE65. Co-immunoprecipitation assays further validated the interaction of FE65 with the candidates SV2A and SERCA2. In parallel, we investigated the whole cell proteome of primary hippocampal neurons from FE65/FE65L1 double knockout mice. Notably, the validated FE65 binding proteins were also found to be differentially abundant in neurons derived from the FE65 knockout mice relative to wild-type control neurons. SERCA2 is an important player in cellular calcium homeostasis, which was found to be up-regulated in double knockout neurons. Indeed, knock-down of FE65 in HEK293T cells also evoked an elevated sensitivity to thapsigargin, a stressor specifically targeting the activity of SERCA2. Thus, our results suggest that FE65 is involved in the regulation of intracellular calcium homeostasis. Whereas transfection of FE65 alone caused a typical dot-like phenotype in the nucleus, co-transfection of SV2A significantly reduced the percentage of FE65 dot-positive cells, pointing to a possible role for SV2A in the modulation of FE65 intracellular targeting. Given that SV2A has a signaling function at the presynapse, its effect on FE65 intracellular localization suggests that the SV2A/FE65 interaction might play a role in synaptic signal transduction.
Assuntos
Encéfalo/metabolismo , Glicoproteínas de Membrana/metabolismo , Proteínas do Tecido Nervoso/metabolismo , Proteínas Nucleares/metabolismo , Mapas de Interação de Proteínas , Animais , Encéfalo/patologia , Células Cultivadas , Embrião de Mamíferos , Células HEK293 , Humanos , Imunoprecipitação , Glicoproteínas de Membrana/genética , Glicoproteínas de Membrana/isolamento & purificação , Camundongos , Camundongos Knockout , Proteínas do Tecido Nervoso/genética , Proteínas do Tecido Nervoso/isolamento & purificação , Neurônios/metabolismo , Neurônios/patologia , Proteínas Nucleares/genética , Ligação Proteica , Mapas de Interação de Proteínas/genética , Sinapses/genética , Sinapses/metabolismoRESUMO
Protein inference connects the peptide spectrum matches (PSMs) obtained from database search engines back to proteins, which are typically at the heart of most proteomics studies. Different search engines yield different PSMs and thus different protein lists. Analysis of results from one or multiple search engines is often hampered by different data exchange formats and lack of convenient and intuitive user interfaces. We present PIA, a flexible software suite for combining PSMs from different search engine runs and turning these into consistent results. PIA can be integrated into proteomics data analysis workflows in several ways. A user-friendly graphical user interface can be run either locally or (e.g., for larger core facilities) from a central server. For automated data processing, stand-alone tools are available. PIA implements several established protein inference algorithms and can combine results from different search engines seamlessly. On several benchmark data sets, we show that PIA can identify a larger number of proteins at the same protein FDR when compared to that using inference based on a single search engine. PIA supports the majority of established search engines and data in the mzIdentML standard format. It is implemented in Java and freely available at https://github.com/mpc-bioinformatics/pia.
Assuntos
Bases de Dados de Proteínas , Internet , Proteínas/química , Interface Usuário-Computador , AlgoritmosRESUMO
The intracellular domain of the amyloid precursor protein (AICD) is generated following cleavage of the precursor by the γ-secretase complex and is involved in membrane to nucleus signaling, for which the binding of AICD to the adapter protein FE65 is essential. Here we show that FE65 knockdown causes a downregulation of the protein Bloom syndrome protein (BLM) and the minichromosome maintenance (MCM) protein family and that elevated nuclear levels of FE65 result in stabilization of the BLM protein in nuclear mobile spheres. These spheres are able to grow and fuse, and potentially correspond to the nuclear domain 10. BLM plays a role in DNA replication and repair mechanisms and FE65 was also shown to play a role in DNA damage response in the cell. A set of proliferation assays in our work revealed that FE65 knockdown in HEK293T cells reduced cell replication. On the basis of these results, we hypothesize that nuclear FE65 levels (nuclear FE65/BLM containing spheres) may regulate cell cycle re-entry in neurons as a result of increased interaction of FE65 with BLM and/or an increase in MCM protein levels. Thus, FE65 interactions with BLM and MCM proteins may contribute to the neuronal cell cycle re-entry observed in brains affected by Alzheimer's disease.