Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 76
Filtrar
1.
PLoS One ; 19(3): e0300914, 2024.
Artigo em Inglês | MEDLINE | ID: mdl-38527011

RESUMO

BACKGROUND: Multiple sclerosis is an inflammatory and degenerative disease of the central nervous system leading to demyelination and axonal loss. Relapsing-remitting multiple sclerosis (RRMS) is commonly treated by anti-inflammatory drugs, where one of the most effective drugs to date is the monoclonal antibody natalizumab. METHODS: The cerebrospinal fluid (CSF) proteome was analyzed in 56 patients with RRMS before and after natalizumab treatment, using label-free mass spectrometry and a subset of the changed proteins were verified by parallel reaction monitoring in a new cohort of 20 patients, confirming the majority of observed changes. RESULTS: A total of 287 differentially abundant proteins were detected including (i) the decrease of proteins with roles in immunity, such as immunoglobulin heavy constant mu, chitinase-3-like protein 1 and chitotriosidase, (ii) an increase of proteins involved in metabolism, such as lactate dehydrogenase A and B and malate-dehydrogenase cytoplasmic, and (iii) an increase of proteins associated with the central nervous system, including lactadherin and amyloid precursor protein. Comparison with the CSF-PR database provided evidence that natalizumab counters protein changes commonly observed in RRMS. Furthermore, vitamin-D binding protein and apolipoprotein 1 and 2 were unchanged during treatment with natalizumab, implying that these may be involved in disease activity unaffected by natalizumab. CONCLUSIONS: Our study revealed that some of the previously suggested biomarkers for MS were affected by the natalizumab treatment while others were not. Proteins not previously suggested as biomarkers were also found affected by the treatment. In sum, the results provide new information on how the natalizumab treatment impacts the CSF proteome of MS patients, and points towards processes affected by the treatment. These findings ought to be explored further to disclose potential novel disease mechanisms and predict treatment responses.


Assuntos
Esclerose Múltipla Recidivante-Remitente , Esclerose Múltipla , Humanos , Esclerose Múltipla/tratamento farmacológico , Esclerose Múltipla/líquido cefalorraquidiano , Natalizumab/uso terapêutico , Proteoma , Esclerose Múltipla Recidivante-Remitente/tratamento farmacológico , Esclerose Múltipla Recidivante-Remitente/líquido cefalorraquidiano , Anti-Inflamatórios/uso terapêutico , Biomarcadores/metabolismo , Fatores Imunológicos/uso terapêutico
2.
Bioinformatics ; 39(10)2023 Oct 03.
Artigo em Inglês | MEDLINE | ID: mdl-37756698

RESUMO

MOTIVATION: Biological network analysis for high-throughput biomedical data interpretation relies heavily on topological characteristics. Networks are commonly composed of nodes representing genes or proteins that are connected by edges when interacting. In this study, we use the rich information available in the Reactome pathway database to build biological networks accounting for small molecules and proteoforms modeled using protein isoforms and post-translational modifications to study the topological changes induced by this refinement of the network representation. RESULTS: We find that improving the interactome modeling increases the number of nodes and interactions, but that isoform and post-translational modification annotation is still limited compared to what can be expected biologically. We also note that small molecule information can distort the topology of the network due to the high connectedness of these molecules, which does not necessarily represent the reality of biology. However, by restricting the connections of small molecules to the context of biochemical reactions, we find that these improve the overall connectedness of the network and reduce the prevalence of isolated components and nodes. Overall, changing the representation of the network alters the prevalence of articulation points and bridges globally but also within and across pathways. Hence, some molecules can gain or lose in biological importance depending on the level of detail of the representation of the biological system, which might in turn impact network-based studies of diseases or druggability. AVAILABILITY AND IMPLEMENTATION: Networks are constructed based on data publicly available in the Reactome Pathway knowledgebase: reactome.org.

3.
Biostatistics ; 24(4): 1031-1044, 2023 10 18.
Artigo em Inglês | MEDLINE | ID: mdl-35536588

RESUMO

Experimental design usually focuses on the setting where treatments and/or other aspects of interest can be manipulated. However, in observational biomedical studies with sequential processing, the set of available samples is often fixed, and the problem is thus rather the ordering and allocation of samples to batches such that comparisons between different treatments can be made with similar precision. In certain situations, this allocation can be done by hand, but this rapidly becomes impractical with more challenging cohort setups. Here, we present a fast and intuitive algorithm to generate balanced allocations of samples to batches for any single-variable model where the treatment variable is nominal. This greatly simplifies the grouping of samples into batches, makes the process reproducible, and provides a marked improvement over completely random allocations. The general challenges of allocation and why good solutions can be hard to find are also discussed, as well as potential extensions to multivariable settings.


Assuntos
Algoritmos , Estudos Observacionais como Assunto , Humanos , Projetos de Pesquisa
4.
J Proteome Res ; 20(12): 5419-5423, 2021 12 03.
Artigo em Inglês | MEDLINE | ID: mdl-34709836

RESUMO

Mass spectrometry-based proteomics is a high-throughput technology generating ever-larger amounts of data per project. However, storing, processing, and interpreting these data can be a challenge. A key element in simplifying this process is the development of interactive frameworks focusing on visualization that can greatly simplify both the interpretation of data and the generation of new knowledge. Here we present PeptideShaker Online, a user-friendly web-based framework for the identification of mass spectrometry-based proteomics data, from raw file conversion to interactive visualization of the resulting data. Storage and processing of the data are performed via the versatile Galaxy platform (through SearchGUI, PeptideShaker, and moFF), while the interaction with the results happens via a locally installed web server, thus enabling researchers to process and interpret their own data without requiring advanced bioinformatics skills or direct access to compute-intensive infrastructures. The source code, additional documentation, and a fully functional demo is available at https://github.com/barsnes-group/peptide-shaker-online.


Assuntos
Proteômica , Software , Biologia Computacional/métodos , Internet , Espectrometria de Massas , Proteômica/métodos
5.
Sci Rep ; 11(1): 7174, 2021 03 30.
Artigo em Inglês | MEDLINE | ID: mdl-33785790

RESUMO

Two pathophysiological different experimental models for multiple sclerosis were analyzed in parallel using quantitative proteomics in attempts to discover protein alterations applicable as diagnostic-, prognostic-, or treatment targets in human disease. The cuprizone model reflects de- and remyelination in multiple sclerosis, and the experimental autoimmune encephalomyelitis (EAE, MOG1-125) immune-mediated events. The frontal cortex, peripheral to severely inflicted areas in the CNS, was dissected and analyzed. The frontal cortex had previously not been characterized by proteomics at different disease stages, and novel protein alterations involved in protecting healthy tissue and assisting repair of inflicted areas might be discovered. Using TMT-labelling and mass spectrometry, 1871 of the proteins quantified overlapped between the two experimental models, and the fold change compared to controls was verified using label-free proteomics. Few similarities in frontal cortex between the two disease models were observed when regulated proteins and signaling pathways were compared. Legumain and C1Q complement proteins were among the most upregulated proteins in cuprizone and hemopexin in the EAE model. Immunohistochemistry showed that legumain expression in post-mortem multiple sclerosis brain tissue (n = 19) was significantly higher in the center and at the edge of white matter active and chronic active lesions. Legumain was associated with increased lesion activity and might be valuable as a drug target using specific inhibitors as already suggested for Parkinson's and Alzheimer's disease. Cerebrospinal fluid levels of legumain, C1q and hemopexin were not significantly different between multiple sclerosis patients, other neurological diseases, or healthy controls.


Assuntos
Encefalomielite Autoimune Experimental/diagnóstico , Lobo Frontal/patologia , Esclerose Múltipla/diagnóstico , Adulto , Idoso , Idoso de 80 Anos ou mais , Animais , Biomarcadores/análise , Biomarcadores/metabolismo , Complemento C1q/análise , Complemento C1q/metabolismo , Cuprizona/administração & dosagem , Cuprizona/toxicidade , Cisteína Endopeptidases/análise , Cisteína Endopeptidases/metabolismo , Encefalomielite Autoimune Experimental/induzido quimicamente , Encefalomielite Autoimune Experimental/imunologia , Encefalomielite Autoimune Experimental/patologia , Feminino , Lobo Frontal/efeitos dos fármacos , Lobo Frontal/imunologia , Regulação da Expressão Gênica/imunologia , Hemopexina/análise , Hemopexina/metabolismo , Humanos , Imuno-Histoquímica , Masculino , Camundongos , Pessoa de Meia-Idade , Esclerose Múltipla/induzido quimicamente , Esclerose Múltipla/imunologia , Esclerose Múltipla/patologia , Proteômica , Adulto Jovem
6.
J Proteome Res ; 20(1): 122-128, 2021 01 01.
Artigo em Inglês | MEDLINE | ID: mdl-32969222

RESUMO

Randomization is used in experimental design to reduce the prevalence of unanticipated confounders. Complete randomization can however create imbalanced designs, for example, grouping all samples of the same condition in the same batch. Block randomization is an approach that can prevent severe imbalances in sample allocation with respect to both known and unknown confounders. This feature provides the reader with an introduction to blocking and randomization, and insights into how to effectively organize samples during experimental design, with special considerations with respect to proteomics.


Assuntos
Proteômica , Projetos de Pesquisa , Distribuição Aleatória
7.
Stem Cell Reports ; 15(5): 1067-1079, 2020 11 10.
Artigo em Inglês | MEDLINE | ID: mdl-33125875

RESUMO

The role of leptin receptor (OB-R) signaling in linking pluripotency with growth and development and the consequences of dysfunctional leptin signaling on progression of metabolic disease is poorly understood. Using a global unbiased proteomics approach we report that embryonic fibroblasts (MEFs) carrying the db/db mutation exhibit metabolic abnormalities, while their reprogrammed induced pluripotent stem cells (iPSCs) show altered expression of proteins involved in embryonic development. An upregulation in expression of eukaryotic translation initiation factor 4e (Eif4e) and Stat3 binding to the Eif4e promoter was supported by enhanced protein synthesis in mutant iPSCs. Directed differentiation of db/db iPSCs toward the neuronal lineage showed defects. Gene editing to correct the point mutation in db/db iPSCs using CRISPR-Cas9, restored expression of neuronal markers and protein synthesis while reversing the metabolic defects. These data imply a direct role for OB-R in regulating metabolism in embryonic fibroblasts and key developmental pathways in iPSCs.


Assuntos
Fator de Iniciação 4E em Eucariotos/metabolismo , Células-Tronco Pluripotentes Induzidas/metabolismo , Biossíntese de Proteínas , Receptores para Leptina/metabolismo , Fator de Transcrição STAT3/metabolismo , Transdução de Sinais , Animais , Sistemas CRISPR-Cas , Diferenciação Celular , Linhagem da Célula , Fator de Iniciação 4E em Eucariotos/genética , Fibroblastos/metabolismo , Edição de Genes , Regulação da Expressão Gênica no Desenvolvimento , Metaboloma , Camundongos , Camundongos Knockout , Neurogênese , Proteínas , Proteômica , Receptores para Leptina/genética
8.
Clin Proteomics ; 17: 33, 2020.
Artigo em Inglês | MEDLINE | ID: mdl-32963504

RESUMO

BACKGROUND: Verification of cerebrospinal fluid (CSF) biomarkers for multiple sclerosis and other neurological diseases is a major challenge due to a large number of candidates, limited sample material availability, disease and biological heterogeneity, and the lack of standardized assays. Furthermore, verification studies are often based on a low number of proteins from a single discovery experiment in medium-sized cohorts, where antibodies and surrogate peptides may differ, thus only providing an indication of proteins affected by the disease and not revealing the bigger picture or concluding on the validity of the markers. We here present a standard approach for locating promising biomarker candidates based on existing knowledge, resulting in high-quality assays covering the main biological processes affected by multiple sclerosis for comparable measurements over time. METHODS: Biomarker candidates were located in CSF-PR (proteomics.uib.no/csf-pr), and further filtered based on estimated concentration in CSF and biological function. Peptide surrogates for internal standards were selected according to relevant criteria, parallel reaction monitoring (PRM) assays created, and extensive assay quality testing performed, i.e. intra- and inter-day variation, trypsin digestion status over time, and whether the peptides were able to separate multiple sclerosis patients and controls. RESULTS: Assays were developed for 25 proteins, represented by 72 peptides selected according to relevant guidelines and available literature and tested for assay peptide suitability. Stability testing revealed 64 peptides with low intra- and inter-day variations, with 44 also being stably digested after 16 h of trypsin digestion, and 37 furthermore showing a significant difference between multiple sclerosis and controls, thereby confirming literature findings. Calibration curves and the linear area of measurement have, so far, been determined for 17 of these peptides. CONCLUSIONS: We present 37 high-quality PRM assays across 21 CSF-proteins found to be affected by multiple sclerosis, along with a recommended workflow for future development of new assays. The assays can directly be used by others, thus enabling better comparison between studies. Finally, the assays can robustly and stably monitor biological processes in multiple sclerosis patients over time, thus potentially aiding in diagnosis and prognosis, and ultimately in treatment decisions.

9.
J Proteome Res ; 19(8): 3562-3566, 2020 08 07.
Artigo em Inglês | MEDLINE | ID: mdl-32431147

RESUMO

Although metaproteomics, the study of the collective proteome of microbial communities, has become increasingly powerful and popular over the past few years, the field has lagged behind on the availability of user-friendly, end-to-end pipelines for data analysis. We therefore describe the connection from two commonly used metaproteomics data processing tools in the field, MetaProteomeAnalyzer and PeptideShaker, to Unipept for downstream analysis. Through these connections, direct end-to-end pipelines are built from database searching to taxonomic and functional annotation.


Assuntos
Análise de Dados , Microbiota , Proteoma , Proteômica , Software
10.
Mass Spectrom Rev ; 39(3): 292-306, 2020 05.
Artigo em Inglês | MEDLINE | ID: mdl-28902424

RESUMO

Sequence database search engines are bioinformatics algorithms that identify peptides from tandem mass spectra using a reference protein sequence database. Two decades of development, notably driven by advances in mass spectrometry, have provided scientists with more than 30 published search engines, each with its own properties. In this review, we present the common paradigm behind the different implementations, and its limitations for modern mass spectrometry datasets. We also detail how the search engines attempt to alleviate these limitations, and provide an overview of the different software frameworks available to the researcher. Finally, we highlight alternative approaches for the identification of proteomic mass spectrometry datasets, either as a replacement for, or as a complement to, sequence database search engines.


Assuntos
Espectrometria de Massas/métodos , Proteínas/química , Proteômica/métodos , Ferramenta de Busca/métodos , Animais , Humanos , Fluxo de Trabalho
11.
Stem Cells ; 38(4): 542-555, 2020 04.
Artigo em Inglês | MEDLINE | ID: mdl-31828876

RESUMO

A comprehensive characterization of the molecular processes controlling cell fate decisions is essential to derive stable progenitors and terminally differentiated cells that are functional from human pluripotent stem cells (hPSCs). Here, we report the use of quantitative proteomics to describe early proteome adaptations during hPSC differentiation toward pancreatic progenitors. We report that the use of unbiased quantitative proteomics allows the simultaneous profiling of numerous proteins at multiple time points, and is a valuable tool to guide the discovery of signaling events and molecular signatures underlying cellular differentiation. We also monitored the activity level of pathways whose roles are pivotal in the early pancreas differentiation, including the Hippo signaling pathway. The quantitative proteomics data set provides insights into the dynamics of the global proteome during the transition of hPSCs from a pluripotent state toward pancreatic differentiation.


Assuntos
Pâncreas/metabolismo , Células-Tronco Pluripotentes/metabolismo , Proteoma/metabolismo , Proteômica/métodos , Diferenciação Celular , Humanos , Pâncreas/citologia
12.
J Proteome Res ; 19(1): 537-542, 2020 01 03.
Artigo em Inglês | MEDLINE | ID: mdl-31755270

RESUMO

The field of computational proteomics is approaching the big data age, driven both by a continuous growth in the number of samples analyzed per experiment as well as by the growing amount of data obtained in each analytical run. In order to process these large amounts of data, it is increasingly necessary to use elastic compute resources such as Linux-based cluster environments and cloud infrastructures. Unfortunately, the vast majority of cross-platform proteomics tools are not able to operate directly on the proprietary formats generated by the diverse mass spectrometers. Here, we present ThermoRawFileParser, an open-source, cross-platform tool that converts Thermo RAW files into open file formats such as MGF and the HUPO-PSI standard file format mzML. To ensure the broadest possible availability and to increase integration capabilities with popular workflow systems such as Galaxy or Nextflow, we have also built Conda package and BioContainers container around ThermoRawFileParser. In addition, we implemented a user-friendly interface (ThermoRawFileParserGUI) for those users not familiar with command-line tools. Finally, we performed a benchmark of ThermoRawFileParser and msconvert to verify that the converted mzML files contain reliable quantitative results.


Assuntos
Biologia Computacional/métodos , Proteômica/métodos , Software , Bases de Dados de Proteínas , Proteínas de Saccharomyces cerevisiae/metabolismo , Fluxo de Trabalho
13.
Methods Mol Biol ; 2044: 377-391, 2019.
Artigo em Inglês | MEDLINE | ID: mdl-31432427

RESUMO

Every year, a large number of published studies present biomarkers for various neurological disorders. Many of these studies are based on mass spectrometry proteomics data and describe comparison of the abundance of proteins in cerebrospinal fluid between two or more disease groups. As the number of such studies is growing, it is no longer straightforward to obtain an overview of which specific proteins are increased or decreased between the numerous relevant diseases and their many subcategories, or to see the larger picture or trends between related diseases. To alleviate this situation, we therefore mined the literature for mass spectrometry-based proteomics studies including quantitative protein data from cerebrospinal fluid of patients with multiple sclerosis, Alzheimer's disease, and Parkinson's disease and organized the extracted data in the Cerebrospinal Fluid Proteome Resource (CSF-PR). CSF-PR is freely available online at http://probe.uib.no/csf-pr , is highly interactive, and allows for easy navigation, visualization, and export of the published scientific data. This chapter will guide the user through some of the most important features of the tool and show examples of the suggested use cases.


Assuntos
Doença de Alzheimer/líquido cefalorraquidiano , Proteínas do Líquido Cefalorraquidiano/metabolismo , Esclerose Múltipla/líquido cefalorraquidiano , Doença de Parkinson/líquido cefalorraquidiano , Proteoma/análise , Biomarcadores/líquido cefalorraquidiano , Biomarcadores/química , Biomarcadores/metabolismo , Proteínas do Líquido Cefalorraquidiano/química , Mineração de Dados , Bases de Dados de Proteínas , Humanos , Espectrometria de Massas , Peptídeos/química , Proteoma/química , Proteoma/metabolismo , Proteômica
14.
Gigascience ; 8(8)2019 08 01.
Artigo em Inglês | MEDLINE | ID: mdl-31363752

RESUMO

BACKGROUND: Mapping biomedical data to functional knowledge is an essential task in bioinformatics and can be achieved by querying identifiers (e.g., gene sets) in pathway knowledge bases. However, the isoform and posttranslational modification states of proteins are lost when converting input and pathways into gene-centric lists. FINDINGS: Based on the Reactome knowledge base, we built a network of protein-protein interactions accounting for the documented isoform and modification statuses of proteins. We then implemented a command line application called PathwayMatcher (github.com/PathwayAnalysisPlatform/PathwayMatcher) to query this network. PathwayMatcher supports multiple types of omics data as input and outputs the possibly affected biochemical reactions, subnetworks, and pathways. CONCLUSIONS: PathwayMatcher enables refining the network representation of pathways by including proteoforms defined as protein isoforms with posttranslational modifications. The specificity of pathway analyses is hence adapted to different levels of granularity, and it becomes possible to distinguish interactions between different forms of the same protein.


Assuntos
Biologia Computacional/métodos , Redes Reguladoras de Genes , Transdução de Sinais , Software , Humanos , Polimorfismo de Nucleotídeo Único , Mapeamento de Interação de Proteínas/métodos , Processamento de Proteína Pós-Traducional
15.
Eur J Mass Spectrom (Chichester) ; 25(6): 451-456, 2019 Dec.
Artigo em Inglês | MEDLINE | ID: mdl-31189351

RESUMO

Single amino acids and small endogenous peptides play important roles in maintaining a properly functioning organism. These molecules are however currently only routinely identified in targeted approaches. In a small proof-of-concept mass spectrometry experiment, we found that by combining isobaric tags and peptidomics, and by targeting singly charged molecules, we were able to identify a significant amount of single amino acids and small endogenous peptides using a basic mass-based identification approach. While there is still room for improvement, our simple test indicates that a limited amount of extra work when setting up the mass spectrometry experiment could potentially lead to a wealth of additional information.


Assuntos
Aminoácidos/química , Peptídeos/química , Espectrometria de Massas , Proteômica
16.
J Proteome Res ; 18(6): 2686-2692, 2019 06 07.
Artigo em Inglês | MEDLINE | ID: mdl-31081335

RESUMO

Mass-spectrometry-based proteomics enables the high-throughput identification and quantification of proteins, including sequence variants and post-translational modifications (PTMs) in biological samples. However, most workflows require that such variations be included in the search space used to analyze the data, and doing so remains challenging with most analysis tools. In order to facilitate the search for known sequence variants and PTMs, the Proteomics Standards Initiative (PSI) has designed and implemented the PSI extended FASTA format (PEFF). PEFF is based on the very popular FASTA format but adds a uniform mechanism for encoding substantially more metadata about the sequence collection as well as individual entries, including support for encoding known sequence variants, PTMs, and proteoforms. The format is very nearly backward compatible, and as such, existing FASTA parsers will require little or no changes to be able to read PEFF files as FASTA files, although without supporting any of the extra capabilities of PEFF. PEFF is defined by a full specification document, controlled vocabulary terms, a set of example files, software libraries, and a file validator. Popular software and resources are starting to support PEFF, including the sequence search engine Comet and the knowledge bases neXtProt and UniProtKB. Widespread implementation of PEFF is expected to further enable proteogenomics and top-down proteomics applications by providing a standardized mechanism for encoding protein sequences and their known variations. All the related documentation, including the detailed file format specification and example files, are available at http://www.psidev.info/peff .


Assuntos
Proteômica/normas , Humanos , Armazenamento e Recuperação da Informação , Espectrometria de Massas , Software
17.
J Proteome Res ; 17(11): 3801-3809, 2018 11 02.
Artigo em Inglês | MEDLINE | ID: mdl-30251541

RESUMO

Biochemical pathways are commonly used as a reference to conduct functional analysis on biomedical omics data sets, where experimental results are mapped to knowledgebases comprising known molecular interactions collected from the literature. Due to their central role, the content of the functional knowledgebases directly influences the outcome of pathway analyses. In this study, we investigate the structure of the current pathway knowledge, as exemplified by Reactome, discuss the consequences for biological interpretation, and outline possible improvements in the use of pathway knowledgebases. By providing a view of the underlying protein interaction network, we aim to help pathway analysis users manage their expectations and better identify possible artifacts in the results.


Assuntos
Biologia Computacional/métodos , Linfócitos/metabolismo , Células Mieloides/metabolismo , Mapeamento de Interação de Proteínas/métodos , Proteômica/métodos , Bases de Dados de Proteínas , Humanos , Bases de Conhecimento , Linfócitos/citologia , Redes e Vias Metabólicas/fisiologia , Células Mieloides/citologia , Mapas de Interação de Proteínas
18.
J Proteome Res ; 17(7): 2552-2555, 2018 07 06.
Artigo em Inglês | MEDLINE | ID: mdl-29774740

RESUMO

Mass-spectrometry-based proteomics has become the standard approach for identifying and quantifying proteins. A vital step consists of analyzing experimentally generated mass spectra to identify the underlying peptide sequences for later mapping to the originating proteins. We here present the latest developments in SearchGUI, a common open-source interface for the most frequently used freely available proteomics search and de novo engines that has evolved into a central component in numerous bioinformatics workflows.


Assuntos
Proteômica/métodos , Ferramenta de Busca/métodos , Espectrometria de Massas em Tandem , Algoritmos , Biologia Computacional , Proteínas/análise , Fluxo de Trabalho
19.
Cancer Res ; 77(21): e43-e46, 2017 11 01.
Artigo em Inglês | MEDLINE | ID: mdl-29092937

RESUMO

Proteogenomics has emerged as a valuable approach in cancer research, which integrates genomic and transcriptomic data with mass spectrometry-based proteomics data to directly identify expressed, variant protein sequences that may have functional roles in cancer. This approach is computationally intensive, requiring integration of disparate software tools into sophisticated workflows, challenging its adoption by nonexpert, bench scientists. To address this need, we have developed an extensible, Galaxy-based resource aimed at providing more researchers access to, and training in, proteogenomic informatics. Our resource brings together software from several leading research groups to address two foundational aspects of proteogenomics: (i) generation of customized, annotated protein sequence databases from RNA-Seq data; and (ii) accurate matching of tandem mass spectrometry data to putative variants, followed by filtering to confirm their novelty. Directions for accessing software tools and workflows, along with instructional documentation, can be found at z.umn.edu/canresgithub. Cancer Res; 77(21); e43-46. ©2017 AACR.


Assuntos
Biologia Computacional/métodos , Genômica/métodos , Neoplasias/genética , Software , Genoma Humano , Humanos , Proteômica/métodos , Espectrometria de Massas em Tandem , Transcriptoma/genética
20.
Proteomics ; 17(19)2017 Oct.
Artigo em Inglês | MEDLINE | ID: mdl-28792687

RESUMO

The availability of user-friendly software to annotate biological datasets and experimental details is becoming essential in data management practices, both in local storage systems and in public databases. The Ontology Lookup Service (OLS, http://www.ebi.ac.uk/ols) is a popular centralized service to query, browse and navigate biomedical ontologies and controlled vocabularies. Recently, the OLS framework has been completely redeveloped (version 3.0), including enhancements in the data model, like the added support for Web Ontology Language based ontologies, among many other improvements. However, the new OLS is not backwards compatible and new software tools are needed to enable access to this widely used framework now that the previous version is no longer available. We here present the OLS Client as a free, open-source Java library to retrieve information from the new version of the OLS. It enables rapid tool creation by providing a robust, pluggable programming interface and common data model to programmatically access the OLS. The library has already been integrated and is routinely used by several bioinformatics resources and related data annotation tools. Secondly, we also introduce an updated version of the OLS Dialog (version 2.0), a Java graphical user interface that can be easily plugged into Java desktop applications to access the OLS. The software and related documentation are freely available at https://github.com/PRIDE-Utilities/ols-client and https://github.com/PRIDE-Toolsuite/ols-dialog.


Assuntos
Ontologias Biológicas , Biologia Computacional/métodos , Bases de Dados Factuais , Software , Genômica , Humanos , Armazenamento e Recuperação da Informação , Metabolômica , Proteômica , Interface Usuário-Computador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA