Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 13 de 13
Filtrar
1.
Nucleic Acids Res ; 39(Web Server issue): W450-4, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21622961

RESUMO

In this article, we present CoPub 5.0, a publicly available text mining system, which uses Medline abstracts to calculate robust statistics for keyword co-occurrences. CoPub was initially developed for the analysis of microarray data, but we broadened the scope by implementing new technology and new thesauri. In CoPub 5.0, we integrated existing CoPub technology with new features, and provided a new advanced interface, which can be used to answer a variety of biological questions. CoPub 5.0 allows searching for keywords of interest and its relations to curated thesauri and provides highlighting and sorting mechanisms, using its statistics, to retrieve the most important abstracts in which the terms co-occur. It also provides a way to search for indirect relations between genes, drugs, pathways and diseases, following an ABC principle, in which A and C have no direct connection but are connected via shared B intermediates. With CoPub 5.0, it is possible to create, annotate and analyze networks using the layout and highlight options of Cytoscape web, allowing for literature based systems biology. Finally, operations of the CoPub 5.0 Web service enable to implement the CoPub technology in bioinformatics workflows. CoPub 5.0 can be accessed through the CoPub portal http://www.copub.org.


Assuntos
Mineração de Dados/métodos , Software , Redes Reguladoras de Genes , Internet , PubMed
2.
Rapid Commun Mass Spectrom ; 26(20): 2461-71, 2012 Oct 30.
Artigo em Inglês | MEDLINE | ID: mdl-22976213

RESUMO

RATIONALE: High-resolution multistage MS(n) data contains detailed information that can be used for structural elucidation of compounds observed in metabolomics studies. However, full exploitation of this complex data requires significant analysis efforts by human experts. In silico methods currently used to support data annotation by assigning substructures of candidate molecules are limited to a single level of MS fragmentation. METHODS: We present an extended substructure-based approach which allows annotation of hierarchical spectral trees obtained from high-resolution multistage MS(n) experiments. The algorithm yields a hierarchical tree of substructures of a candidate molecule to explain the fragment peaks observed at consecutive levels of the multistage MS(n) spectral tree. A matching score is calculated that indicates how well the candidate structure can explain the observed hierarchical fragmentation pattern. RESULTS: The method is applied to MS(n) spectral trees of a set of compounds representing important chemical classes in metabolomics. Based on the calculated score, the correct molecules were successfully prioritized among extensive sets of candidates structures retrieved from the PubChem database. CONCLUSIONS: The results indicate that the inclusion of subsequent levels of fragmentation in the automatic annotation of MS(n) data improves the identification of the correct compounds. We show that, especially in the case of lower mass accuracy, this improvement is not only due to the inclusion of additional fragment ions in the analysis, but also to the specific hierarchical information present in the MS(n) spectral trees. This method may significantly reduce the time required by MS experts to analyze complex MS(n) data.


Assuntos
Algoritmos , Espectrometria de Massas/métodos , Bases de Dados Factuais , Metabolômica/métodos
3.
Arthritis Rheum ; 63(5): 1265-73, 2011 May.
Artigo em Inglês | MEDLINE | ID: mdl-21305530

RESUMO

OBJECTIVE: Rheumatoid arthritis (RA) is characterized by inflammation and joint destruction, with the degree of damage varying greatly among patients. Prediction of disease severity using known clinical and serologic risk factors is inaccurate. This study was undertaken to identify new serologic markers for RA severity using an in silico model of the rheumatic joint. METHODS: An in silico model of a prototypical rheumatic joint was used to predict candidate markers associated with erosiveness. The following 4 markers were chosen for validation: tartrate-resistant acid phosphatase 5b (TRAP-5b), N-telopeptide of type I collagen (NTX), angiopoietin 2 (Ang-2), and CXCL13. Serum from 74 RA patients was used to study whether radiologic joint destruction (total erosion score and total Sharp/van der Heijde score [SHS]) after 4 years of disease was associated with serum levels at the time of diagnosis. Serum marker levels were determined using enzyme-linked immunosorbent assays. For confirmation, baseline serum levels were analyzed for an association with progression of joint damage over 7 years of followup in a cohort of 155 patients with early RA. RESULTS: Comparison of high and low quartiles of erosion score and SHS at 4 years showed a difference in baseline serum CXCL13 level (P = 0.011 and P = 0.018, respectively). In the confirmation cohort, elevated baseline CXCL13 levels were associated with increased rates of joint destruction during 7 years of followup (P < 0.001 unadjusted and P ≤ 0.004 with adjustment for C-reactive protein level). Analyzing anti-CCP-2-positive and anti-CCP-2­negative RA separately yielded a significant result only in the anti-CCP-2-negative group (P ≤ 0.001). CONCLUSION: Our findings indicate that CXCL13 is a novel serologic marker predictive of RA severity.This marker was identified with the help of an in silicomodel of the RA joint.


Assuntos
Artrite Reumatoide/sangue , Quimiocina CXCL13/sangue , Articulações do Pé/diagnóstico por imagem , Articulação da Mão/diagnóstico por imagem , Adulto , Idoso , Antirreumáticos/uso terapêutico , Artrite Reumatoide/diagnóstico por imagem , Artrite Reumatoide/tratamento farmacológico , Artrografia , Biomarcadores/sangue , Progressão da Doença , Ensaio de Imunoadsorção Enzimática , Feminino , Humanos , Masculino , Pessoa de Meia-Idade , Modelos Anatômicos , Indução de Remissão , Índice de Gravidade de Doença , Estatísticas não Paramétricas , Resultado do Tratamento
4.
PLoS Comput Biol ; 6(9)2010 Sep 23.
Artigo em Inglês | MEDLINE | ID: mdl-20885778

RESUMO

The scientific literature represents a rich source for retrieval of knowledge on associations between biomedical concepts such as genes, diseases and cellular processes. A commonly used method to establish relationships between biomedical concepts from literature is co-occurrence. Apart from its use in knowledge retrieval, the co-occurrence method is also well-suited to discover new, hidden relationships between biomedical concepts following a simple ABC-principle, in which A and C have no direct relationship, but are connected via shared B-intermediates. In this paper we describe CoPub Discovery, a tool that mines the literature for new relationships between biomedical concepts. Statistical analysis using ROC curves showed that CoPub Discovery performed well over a wide range of settings and keyword thesauri. We subsequently used CoPub Discovery to search for new relationships between genes, drugs, pathways and diseases. Several of the newly found relationships were validated using independent literature sources. In addition, new predicted relationships between compounds and cell proliferation were validated and confirmed experimentally in an in vitro cell proliferation assay. The results show that CoPub Discovery is able to identify novel associations between genes, drugs, pathways and diseases that have a high probability of being biologically valid. This makes CoPub Discovery a useful tool to unravel the mechanisms behind disease, to find novel drug targets, or to find novel applications for existing drugs.


Assuntos
Biologia Computacional/métodos , Mineração de Dados/métodos , Doença , Genes , Preparações Farmacêuticas , Descoberta de Drogas , Humanos , Leucócitos Mononucleares/fisiologia , MEDLINE , Redes e Vias Metabólicas , Reconhecimento Automatizado de Padrão/métodos , Curva ROC , Reprodutibilidade dos Testes , Transdução de Sinais , Software
5.
BMC Bioinformatics ; 11: 158, 2010 Mar 26.
Artigo em Inglês | MEDLINE | ID: mdl-20346140

RESUMO

BACKGROUND: Gene expression data can be analyzed by summarizing groups of individual gene expression profiles based on GO annotation information. The mean expression profile per group can then be used to identify interesting GO categories in relation to the experimental settings. However, the expression profiles present in GO classes are often heterogeneous, i.e., there are several different expression profiles within one class. As a result, important experimental findings can be obscured because the summarizing profile does not seem to be of interest. We propose to tackle this problem by finding homogeneous subclasses within GO categories: preclustering. RESULTS: Two microarray datasets are analyzed. First, a selection of genes from a well-known Saccharomyces cerevisiae dataset is used. The GO class "cell wall organization and biogenesis" is shown as a specific example. After preclustering, this term can be associated with different phases in the cell cycle, where it could not be associated with a specific phase previously. Second, a dataset of differentiation of human Mesenchymal Stem Cells (MSC) into osteoblasts is used. For this dataset results are shown in which the GO term "skeletal development" is a specific example of a heterogeneous GO class for which better associations can be made after preclustering. The Intra Cluster Correlation (ICC), a measure of cluster tightness, is applied to identify relevant clusters. CONCLUSIONS: We show that this method leads to an improved interpretability of results in Principal Component Analysis.


Assuntos
Perfilação da Expressão Gênica/métodos , Expressão Gênica , Análise de Componente Principal , Ciclo Celular/genética , Diferenciação Celular/genética , Análise por Conglomerados , Bases de Dados Genéticas , Humanos , Células-Tronco Mesenquimais/citologia , Saccharomyces cerevisiae/genética
6.
Nucleic Acids Res ; 36(Web Server issue): W406-10, 2008 Jul 01.
Artigo em Inglês | MEDLINE | ID: mdl-18442992

RESUMO

Medline is a rich information source, from which links between genes and keywords describing biological processes, pathways, drugs, pathologies and diseases can be extracted. We developed a publicly available tool called CoPub that uses the information in the Medline database for the biological interpretation of microarray data. CoPub allows batch input of multiple human, mouse or rat genes and produces lists of keywords from several biomedical thesauri that are significantly correlated with the set of input genes. These lists link to Medline abstracts in which the co-occurring input genes and correlated keywords are highlighted. Furthermore, CoPub can graphically visualize differentially expressed genes and over-represented keywords in a network, providing detailed insight in the relationships between genes and keywords, and revealing the most influential genes as highly connected hubs. CoPub is freely accessible at http://services.nbic.nl/cgi-bin/copub/CoPub.pl.


Assuntos
Perfilação da Expressão Gênica , MEDLINE , Análise de Sequência com Séries de Oligonucleotídeos , Software , Animais , Humanos , Internet , Camundongos , Ratos , Interface Usuário-Computador
7.
Sci Data ; 3: 160018, 2016 Mar 15.
Artigo em Inglês | MEDLINE | ID: mdl-26978244

RESUMO

There is an urgent need to improve the infrastructure supporting the reuse of scholarly data. A diverse set of stakeholders-representing academia, industry, funding agencies, and scholarly publishers-have come together to design and jointly endorse a concise and measureable set of principles that we refer to as the FAIR Data Principles. The intent is that these may act as a guideline for those wishing to enhance the reusability of their data holdings. Distinct from peer initiatives that focus on the human scholar, the FAIR Principles put specific emphasis on enhancing the ability of machines to automatically find and use the data, in addition to supporting its reuse by individuals. This Comment is the first formal publication of the FAIR Principles, and includes the rationale behind them, and some exemplar implementations in the community.


Assuntos
Coleta de Dados , Curadoria de Dados , Projetos de Pesquisa , Sistemas de Gerenciamento de Base de Dados , Guias como Assunto , Reprodutibilidade dos Testes
8.
Drug Discov Today ; 19(7): 859-68, 2014 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-24361338

RESUMO

Science, and the way we undertake research, is changing. The increasing rate of data generation across all scientific disciplines is providing incredible opportunities for data-driven research, with the potential to transform our current practices. The exploitation of so-called 'big data' will enable us to undertake research projects never previously possible but should also stimulate a re-evaluation of all our data practices. Data-driven medicinal chemistry approaches have the potential to improve decision making in drug discovery projects, providing that all researchers embrace the role of 'data scientist' and uncover the meaningful relationships and patterns in available data.


Assuntos
Química Farmacêutica/tendências , Descoberta de Drogas/tendências , Estatística como Assunto/tendências , Animais , Química Farmacêutica/métodos , Descoberta de Drogas/métodos , Humanos , Estatística como Assunto/métodos
10.
BioData Min ; 6(1): 2, 2013 Feb 04.
Artigo em Inglês | MEDLINE | ID: mdl-23379763

RESUMO

BACKGROUND: Glucocorticoids are potent anti-inflammatory agents used for the treatment of diseases such as rheumatoid arthritis, asthma, inflammatory bowel disease and psoriasis. Unfortunately, usage is limited because of metabolic side-effects, e.g. insulin resistance, glucose intolerance and diabetes. To gain more insight into the mechanisms behind glucocorticoid induced insulin resistance, it is important to understand which genes play a role in the development of insulin resistance and which genes are affected by glucocorticoids.Medline abstracts contain many studies about insulin resistance and the molecular effects of glucocorticoids and thus are a good resource to study these effects. RESULTS: We developed CoPubGene a method to automatically identify gene-disease associations in Medline abstracts. We used this method to create a literature network of genes related to insulin resistance and to evaluate the importance of the genes in this network for glucocorticoid induced metabolic side effects and anti-inflammatory processes.With this approach we found several genes that already are considered markers of GC induced IR, such as phosphoenolpyruvate carboxykinase (PCK) and glucose-6-phosphatase, catalytic subunit (G6PC). In addition, we found genes involved in steroid synthesis that have not yet been recognized as mediators of GC induced IR. CONCLUSIONS: With this approach we are able to construct a robust informative literature network of insulin resistance related genes that gave new insights to better understand the mechanisms behind GC induced IR. The method has been set up in a generic way so it can be applied to a wide variety of disease networks.

11.
PLoS One ; 7(11): e48385, 2012.
Artigo em Inglês | MEDLINE | ID: mdl-23152771

RESUMO

Glucocorticoids (GCs) such as prednisolone are potent immunosuppressive drugs but suffer from severe adverse effects, including the induction of insulin resistance. Therefore, development of so-called Selective Glucocorticoid Receptor Modulators (SGRM) is highly desirable. Here we describe a non-steroidal Glucocorticoid Receptor (GR)-selective compound (Org 214007-0) with a binding affinity to GR similar to that of prednisolone. Structural modelling of the GR-Org 214007-0 binding site shows disturbance of the loop between helix 11 and helix 12 of GR, confirmed by partial recruitment of the TIF2-3 peptide. Using various cell lines and primary human cells, we show here that Org 214007-0 acts as a partial GC agonist, since it repressed inflammatory genes and was less effective in induction of metabolic genes. More importantly, in vivo studies in mice indicated that Org 214007-0 retained full efficacy in acute inflammation models as well as in a chronic collagen-induced arthritis (CIA) model. Gene expression profiling of muscle tissue derived from arthritic mice showed a partial activity of Org 214007-0 at an equi-efficacious dosage of prednisolone, with an increased ratio in repression versus induction of genes. Finally, in mice Org 214007-0 did not induce elevated fasting glucose nor the shift in glucose/glycogen balance in the liver seen with an equi-efficacious dose of prednisolone. All together, our data demonstrate that Org 214007-0 is a novel SGRMs with an improved therapeutic index compared to prednisolone. This class of SGRMs can contribute to effective anti-inflammatory therapy with a lower risk for metabolic side effects.


Assuntos
Anti-Inflamatórios não Esteroides/farmacologia , Dibenzazepinas/farmacologia , Receptores de Glucocorticoides/agonistas , Tiadiazóis/farmacologia , Animais , Anti-Inflamatórios não Esteroides/química , Anti-Inflamatórios não Esteroides/uso terapêutico , Artrite Experimental/tratamento farmacológico , Artrite Experimental/genética , Glicemia , Dibenzazepinas/uso terapêutico , Feminino , Regulação da Expressão Gênica/efeitos dos fármacos , Humanos , Cinética , Fígado/efeitos dos fármacos , Fígado/enzimologia , Masculino , Camundongos , Simulação de Acoplamento Molecular , Prednisolona/farmacologia , Prednisolona/uso terapêutico , Ligação Proteica , Receptores de Glucocorticoides/química , Receptores de Glucocorticoides/metabolismo , Tiadiazóis/uso terapêutico
12.
Drug Discov Today ; 16(13-14): 555-68, 2011 Jul.
Artigo em Inglês | MEDLINE | ID: mdl-21605698

RESUMO

The difference between biologically active molecules and drugs is that the latter balance an array of related and unrelated properties required for administration to patients. Inevitability, during optimization, some of these multiple factors will conflict. Although informatics has a crucial role in addressing the challenges of modern compound optimization, it is arguably still undervalued and underutilized. We present here some of the basic requirements of multi-parameter drug design, the crucial role of informatics and examples of favorable practice. The most crucial of these best practices are the need for informaticians to align their technologies and insights directly to discovery projects and for all scientists in drug discovery to become more proficient in the use of in silico methods.


Assuntos
Biologia Computacional/métodos , Simulação por Computador , Desenho de Fármacos , Descoberta de Drogas/métodos , Humanos , Modelos Moleculares
13.
Pharmacogenomics ; 8(11): 1521-34, 2007 Nov.
Artigo em Inglês | MEDLINE | ID: mdl-18034617

RESUMO

INTRODUCTION: To reduce continuously increasing costs in drug development, adverse effects of drugs need to be detected as early as possible in the process. In recent years, compound-induced gene expression profiling methodologies have been developed to assess compound toxicity, including Gene Ontology term and pathway over-representation analyses. The objective of this study was to introduce an additional approach, in which literature information is used for compound profiling to evaluate compound toxicity and mode of toxicity. METHODS: Gene annotations were built by text mining in Medline abstracts for retrieval of co-publications between genes, pathology terms, biological processes and pathways. This literature information was used to generate compound-specific keyword fingerprints, representing over-represented keywords calculated in a set of regulated genes after compound administration. To see whether keyword fingerprints can be used for assessment of compound toxicity, we analyzed microarray data sets of rat liver treated with 11 hepatotoxicants. RESULTS: Analysis of keyword fingerprints of two genotoxic carcinogens, two nongenotoxic carcinogens, two peroxisome proliferators and two randomly generated gene sets, showed that each compound produced a specific keyword fingerprint that correlated with the experimentally observed histopathological events induced by the individual compounds. By contrast, the random sets produced a flat aspecific keyword profile, indicating that the fingerprints induced by the compounds reflect biological events rather than random noise. A more detailed analysis of the keyword profiles of diethylhexylphthalate, dimethylnitrosamine and methapyrilene (MPy) showed that the differences in the keyword fingerprints of these three compounds are based upon known distinct modes of action. Visualization of MPy-linked keywords and MPy-induced genes in a literature network enabled us to construct a mode of toxicity proposal for MPy, which is in agreement with known effects of MPy in literature. CONCLUSION: Compound keyword fingerprinting based on information retrieved from literature is a powerful approach for compound profiling, allowing evaluation of compound toxicity and analysis of the mode of action.


Assuntos
Carcinógenos/toxicidade , Bases de Dados Bibliográficas , Perfilação da Expressão Gênica , Mutagênicos/toxicidade , Proliferadores de Peroxissomos/toxicidade , Toxicogenética/métodos , Algoritmos , Animais , Bases de Dados Genéticas , Fígado/efeitos dos fármacos , MEDLINE , Processamento de Linguagem Natural , Ratos , Vocabulário Controlado
SELEÇÃO DE REFERÊNCIAS
Detalhe da pesquisa