Results 1 - 13 of 13
1.
PLoS Comput Biol ; 16(7): e1007976, 2020 Jul.
Article in English | MEDLINE | ID: mdl-32702016

ABSTRACT

ELIXIR is a pan-European intergovernmental organisation for life science that aims to coordinate bioinformatics resources in a single infrastructure across Europe; bioinformatics training is central to its strategy, which aims to develop a training community that spans all ELIXIR member states. In an evidence-based approach for strengthening bioinformatics training programmes across Europe, the ELIXIR Training Platform, led by the ELIXIR EXCELERATE Quality and Impact Assessment Subtask in collaboration with the ELIXIR Training Coordinators Group, has implemented an assessment strategy to measure quality and impact of its entire training portfolio. Here, we present ELIXIR's framework for assessing training quality and impact, which includes the following: specifying assessment aims, determining what data to collect in order to address these aims, and our strategy for centralised data collection to allow for ELIXIR-wide analyses. In addition, we present an overview of the ELIXIR training data collected over the past 4 years. We highlight the importance of a coordinated and consistent data collection approach and the relevance of defining specific metrics and answer scales for consortium-wide analyses as well as for comparison of data across iterations of the same course.


Subjects
Computational Biology/education; Quality Control; Algorithms; Biomedical Research; Computational Biology/standards; Curriculum; Data Collection; Databases, Factual; Education, Continuing; Europe; Program Evaluation; Reproducibility of Results; Research Personnel; Software; User-Computer Interface
2.
PLoS Comput Biol ; 14(8): e1006191, 2018 Aug.
Article in English | MEDLINE | ID: mdl-30161124

ABSTRACT

Workshops are used to explore a specific topic, to transfer knowledge, to solve identified problems, or to create something new. In funded research projects and other research endeavours, workshops are the mechanism used to gather the wider project, community, or interested people together around a particular topic. However, natural questions arise: how do we measure the impact of these workshops? Do we know whether they are meeting the goals and objectives we set for them? What indicators should we use? In response to these questions, this paper will outline rules that will improve the measurement of the impact of workshops.


Subjects
Education/standards; Humans; Knowledge; Learning; Research; Weights and Measures
3.
Nucleic Acids Res ; 45(D1): D945-D954, 2017 Jan 04.
Article in English | MEDLINE | ID: mdl-27899562

ABSTRACT

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 and 2014 Nucleic Acids Research Database Issues. Since then, alongside the continued extraction of data from the medicinal chemistry literature, new sources of bioactivity data have also been added to the database. These include: deposited data sets from neglected disease screening; crop protection data; drug metabolism and disposition data and bioactivity data from patents. A number of improvements and new features have also been incorporated. These include the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts. The ChEMBL data can be accessed via a web-interface, RDF distribution, data downloads and RESTful web-services.


Subjects
Databases, Chemical; Databases, Nucleic Acid; Search Engine; Computational Biology/methods; Crop Protection; Drug Discovery; Gene Ontology; Humans; Molecular Sequence Annotation; Pharmacology/methods; User-Computer Interface; Web Browser
4.
Nucleic Acids Res ; 43(W1): W612-20, 2015 Jul 01.
Article in English | MEDLINE | ID: mdl-25883136

ABSTRACT

ChEMBL is now a well-established resource in the fields of drug discovery and medicinal chemistry research. The ChEMBL database curates and stores standardized bioactivity, molecule, target and drug data extracted from multiple sources, including the primary medicinal chemistry literature. Programmatic access to ChEMBL data has been improved by a recent update to the ChEMBL web services (version 2.0.x, https://www.ebi.ac.uk/chembl/api/data/docs), which exposes significantly more data from the underlying database and introduces new functionality. To complement the data-focused services, a utility service (version 1.0.x, https://www.ebi.ac.uk/chembl/api/utils/docs), which provides RESTful access to commonly used cheminformatics methods, has also been concurrently developed. The ChEMBL web services can be used together or independently to build applications and data processing workflows relevant to drug discovery and chemical biology.


Subjects
Databases, Chemical; Drug Discovery; Internet; Systems Integration; User-Computer Interface
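
The abstract of record 4 describes RESTful data services rooted at https://www.ebi.ac.uk/chembl/api/data. As a minimal sketch of what programmatic access might look like, the helper below only assembles request URLs and performs no network calls. The resource names (`molecule`, `activity`), the filter names, the `.json` suffix and the example identifiers are assumptions for illustration and should be checked against the service documentation linked in the abstract.

```python
from urllib.parse import urlencode

# Base URL taken from the abstract (data web services, version 2.0.x).
BASE_URL = "https://www.ebi.ac.uk/chembl/api/data"


def build_chembl_url(resource, chembl_id=None, fmt="json", **filters):
    """Build a request URL for a data-service resource.

    The resource name, format suffix and filter names are assumptions
    made for illustration; they are not confirmed by the abstract.
    """
    path = f"{BASE_URL}/{resource}"
    if chembl_id is not None:
        path += f"/{chembl_id}"
    url = f"{path}.{fmt}"
    if filters:
        # Sort the filters so the query string is deterministic.
        url += "?" + urlencode(sorted(filters.items()))
    return url


if __name__ == "__main__":
    # Illustrative identifiers only.
    print(build_chembl_url("molecule", "CHEMBL25"))
    print(build_chembl_url("activity", target_chembl_id="CHEMBL240", limit=5))
```

Keeping URL construction separate from the HTTP call makes the request easy to inspect and test before any data is fetched.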
5.
Nucleic Acids Res ; 42(Database issue): D1083-90, 2014 Jan.
Article in English | MEDLINE | ID: mdl-24214965

ABSTRACT

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 Nucleic Acids Research Database Issue. Since then, a variety of new data sources and improvements in functionality have contributed to the growth and utility of the resource. In particular, more comprehensive tracking of compounds from research stages through clinical development to market is provided through the inclusion of data from United States Adopted Name applications; a new richer data model for representing drug targets has been developed; and a number of methods have been put in place to allow users to more easily identify reliable data. Finally, access to ChEMBL is now available via a new Resource Description Framework format, in addition to the web-based interface, data downloads and web services.


Subjects
Databases, Chemical; Drug Discovery; Binding Sites; Humans; Internet; Ligands; Pharmaceutical Preparations/chemistry; Proteins/chemistry; Proteins/drug effects
6.
Drug Discov Today Technol ; 14: 17-24, 2015 Jul.
Article in English | MEDLINE | ID: mdl-26194583

ABSTRACT

There is a wealth of valuable chemical information in publicly available databases for use by scientists undertaking drug discovery. However, finite curation resources, limitations of chemical structure software and differences in individual database applications mean that exact chemical structure equivalence between databases is unlikely ever to be a reality. Identifying compound equivalence has been made significantly easier by the International Chemical Identifier (InChI), a non-proprietary line notation for describing a chemical structure. More importantly, advances in methods to identify compounds that are the same at various levels of similarity, such as those containing the same parent component or having the same connectivity, are now enabling related compounds to be linked between databases where the structure matches are not exact.


Subjects
Databases, Chemical; Drug Discovery; Molecular Structure; Software
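
Record 6 mentions linking compounds that match at different levels, for example those with the same connectivity but not an identical structure. One way to illustrate the idea, as a sketch rather than the method used in the paper, is to compare only the leading layers of two Standard InChI strings (formula, `/c` connections, `/h` hydrogens) and ignore the later layers such as charge and stereochemistry. The function names are hypothetical, and the example strings in the usage section are illustrative inputs that should be verified against an authoritative source before reuse.

```python
def split_layers(inchi):
    """Split an InChI string into version, formula and remaining layers."""
    if not inchi.startswith("InChI="):
        raise ValueError("not an InChI string")
    parts = inchi[len("InChI="):].split("/")
    if len(parts) < 2:
        raise ValueError("InChI string has no formula layer")
    return parts[0], parts[1], parts[2:]


def connectivity_key(inchi):
    """Return the formula plus the leading /c and /h layers only.

    Everything from the first other layer onward (charge, stereo,
    isotopes) is dropped, so this is a deliberately coarse comparison.
    """
    _, formula, layers = split_layers(inchi)
    kept = []
    for layer in layers:
        if layer[:1] not in ("c", "h"):
            break
        kept.append(layer)
    return "/".join([formula] + kept)


def match_level(a, b):
    """Classify two InChI strings as exact, same connectivity, or different."""
    if a == b:
        return "exact"
    if connectivity_key(a) == connectivity_key(b):
        return "same connectivity"
    return "different"


if __name__ == "__main__":
    # Illustrative inputs: two stereoisomers and one unrelated structure.
    l_form = "InChI=1S/C3H7NO2/c1-2(4)3(5)6/h2H,4H2,1H3,(H,5,6)/t2-/m0/s1"
    d_form = "InChI=1S/C3H7NO2/c1-2(4)3(5)6/h2H,4H2,1H3,(H,5,6)/t2-/m1/s1"
    other = "InChI=1S/C2H5NO2/c3-1-2(4)5/h1,3H2,(H,4,5)"
    print(match_level(l_form, d_form))
    print(match_level(l_form, other))
```

A production system would more likely rely on a cheminformatics toolkit than on string handling; the point here is only that the layered design of InChI makes graded matching possible.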
7.
Nucleic Acids Res ; 40(Database issue): D1100-7, 2012 Jan.
Article in English | MEDLINE | ID: mdl-21948594

ABSTRACT

ChEMBL is an Open Data database containing binding, functional and ADMET information for a large number of drug-like bioactive compounds. These data are manually abstracted from the primary published literature on a regular basis, then further curated and standardized to maximize their quality and utility across a wide range of chemical biology and drug-discovery research problems. Currently, the database contains 5.4 million bioactivity measurements for more than 1 million compounds and 5200 protein targets. Access is available through a web-based interface, data downloads and web services at: https://www.ebi.ac.uk/chembldb.


Subjects
Databases, Factual; Drug Discovery; Databases, Protein; Humans; Pharmaceutical Preparations/chemistry; Proteins/chemistry; Proteins/metabolism; User-Computer Interface
8.
J Chem Inf Model ; 53(10): 2499-505, 2013 Oct 28.
Article in English | MEDLINE | ID: mdl-24160896

ABSTRACT

Bioactivity databases are routinely used in drug discovery to look up and, using prediction tools, to predict potential targets for small molecules. These databases are typically manually curated from patents and scientific articles. Apart from errors in the source document, the human factor can cause errors during the extraction process. These errors can lead to wrong decisions in the early drug discovery process. In the current work, we have compared bioactivity data from three large databases (ChEMBL, Liceptor, and WOMBAT) that have curated data from the same source documents. As a result, we are able to report error rate estimates for individual activity parameters and individual bioactivity databases. Small molecule structures have the greatest estimated error rate, followed by target, activity value, and activity type. This order is also reflected in supplier-specific error rate estimates. The results are also useful in identifying data points for recuration. We hope the results will lead to a more widespread awareness among scientists of the frequencies and types of errors in bioactivity data.


Subjects
Bibliometrics; Drug Discovery/statistics & numerical data; Proteins/chemistry; Publication Bias; Small Molecule Libraries/chemistry; Databases, Bibliographic; Databases, Chemical; Databases, Pharmaceutical; Humans; Ligands; Patents as Topic; Proteins/agonists; Proteins/antagonists & inhibitors
9.
Biochem Soc Trans ; 39(5): 1365-70, 2011 Oct.
Article in English | MEDLINE | ID: mdl-21936816

ABSTRACT

Translating the huge amount of genomic and biochemical data into new drugs is a costly and challenging task. Historically, there has been comparatively little focus on linking the biochemical and chemical worlds. To address this need, we have developed ChEMBL, an online resource of small-molecule SAR (structure-activity relationship) data, which can be used to support chemical biology, lead discovery and target selection in drug discovery. The database contains the abstracted structures, properties and biological activities for over 700000 distinct compounds and more than 3 million bioactivity records abstracted from over 40000 publications. Additional public domain resources can be readily integrated into the same data model (e.g. PubChem BioAssay data). The compounds in ChEMBL are largely extracted from the primary medicinal chemistry literature, and are therefore usually 'drug-like' or 'lead-like' small molecules with full experimental context. The data cover a significant fraction of the discovery of modern drugs, and are useful in a wide range of drug design and discovery tasks. In addition to the compound data, ChEMBL also contains information for over 8000 protein, cell line and whole-organism 'targets', with over 4000 of those being proteins linked to their underlying genes. The database is searchable both chemically, using an interactive compound sketch tool, protein sequences, family hierarchies, SMILES strings, compound research codes and key words, and biologically, using a variety of gene identifiers, protein sequence similarity and protein families. The information retrieved can then be readily filtered and downloaded into various formats. ChEMBL can be accessed online at https://www.ebi.ac.uk/chembldb.


Subjects
Data Mining; Databases, Factual; Drug Discovery; Animals; Computational Biology/methods; Genomics; Humans; Information Storage and Retrieval; Molecular Structure; Pharmaceutical Preparations/chemistry; Pharmaceutical Preparations/metabolism; Proteins/chemistry; Structure-Activity Relationship
10.
J Cheminform ; 12(1): 51, 2020 Sep 01.
Article in English | MEDLINE | ID: mdl-33431044

ABSTRACT

BACKGROUND: The ChEMBL database is one of a number of public databases that contain bioactivity data on small molecule compounds curated from diverse sources. Incoming compounds are typically not standardised according to consistent rules. In order to maintain the quality of the final database and to easily compare and integrate data on the same compound from different sources, it is necessary for the chemical structures in the database to be appropriately standardised. RESULTS: A chemical curation pipeline has been developed using the open source toolkit RDKit. It comprises three components: a Checker to test the validity of chemical structures and flag any serious errors; a Standardizer, which formats compounds according to defined rules and conventions; and a GetParent component that removes any salts and solvents from the compound to create its parent. This pipeline has been applied to the latest version of the ChEMBL database as well as uncurated datasets from other sources to test the robustness of the process and to identify common issues in database molecular structures. CONCLUSION: All the components of the structure pipeline have been made freely available for other researchers to use and adapt. The code is available in a GitHub repository and it can also be accessed via the ChEMBL Beaker webservices. It has been used successfully to standardise the nearly 2 million compounds in the ChEMBL database and the compound validity checker has been used to identify compounds with the most serious issues so that they can be prioritised for manual curation.
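
The GetParent step described above removes salts and solvents to leave a parent structure. The published pipeline is built on RDKit and applies defined rules; the sketch below does not reproduce it. It swaps in a much cruder, string-only heuristic (keep the largest dot-separated SMILES fragment by approximate heavy-atom count) purely to illustrate the idea. The function names and the atom-counting regular expression are assumptions introduced here.

```python
import re

# Rough SMILES atom tokenizer: bracket atoms first, then two-letter
# halogens, then single-letter organic-subset atoms (upper and aromatic).
ATOM_RE = re.compile(r"\[[^\]]+\]|Cl|Br|[BCNOPSFI]|[bcnops]")


def heavy_atom_count(fragment):
    """Approximate the number of atoms in one SMILES fragment."""
    return len(ATOM_RE.findall(fragment))


def largest_fragment(smiles):
    """Return the dot-separated fragment with the most atoms.

    Ties are resolved in favour of the first fragment. No charge
    neutralisation or validity checking is attempted.
    """
    fragments = smiles.split(".")
    return max(fragments, key=heavy_atom_count)


if __name__ == "__main__":
    # A carboxylate with a sodium counter-ion, and an amine hydrochloride.
    print(largest_fragment("CC(=O)Oc1ccccc1C(=O)[O-].[Na+]"))
    print(largest_fragment("CCN.Cl"))
```

A heuristic like this fails for cases where the counter-ion is larger than the active component, which is one reason a rule-based toolkit implementation is preferable in practice.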

11.
Heliyon ; 4(12): e01057, 2018 Dec.
Article in English | MEDLINE | ID: mdl-30582061

ABSTRACT

In the last decade, bioinformatics has become an indispensable branch of modern science research, experiencing an explosion in financial support, developed applications and data collection. The growth of the datasets that are emerging from research laboratories, industry, the health sector, etc., is increasingly raising the demand for computing power and storage. Processing biological data at the large scales of these datasets often requires the use of High Performance Computing (HPC) resources, especially when dealing with certain types of omics data, such as genomic and metagenomic data. Such computational resources not only require substantial investments, but they also involve high maintenance costs. More importantly, in order to keep good returns from the investments, specific training needs to be put in place to ensure that waste is minimized. Furthermore, given that bioinformatics is a highly interdisciplinary field where several other domains intersect (such as biology, chemistry, physics and computer science), researchers from these areas also require bioinformatics-specific training in HPC in order to take full advantage of supercomputing centers. In this document, we describe our experience in training researchers from several different disciplines in HPC, as applied to bioinformatics under the framework of the leading European bioinformatics platform ELIXIR, and analyze both the content and outcomes of the course.

12.
Sci Data ; 2: 150032, 2015.
Article in English | MEDLINE | ID: mdl-26175909

ABSTRACT

ChEMBL is a large-scale drug discovery database containing bioactivity information primarily extracted from scientific literature. Due to the medicinal chemistry focus of the journals from which data are extracted, the data are currently of most direct value in the field of human health research. However, many of the scientific use-cases for the current data set are equally applicable in other fields, such as crop protection research: for example, identification of chemical scaffolds active against a particular target or endpoint, the de-convolution of the potential targets of a phenotypic assay, or the potential targets/pathways for safety liabilities. In order to broaden the applicability of the ChEMBL database and allow more widespread use in crop protection research, an extensive data set of bioactivity data of insecticidal, fungicidal and herbicidal compounds and assays was collated and added to the database.


Subjects
Crop Protection; Databases, Chemical; Biological Assay; Herbicides; Insecticides
13.
J Cheminform ; 5(1): 3, 2013 Jan 14.
Article in English | MEDLINE | ID: mdl-23317286

ABSTRACT

UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where several chemistry-based resources exist, has required independent efforts by each of the separate teams. These efforts were complicated by the different data models, release schedules, and differing business rules for compound normalization and identifier nomenclature that exist across the organization. UniChem, a large-scale, non-redundant database of Standard InChIs with pointers between these structures and chemical identifiers from all the separate chemistry resources, was developed as a means of efficiently sharing the maintenance overhead of creating these links. Thus, for each source represented in UniChem, all links to and from all other sources are automatically calculated and immediately available for all to use. Updated mappings are immediately available upon loading of new data releases from the sources. Web services in UniChem provide users with a single simple automatable mechanism for maintaining all links from their resource to all other sources represented in UniChem. In addition, functionality to track changes in identifier usage allows users to monitor which identifiers are current, and which are obsolete. Lastly, UniChem has been deliberately designed to allow additional resources to be included with minimal effort. Indeed, the recent inclusion of data sources external to EMBL-EBI has provided a simple means of providing users with an even wider selection of resources to link to, all at no extra cost, while at the same time providing a simple mechanism for external resources to link to all EMBL-EBI chemistry resources.
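
The core idea in this abstract, a non-redundant set of Standard InChIs with pointers to identifiers from each source so that cross-source links follow automatically, can be sketched with two dictionaries. This is a toy in-memory illustration under stated assumptions, not UniChem's schema or web-service interface. The class name, method names, source names and identifiers are all hypothetical.

```python
from collections import defaultdict


class IdentifierIndex:
    """Toy structure-keyed index: Standard InChI -> {(source, identifier)}."""

    def __init__(self):
        self._by_inchi = defaultdict(set)  # structure -> identifiers
        self._by_id = {}                   # identifier -> structure

    def load(self, source, identifier, inchi):
        """Register one identifier from one source against a structure."""
        self._by_inchi[inchi].add((source, identifier))
        self._by_id[(source, identifier)] = inchi

    def links(self, source, identifier):
        """Return identifiers from all other sources sharing the structure."""
        inchi = self._by_id.get((source, identifier))
        if inchi is None:
            return []
        return sorted(
            (s, i) for s, i in self._by_inchi[inchi] if s != source
        )


if __name__ == "__main__":
    methane = "InChI=1S/CH4/h1H4"
    water = "InChI=1S/H2O/h1H2"

    index = IdentifierIndex()
    # Hypothetical sources and identifiers, for illustration only.
    index.load("source_a", "A-1", methane)
    index.load("source_b", "B-7", methane)
    index.load("source_c", "C-3", water)

    print(index.links("source_a", "A-1"))
    print(index.links("source_c", "C-3"))
```

Because every source points at the shared structure key rather than at other sources, adding a new source requires no pairwise link maintenance; its links appear as soon as its identifiers are loaded.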
