Búsqueda | Biblioteca Virtual en Salud

1.

A framework to assess the quality and impact of bioinformatics training across ELIXIR.

Gurwitz, Kim T; Singh Gaur, Prakash; Bellis, Louisa J; Larcombe, Lee; Alloza, Eva; Balint, Balint Laszlo; Botzki, Alexander; Dimec, Jure; Dominguez Del Angel, Victoria; Fernandes, Pedro L; Korpelainen, Eija; Krause, Roland; Kuzak, Mateusz; Le Pera, Loredana; Leskosek, Brane; Lindvall, Jessica M; Marek, Diana; Martinez, Paula A; Muyldermans, Tuur; Nygård, Ståle; Palagi, Patricia M; Peterson, Hedi; Psomopoulos, Fotis; Spiwok, Vojtech; van Gelder, Celia W G; Via, Allegra; Vidak, Marko; Wibberg, Daniel; Morgan, Sarah L; Rustici, Gabriella.

PLoS Comput Biol ; 16(7): e1007976, 2020 07.

Artículo en Inglés | MEDLINE | ID: mdl-32702016

RESUMEN

ELIXIR is a pan-European intergovernmental organisation for life science that aims to coordinate bioinformatics resources in a single infrastructure across Europe; bioinformatics training is central to its strategy, which aims to develop a training community that spans all ELIXIR member states. In an evidence-based approach for strengthening bioinformatics training programmes across Europe, the ELIXIR Training Platform, led by the ELIXIR EXCELERATE Quality and Impact Assessment Subtask in collaboration with the ELIXIR Training Coordinators Group, has implemented an assessment strategy to measure quality and impact of its entire training portfolio. Here, we present ELIXIR's framework for assessing training quality and impact, which includes the following: specifying assessment aims, determining what data to collect in order to address these aims, and our strategy for centralised data collection to allow for ELIXIR-wide analyses. In addition, we present an overview of the ELIXIR training data collected over the past 4 years. We highlight the importance of a coordinated and consistent data collection approach and the relevance of defining specific metrics and answer scales for consortium-wide analyses as well as for comparison of data across iterations of the same course.

Asunto(s)

Biología Computacional/educación , Control de Calidad , Algoritmos , Investigación Biomédica , Biología Computacional/normas , Curriculum , Recolección de Datos , Bases de Datos Factuales , Educación Continua , Europa (Continente) , Evaluación de Programas y Proyectos de Salud , Reproducibilidad de los Resultados , Investigadores , Programas Informáticos , Interfaz Usuario-Computador

2.

Ten simple rules for measuring the impact of workshops.

Sufi, Shoaib; Nenadic, Aleksandra; Silva, Raniere; Duckles, Beth; Simera, Iveta; de Beyer, Jennifer A; Struthers, Caroline; Nurmikko-Fuller, Terhi; Bellis, Louisa; Miah, Wadud; Wilde, Adriana; Emsley, Iain; Philippe, Olivier; Balzano, Melissa; Coelho, Sara; Ford, Heather; Jones, Catherine; Higgins, Vanessa.

PLoS Comput Biol ; 14(8): e1006191, 2018 08.

Artículo en Inglés | MEDLINE | ID: mdl-30161124

RESUMEN

Workshops are used to explore a specific topic, to transfer knowledge, to solve identified problems, or to create something new. In funded research projects and other research endeavours, workshops are the mechanism used to gather the wider project, community, or interested people together around a particular topic. However, natural questions arise: how do we measure the impact of these workshops? Do we know whether they are meeting the goals and objectives we set for them? What indicators should we use? In response to these questions, this paper will outline rules that will improve the measurement of the impact of workshops.

Asunto(s)

Educación/normas , Humanos , Conocimiento , Aprendizaje , Investigación , Pesos y Medidas

3.

The ChEMBL database in 2017.

Gaulton, Anna; Hersey, Anne; Nowotka, Michal; Bento, A Patrícia; Chambers, Jon; Mendez, David; Mutowo, Prudence; Atkinson, Francis; Bellis, Louisa J; Cibrián-Uhalte, Elena; Davies, Mark; Dedman, Nathan; Karlsson, Anneli; Magariños, María Paula; Overington, John P; Papadatos, George; Smit, Ines; Leach, Andrew R.

Nucleic Acids Res ; 45(D1): D945-D954, 2017 01 04.

Artículo en Inglés | MEDLINE | ID: mdl-27899562

RESUMEN

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 and 2014 Nucleic Acids Research Database Issues. Since then, alongside the continued extraction of data from the medicinal chemistry literature, new sources of bioactivity data have also been added to the database. These include: deposited data sets from neglected disease screening; crop protection data; drug metabolism and disposition data and bioactivity data from patents. A number of improvements and new features have also been incorporated. These include the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts. The ChEMBL data can be accessed via a web-interface, RDF distribution, data downloads and RESTful web-services.

Asunto(s)

Bases de Datos de Compuestos Químicos , Bases de Datos de Ácidos Nucleicos , Motor de Búsqueda , Biología Computacional/métodos , Protección de Cultivos , Descubrimiento de Drogas , Ontología de Genes , Humanos , Anotación de Secuencia Molecular , Farmacología/métodos , Interfaz Usuario-Computador , Navegador Web

4.

ChEMBL web services: streamlining access to drug discovery data and utilities.

Davies, Mark; Nowotka, Michal; Papadatos, George; Dedman, Nathan; Gaulton, Anna; Atkinson, Francis; Bellis, Louisa; Overington, John P.

Nucleic Acids Res ; 43(W1): W612-20, 2015 Jul 01.

Artículo en Inglés | MEDLINE | ID: mdl-25883136

RESUMEN

ChEMBL is now a well-established resource in the fields of drug discovery and medicinal chemistry research. The ChEMBL database curates and stores standardized bioactivity, molecule, target and drug data extracted from multiple sources, including the primary medicinal chemistry literature. Programmatic access to ChEMBL data has been improved by a recent update to the ChEMBL web services (version 2.0.x, https://www.ebi.ac.uk/chembl/api/data/docs), which exposes significantly more data from the underlying database and introduces new functionality. To complement the data-focused services, a utility service (version 1.0.x, https://www.ebi.ac.uk/chembl/api/utils/docs), which provides RESTful access to commonly used cheminformatics methods, has also been concurrently developed. The ChEMBL web services can be used together or independently to build applications and data processing workflows relevant to drug discovery and chemical biology.

Asunto(s)

Bases de Datos de Compuestos Químicos , Descubrimiento de Drogas , Internet , Integración de Sistemas , Interfaz Usuario-Computador

5.

The ChEMBL bioactivity database: an update.

Bento, A Patrícia; Gaulton, Anna; Hersey, Anne; Bellis, Louisa J; Chambers, Jon; Davies, Mark; Krüger, Felix A; Light, Yvonne; Mak, Lora; McGlinchey, Shaun; Nowotka, Michal; Papadatos, George; Santos, Rita; Overington, John P.

Nucleic Acids Res ; 42(Database issue): D1083-90, 2014 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-24214965

RESUMEN

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 Nucleic Acids Research Database Issue. Since then, a variety of new data sources and improvements in functionality have contributed to the growth and utility of the resource. In particular, more comprehensive tracking of compounds from research stages through clinical development to market is provided through the inclusion of data from United States Adopted Name applications; a new richer data model for representing drug targets has been developed; and a number of methods have been put in place to allow users to more easily identify reliable data. Finally, access to ChEMBL is now available via a new Resource Description Framework format, in addition to the web-based interface, data downloads and web services.

Asunto(s)

Bases de Datos de Compuestos Químicos , Descubrimiento de Drogas , Sitios de Unión , Humanos , Internet , Ligandos , Preparaciones Farmacéuticas/química , Proteínas/química , Proteínas/efectos de los fármacos

6.

Chemical databases: curation or integration by user-defined equivalence?

Hersey, Anne; Chambers, Jon; Bellis, Louisa; Patrícia Bento, A; Gaulton, Anna; Overington, John P.

Drug Discov Today Technol ; 14: 17-24, 2015 Jul.

Artículo en Inglés | MEDLINE | ID: mdl-26194583

RESUMEN

There is a wealth of valuable chemical information in publicly available databases for use by scientists undertaking drug discovery. However finite curation resource, limitations of chemical structure software and differences in individual database applications mean that exact chemical structure equivalence between databases is unlikely to ever be a reality. The ability to identify compound equivalence has been made significantly easier by the use of the International Chemical Identifier (InChI), a non-proprietary line-notation for describing a chemical structure. More importantly, advances in methods to identify compounds that are the same at various levels of similarity, such as those containing the same parent component or having the same connectivity, are now enabling related compounds to be linked between databases where the structure matches are not exact.

Asunto(s)

Bases de Datos de Compuestos Químicos , Descubrimiento de Drogas , Estructura Molecular , Programas Informáticos

7.

ChEMBL: a large-scale bioactivity database for drug discovery.

Gaulton, Anna; Bellis, Louisa J; Bento, A Patricia; Chambers, Jon; Davies, Mark; Hersey, Anne; Light, Yvonne; McGlinchey, Shaun; Michalovich, David; Al-Lazikani, Bissan; Overington, John P.

Nucleic Acids Res ; 40(Database issue): D1100-7, 2012 Jan.

Artículo en Inglés | MEDLINE | ID: mdl-21948594

RESUMEN

ChEMBL is an Open Data database containing binding, functional and ADMET information for a large number of drug-like bioactive compounds. These data are manually abstracted from the primary published literature on a regular basis, then further curated and standardized to maximize their quality and utility across a wide range of chemical biology and drug-discovery research problems. Currently, the database contains 5.4 million bioactivity measurements for more than 1 million compounds and 5200 protein targets. Access is available through a web-based interface, data downloads and web services at: https://www.ebi.ac.uk/chembldb.

Asunto(s)

Bases de Datos Factuales , Descubrimiento de Drogas , Bases de Datos de Proteínas , Humanos , Preparaciones Farmacéuticas/química , Proteínas/química , Proteínas/metabolismo , Interfaz Usuario-Computador

8.

Estimating error rates in bioactivity databases.

Tiikkainen, Pekka; Bellis, Louisa; Light, Yvonne; Franke, Lutz.

J Chem Inf Model ; 53(10): 2499-505, 2013 Oct 28.

Artículo en Inglés | MEDLINE | ID: mdl-24160896

RESUMEN

Bioactivity databases are routinely used in drug discovery to look-up and, using prediction tools, to predict potential targets for small molecules. These databases are typically manually curated from patents and scientific articles. Apart from errors in the source document, the human factor can cause errors during the extraction process. These errors can lead to wrong decisions in the early drug discovery process. In the current work, we have compared bioactivity data from three large databases (ChEMBL, Liceptor, and WOMBAT) who have curated data from the same source documents. As a result, we are able to report error rate estimates for individual activity parameters and individual bioactivity databases. Small molecule structures have the greatest estimated error rate followed by target, activity value, and activity type. This order is also reflected in supplier-specific error rate estimates. The results are also useful in identifying data points for recuration. We hope the results will lead to a more widespread awareness among scientists on the frequencies and types of errors in bioactivity data.

Asunto(s)

Bibliometría , Descubrimiento de Drogas/estadística & datos numéricos , Proteínas/química , Sesgo de Publicación , Bibliotecas de Moléculas Pequeñas/química , Bases de Datos Bibliográficas , Bases de Datos de Compuestos Químicos , Bases de Datos Farmacéuticas , Humanos , Ligandos , Patentes como Asunto , Proteínas/agonistas , Proteínas/antagonistas & inhibidores

9.

Collation and data-mining of literature bioactivity data for drug discovery.

Bellis, Louisa J; Akhtar, Ruth; Al-Lazikani, Bissan; Atkinson, Francis; Bento, A Patricia; Chambers, Jon; Davies, Mark; Gaulton, Anna; Hersey, Anne; Ikeda, Kazuyoshi; Krüger, Felix A; Light, Yvonne; McGlinchey, Shaun; Santos, Rita; Stauch, Benjamin; Overington, John P.

Biochem Soc Trans ; 39(5): 1365-70, 2011 Oct.

Artículo en Inglés | MEDLINE | ID: mdl-21936816

RESUMEN

The challenge of translating the huge amount of genomic and biochemical data into new drugs is a costly and challenging task. Historically, there has been comparatively little focus on linking the biochemical and chemical worlds. To address this need, we have developed ChEMBL, an online resource of small-molecule SAR (structure-activity relationship) data, which can be used to support chemical biology, lead discovery and target selection in drug discovery. The database contains the abstracted structures, properties and biological activities for over 700000 distinct compounds and in excess of more than 3 million bioactivity records abstracted from over 40000 publications. Additional public domain resources can be readily integrated into the same data model (e.g. PubChem BioAssay data). The compounds in ChEMBL are largely extracted from the primary medicinal chemistry literature, and are therefore usually 'drug-like' or 'lead-like' small molecules with full experimental context. The data cover a significant fraction of the discovery of modern drugs, and are useful in a wide range of drug design and discovery tasks. In addition to the compound data, ChEMBL also contains information for over 8000 protein, cell line and whole-organism 'targets', with over 4000 of those being proteins linked to their underlying genes. The database is searchable both chemically, using an interactive compound sketch tool, protein sequences, family hierarchies, SMILES strings, compound research codes and key words, and biologically, using a variety of gene identifiers, protein sequence similarity and protein families. The information retrieved can then be readily filtered and downloaded into various formats. ChEMBL can be accessed online at https://www.ebi.ac.uk/chembldb.

Asunto(s)

Minería de Datos , Bases de Datos Factuales , Descubrimiento de Drogas , Animales , Biología Computacional/métodos , Genómica , Humanos , Almacenamiento y Recuperación de la Información , Estructura Molecular , Preparaciones Farmacéuticas/química , Preparaciones Farmacéuticas/metabolismo , Proteínas/química , Relación Estructura-Actividad

10.

An open source chemical structure curation pipeline using RDKit.

Bento, A Patrícia; Hersey, Anne; Félix, Eloy; Landrum, Greg; Gaulton, Anna; Atkinson, Francis; Bellis, Louisa J; De Veij, Marleen; Leach, Andrew R.

J Cheminform ; 12(1): 51, 2020 Sep 01.

Artículo en Inglés | MEDLINE | ID: mdl-33431044

RESUMEN

BACKGROUND: The ChEMBL database is one of a number of public databases that contain bioactivity data on small molecule compounds curated from diverse sources. Incoming compounds are typically not standardised according to consistent rules. In order to maintain the quality of the final database and to easily compare and integrate data on the same compound from different sources it is necessary for the chemical structures in the database to be appropriately standardised. RESULTS: A chemical curation pipeline has been developed using the open source toolkit RDKit. It comprises three components: a Checker to test the validity of chemical structures and flag any serious errors; a Standardizer which formats compounds according to defined rules and conventions and a GetParent component that removes any salts and solvents from the compound to create its parent. This pipeline has been applied to the latest version of the ChEMBL database as well as uncurated datasets from other sources to test the robustness of the process and to identify common issues in database molecular structures. CONCLUSION: All the components of the structure pipeline have been made freely available for other researchers to use and adapt for their own use. The code is available in a GitHub repository and it can also be accessed via the ChEMBL Beaker webservices. It has been used successfully to standardise the nearly 2 million compounds in the ChEMBL database and the compound validity checker has been used to identify compounds with the most serious issues so that they can be prioritised for manual curation.

11.

Training bioinformaticians in High Performance Computing.

Pérez-Wohlfeil, Esteban; Torreno, Oscar; Bellis, Louisa J; Fernandes, Pedro L; Leskosek, Brane; Trelles, Oswaldo.

Heliyon ; 4(12): e01057, 2018 Dec.

Artículo en Inglés | MEDLINE | ID: mdl-30582061

RESUMEN

In the last decade, bioinformatics has become an indispensable branch of modern science research, experiencing an explosion in financial support, developed applications and data collection. The growth of the datasets that are emerging from research laboratories, industry, the health sector, etc., are increasingly raising the levels of demand in computing power and storage. Processing biological data, in the large scales of these datasets, often requires the use of High Performance Computing (HPC) resources, especially when dealing with certain types of omics data, such as genomic and metagenomic data. Such computational resources not only require substantial investments, but they also involve high maintenance costs. More importantly, in order to keep good returns from the investments, specific training needs to be put in place to ensure that wasting is minimized. Furthermore, given that bioinformatics is a highly interdisciplinary field where several other domains intersect (such as biology, chemistry, physics and computer science), researchers from these areas also require bioinformatics-specific training in HPC, in order to fully take advantage of supercomputing centers. In this document, we describe our experience in training researchers from several different disciplines in HPC, as applied to bioinformatics under the framework of the leading European bioinformatics platform ELIXIR, and analyze both the content and outcomes of the course.

12.

A large-scale crop protection bioassay data set.

Gaulton, Anna; Kale, Namrata; van Westen, Gerard J P; Bellis, Louisa J; Bento, A Patrícia; Davies, Mark; Hersey, Anne; Papadatos, George; Forster, Mark; Wege, Philip; Overington, John P.

Sci Data ; 2: 150032, 2015.

Artículo en Inglés | MEDLINE | ID: mdl-26175909

RESUMEN

ChEMBL is a large-scale drug discovery database containing bioactivity information primarily extracted from scientific literature. Due to the medicinal chemistry focus of the journals from which data are extracted, the data are currently of most direct value in the field of human health research. However, many of the scientific use-cases for the current data set are equally applicable in other fields, such as crop protection research: for example, identification of chemical scaffolds active against a particular target or endpoint, the de-convolution of the potential targets of a phenotypic assay, or the potential targets/pathways for safety liabilities. In order to broaden the applicability of the ChEMBL database and allow more widespread use in crop protection research, an extensive data set of bioactivity data of insecticidal, fungicidal and herbicidal compounds and assays was collated and added to the database.

Asunto(s)

Protección de Cultivos , Bases de Datos de Compuestos Químicos , Bioensayo , Herbicidas , Insecticidas

13.

UniChem: a unified chemical structure cross-referencing and identifier tracking system.

Chambers, Jon; Davies, Mark; Gaulton, Anna; Hersey, Anne; Velankar, Sameer; Petryszak, Robert; Hastings, Janna; Bellis, Louisa; McGlinchey, Shaun; Overington, John P.

J Cheminform ; 5(1): 3, 2013 Jan 14.

Artículo en Inglés | MEDLINE | ID: mdl-23317286

RESUMEN

UniChem is a freely available compound identifier mapping service on the internet, designed to optimize the efficiency with which structure-based hyperlinks may be built and maintained between chemistry-based resources. In the past, the creation and maintenance of such links at EMBL-EBI, where several chemistry-based resources exist, has required independent efforts by each of the separate teams. These efforts were complicated by the different data models, release schedules, and differing business rules for compound normalization and identifier nomenclature that exist across the organization. UniChem, a large-scale, non-redundant database of Standard InChIs with pointers between these structures and chemical identifiers from all the separate chemistry resources, was developed as a means of efficiently sharing the maintenance overhead of creating these links. Thus, for each source represented in UniChem, all links to and from all other sources are automatically calculated and immediately available for all to use. Updated mappings are immediately available upon loading of new data releases from the sources. Web services in UniChem provide users with a single simple automatable mechanism for maintaining all links from their resource to all other sources represented in UniChem. In addition, functionality to track changes in identifier usage allows users to monitor which identifiers are current, and which are obsolete. Lastly, UniChem has been deliberately designed to allow additional resources to be included with minimal effort. Indeed, the recent inclusion of data sources external to EMBL-EBI has provided a simple means of providing users with an even wider selection of resources with which to link to, all at no extra cost, while at the same time providing a simple mechanism for external resources to link to all EMBL-EBI chemistry resources.

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

Detalles de la búsqueda