Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 49
Filtrar
1.
Nucleic Acids Res ; 52(D1): D1180-D1192, 2024 Jan 05.
Artigo em Inglês | MEDLINE | ID: mdl-37933841

RESUMO

ChEMBL (https://www.ebi.ac.uk/chembl/) is a manually curated, high-quality, large-scale, open, FAIR and Global Core Biodata Resource of bioactive molecules with drug-like properties, previously described in the 2012, 2014, 2017 and 2019 Nucleic Acids Research Database Issues. Since its introduction in 2009, ChEMBL's content has changed dramatically in size and diversity of data types. Through incorporation of multiple new datasets from depositors since the 2019 update, ChEMBL now contains slightly more bioactivity data from deposited data vs data extracted from literature. In collaboration with the EUbOPEN consortium, chemical probe data is now regularly deposited into ChEMBL. Release 27 made curated data available for compounds screened for potential anti-SARS-CoV-2 activity from several large-scale drug repurposing screens. In addition, new patent bioactivity data have been added to the latest ChEMBL releases, and various new features have been incorporated, including a Natural Product likeness score, updated flags for Natural Products, a new flag for Chemical Probes, and the initial annotation of the action type for ∼270 000 bioactivity measurements.


Assuntos
Descoberta de Drogas , Bases de Dados Factuais , Fatores de Tempo
2.
PeerJ ; 11: e15153, 2023.
Artigo em Inglês | MEDLINE | ID: mdl-37151295

RESUMO

The patent literature is a potentially valuable source of bioactivity data. In this article we describe a process to prioritise 3.7 million life science relevant patents obtained from the SureChEMBL database (https://www.surechembl.org/), according to how likely they were to contain bioactivity data for potent small molecules on less-studied targets, based on the classification developed by the Illuminating the Druggable Genome (IDG) project. The overall goal was to select a smaller number of patents that could be manually curated and incorporated into the ChEMBL database. Using relatively simple annotation and filtering pipelines, we have been able to identify a substantial number of patents containing quantitative bioactivity data for understudied targets that had not previously been reported in the peer-reviewed medicinal chemistry literature. We quantify the added value of such methods in terms of the numbers of targets that are so identified, and provide some specific illustrative examples. Our work underlines the potential value in searching the patent corpus in addition to the more traditional peer-reviewed literature. The small molecules found in these patents, together with their measured activity against the targets, are now accessible via the ChEMBL database.


Assuntos
Química Farmacêutica , Descoberta de Drogas , Descoberta de Drogas/métodos , Bases de Dados Factuais
4.
Nat Commun ; 12(1): 6120, 2021 10 21.
Artigo em Inglês | MEDLINE | ID: mdl-34675202

RESUMO

Drug target Mendelian randomization (MR) studies use DNA sequence variants in or near a gene encoding a drug target, that alter the target's expression or function, as a tool to anticipate the effect of drug action on the same target. Here we apply MR to prioritize drug targets for their causal relevance for coronary heart disease (CHD). The targets are further prioritized using independent replication, co-localization, protein expression profiles and data from the British National Formulary and clinicaltrials.gov. Out of the 341 drug targets identified through their association with blood lipids (HDL-C, LDL-C and triglycerides), we robustly prioritize 30 targets that might elicit beneficial effects in the prevention or treatment of CHD, including NPC1L1 and PCSK9, the targets of drugs used in CHD prevention. We discuss how this approach can be generalized to other targets, disease biomarkers and endpoints to help prioritize and validate targets during the drug development process.


Assuntos
Doença das Coronárias/tratamento farmacológico , Doença das Coronárias/genética , Análise da Randomização Mendeliana , HDL-Colesterol/sangue , LDL-Colesterol/sangue , Doença das Coronárias/sangue , Humanos , Proteínas de Membrana Transportadoras/genética , Pró-Proteína Convertase 9/genética , Triglicerídeos/sangue
5.
J Med Chem ; 64(11): 7210-7230, 2021 06 10.
Artigo em Inglês | MEDLINE | ID: mdl-33983732

RESUMO

Physicochemical descriptors commonly used to define "drug-likeness" and ligand efficiency measures are assessed for their ability to differentiate marketed drugs from compounds reported to bind to their efficacious target or targets. Using ChEMBL version 26, a data set of 643 drugs acting on 271 targets was assembled, comprising 1104 drug-target pairs having ≥100 published compounds per target. Taking into account changes in their physicochemical properties over time, drugs are analyzed according to their target class, therapy area, and route of administration. Recent drugs, approved in 2010-2020, display no overall differences in molecular weight, lipophilicity, hydrogen bonding, or polar surface area from their target comparator compounds. Drugs are differentiated from target comparators by higher potency, ligand efficiency (LE), lipophilic ligand efficiency (LLE), and lower carboaromaticity. Overall, 96% of drugs have LE or LLE values, or both, greater than the median values of their target comparator compounds.


Assuntos
Ligantes , Preparações Farmacêuticas/química , Bases de Dados de Compostos Químicos , Vias de Administração de Medicamentos , Ligação de Hidrogênio , Interações Hidrofóbicas e Hidrofílicas , Peso Molecular , Preparações Farmacêuticas/metabolismo
6.
Nat Med ; 27(4): 668-676, 2021 04.
Artigo em Inglês | MEDLINE | ID: mdl-33837377

RESUMO

Drug repurposing provides a rapid approach to meet the urgent need for therapeutics to address COVID-19. To identify therapeutic targets relevant to COVID-19, we conducted Mendelian randomization analyses, deriving genetic instruments based on transcriptomic and proteomic data for 1,263 actionable proteins that are targeted by approved drugs or in clinical phase of drug development. Using summary statistics from the Host Genetics Initiative and the Million Veteran Program, we studied 7,554 patients hospitalized with COVID-19 and >1 million controls. We found significant Mendelian randomization results for three proteins (ACE2, P = 1.6 × 10-6; IFNAR2, P = 9.8 × 10-11 and IL-10RB, P = 2.3 × 10-14) using cis-expression quantitative trait loci genetic instruments that also had strong evidence for colocalization with COVID-19 hospitalization. To disentangle the shared expression quantitative trait loci signal for IL10RB and IFNAR2, we conducted phenome-wide association scans and pathway enrichment analysis, which suggested that IFNAR2 is more likely to play a role in COVID-19 hospitalization. Our findings prioritize trials of drugs targeting IFNAR2 and ACE2 for early management of COVID-19.


Assuntos
COVID-19/genética , Reposicionamento de Medicamentos , Análise da Randomização Mendeliana/métodos , SARS-CoV-2 , Enzima de Conversão de Angiotensina 2/genética , Enzima de Conversão de Angiotensina 2/fisiologia , Estudo de Associação Genômica Ampla , Humanos , Subunidade beta de Receptor de Interleucina-10/genética , Subunidade beta de Receptor de Interleucina-10/fisiologia , Locos de Características Quantitativas , Receptor de Interferon alfa e beta/genética , Receptor de Interferon alfa e beta/fisiologia , Tratamento Farmacológico da COVID-19
7.
Chem Res Toxicol ; 34(2): 385-395, 2021 02 15.
Artigo em Inglês | MEDLINE | ID: mdl-33507738

RESUMO

The safety of marketed drugs is an ongoing concern, with some of the more frequently prescribed medicines resulting in serious or life-threatening adverse effects in some patients. Safety-related information for approved drugs has been curated to include the assignment of toxicity class(es) based on their withdrawn status and/or black box warning information described on medicinal product labels. The ChEMBL resource contains a wide range of bioactivity data types, from early "Discovery" stage preclinical data for individual compounds through to postclinical data on marketed drugs; the inclusion of the curated drug safety data set within this framework can support a wide range of safety-related drug discovery questions. The curated drug safety data set will be made freely available through ChEMBL and updated in future database releases.


Assuntos
Preparações Farmacêuticas/química , Curadoria de Dados , Aprovação de Drogas , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Humanos , Modelos Moleculares
8.
Cell ; 182(3): 685-712.e19, 2020 08 06.
Artigo em Inglês | MEDLINE | ID: mdl-32645325

RESUMO

The causative agent of the coronavirus disease 2019 (COVID-19) pandemic, severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2), has infected millions and killed hundreds of thousands of people worldwide, highlighting an urgent need to develop antiviral therapies. Here we present a quantitative mass spectrometry-based phosphoproteomics survey of SARS-CoV-2 infection in Vero E6 cells, revealing dramatic rewiring of phosphorylation on host and viral proteins. SARS-CoV-2 infection promoted casein kinase II (CK2) and p38 MAPK activation, production of diverse cytokines, and shutdown of mitotic kinases, resulting in cell cycle arrest. Infection also stimulated a marked induction of CK2-containing filopodial protrusions possessing budding viral particles. Eighty-seven drugs and compounds were identified by mapping global phosphorylation profiles to dysregulated kinases and pathways. We found pharmacologic inhibition of the p38, CK2, CDK, AXL, and PIKFYVE kinases to possess antiviral efficacy, representing potential COVID-19 therapies.


Assuntos
Betacoronavirus/metabolismo , Infecções por Coronavirus/metabolismo , Avaliação Pré-Clínica de Medicamentos/métodos , Pneumonia Viral/metabolismo , Proteômica/métodos , Células A549 , Enzima de Conversão de Angiotensina 2 , Animais , Antivirais/farmacologia , COVID-19 , Células CACO-2 , Caseína Quinase II/antagonistas & inibidores , Caseína Quinase II/metabolismo , Chlorocebus aethiops , Infecções por Coronavirus/virologia , Quinases Ciclina-Dependentes/antagonistas & inibidores , Quinases Ciclina-Dependentes/metabolismo , Células HEK293 , Interações Hospedeiro-Patógeno , Humanos , Pandemias , Peptidil Dipeptidase A/genética , Peptidil Dipeptidase A/metabolismo , Fosfatidilinositol 3-Quinases/metabolismo , Inibidores de Fosfoinositídeo-3 Quinase/farmacologia , Fosforilação , Pneumonia Viral/virologia , Inibidores de Proteínas Quinases/farmacologia , Proteínas Proto-Oncogênicas/antagonistas & inibidores , Proteínas Proto-Oncogênicas/metabolismo , Receptores Proteína Tirosina Quinases/antagonistas & inibidores , Receptores Proteína Tirosina Quinases/metabolismo , SARS-CoV-2 , Glicoproteína da Espícula de Coronavírus/metabolismo , Células Vero , Proteínas Quinases p38 Ativadas por Mitógeno/antagonistas & inibidores , Proteínas Quinases p38 Ativadas por Mitógeno/metabolismo , Receptor Tirosina Quinase Axl
9.
J Cheminform ; 12(1): 51, 2020 Sep 01.
Artigo em Inglês | MEDLINE | ID: mdl-33431044

RESUMO

BACKGROUND: The ChEMBL database is one of a number of public databases that contain bioactivity data on small molecule compounds curated from diverse sources. Incoming compounds are typically not standardised according to consistent rules. In order to maintain the quality of the final database and to easily compare and integrate data on the same compound from different sources it is necessary for the chemical structures in the database to be appropriately standardised. RESULTS: A chemical curation pipeline has been developed using the open source toolkit RDKit. It comprises three components: a Checker to test the validity of chemical structures and flag any serious errors; a Standardizer which formats compounds according to defined rules and conventions and a GetParent component that removes any salts and solvents from the compound to create its parent. This pipeline has been applied to the latest version of the ChEMBL database as well as uncurated datasets from other sources to test the robustness of the process and to identify common issues in database molecular structures. CONCLUSION: All the components of the structure pipeline have been made freely available for other researchers to use and adapt for their own use. The code is available in a GitHub repository and it can also be accessed via the ChEMBL Beaker webservices. It has been used successfully to standardise the nearly 2 million compounds in the ChEMBL database and the compound validity checker has been used to identify compounds with the most serious issues so that they can be prioritised for manual curation.

10.
Sci Rep ; 9(1): 18911, 2019 12 11.
Artigo em Inglês | MEDLINE | ID: mdl-31827124

RESUMO

Lack of efficacy in the intended disease indication is the major cause of clinical phase drug development failure. Explanations could include the poor external validity of pre-clinical (cell, tissue, and animal) models of human disease and the high false discovery rate (FDR) in preclinical science. FDR is related to the proportion of true relationships available for discovery (γ), and the type 1 (false-positive) and type 2 (false negative) error rates of the experiments designed to uncover them. We estimated the FDR in preclinical science, its effect on drug development success rates, and improvements expected from use of human genomics rather than preclinical studies as the primary source of evidence for drug target identification. Calculations were based on a sample space defined by all human diseases - the 'disease-ome' - represented as columns; and all protein coding genes - 'the protein-coding genome'- represented as rows, producing a matrix of unique gene- (or protein-) disease pairings. We parameterised the space based on 10,000 diseases, 20,000 protein-coding genes, 100 causal genes per disease and 4000 genes encoding druggable targets, examining the effect of varying the parameters and a range of underlying assumptions, on the inferences drawn. We estimated γ, defined mathematical relationships between preclinical FDR and drug development success rates, and estimated improvements in success rates based on human genomics (rather than orthodox preclinical studies). Around one in every 200 protein-disease pairings was estimated to be causal (γ = 0.005) giving an FDR in preclinical research of 92.6%, which likely makes a major contribution to the reported drug development failure rate of 96%. Observed success rate was only slightly greater than expected for a random pick from the sample space. Values for γ back-calculated from reported preclinical and clinical drug development success rates were also close to the a priori estimates. Substituting genome wide (or druggable genome wide) association studies for preclinical studies as the major information source for drug target identification was estimated to reverse the probability of late stage failure because of the more stringent type 1 error rate employed and the ability to interrogate every potential druggable target in the same experiment. Genetic studies conducted at much larger scale, with greater resolution of disease end-points, e.g. by connecting genomics and electronic health record data within healthcare systems has the potential to produce radical improvement in drug development success rate.


Assuntos
Desenvolvimento de Medicamentos , Genômica , Estudo de Associação Genômica Ampla , Humanos
11.
J Cheminform ; 11(1): 4, 2019 Jan 10.
Artigo em Inglês | MEDLINE | ID: mdl-30631996

RESUMO

Structure-activity relationship modelling is frequently used in the early stage of drug discovery to assess the activity of a compound on one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used for these and related applications over many years, with good success. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. The models were then evaluated on new data published since the original models were built to simulate a "real world" application. The comparative study highlights the similarities between the two techniques but also some differences that it is important to bear in mind when the methods are used in practical drug discovery applications.

12.
J Cheminform ; 11(1): 64, 2019 Nov 06.
Artigo em Inglês | MEDLINE | ID: mdl-33430932

RESUMO

In response to Krstajic's letter to the editor concerning our published paper, we here take the opportunity to reply, to re-iterate that no errors in our work were identified, to provide further details, and to re-emphasise the outputs of our study. Moreover, we highlight that all of the data are freely available for the wider scientific community (including the aforementioned correspondent) to undertake follow-on studies and comparisons.

13.
Nucleic Acids Res ; 47(D1): D930-D940, 2019 01 08.
Artigo em Inglês | MEDLINE | ID: mdl-30398643

RESUMO

ChEMBL is a large, open-access bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012, 2014 and 2017 Nucleic Acids Research Database Issues. In the last two years, several important improvements have been made to the database and are described here. These include more robust capture and representation of assay details; a new data deposition system, allowing updating of data sets and deposition of supplementary data; and a completely redesigned web interface, with enhanced search and filtering capabilities.


Assuntos
Bases de Dados de Produtos Farmacêuticos , Descoberta de Drogas , Bioensaio , Publicações Periódicas como Assunto , Interface Usuário-Computador
14.
Sci Data ; 5: 180230, 2018 10 23.
Artigo em Inglês | MEDLINE | ID: mdl-30351302

RESUMO

ChEMBL is a large-scale, open-access drug discovery resource containing bioactivity information primarily extracted from scientific literature. A substantial dataset of more than 135,000 in vivo assays has been collated as a key resource of animal models for translational medicine within drug discovery. To improve the utility of the in vivo data, an extensive data curation task has been undertaken that allows the assays to be grouped by animal disease model or phenotypic endpoint. The dataset contains previously unavailable information about compounds or drugs tested in animal models and, in conjunction with assay data on protein targets or cell- or tissue- based systems, allows the investigation of the effects of compounds at differing levels of biological complexity. Equally, it enables researchers to identify compounds that have been investigated for a group of disease-, pharmacology- or toxicity-relevant assays.


Assuntos
Bioensaio , Bases de Dados de Compostos Químicos , Descoberta de Drogas/métodos , Animais , Avaliação Pré-Clínica de Medicamentos , Modelos Animais
16.
Nat Rev Drug Discov ; 17(5): 317-332, 2018 05.
Artigo em Inglês | MEDLINE | ID: mdl-29472638

RESUMO

A large proportion of biomedical research and the development of therapeutics is focused on a small fraction of the human genome. In a strategic effort to map the knowledge gaps around proteins encoded by the human genome and to promote the exploration of currently understudied, but potentially druggable, proteins, the US National Institutes of Health launched the Illuminating the Druggable Genome (IDG) initiative in 2014. In this article, we discuss how the systematic collection and processing of a wide array of genomic, proteomic, chemical and disease-related resource data by the IDG Knowledge Management Center have enabled the development of evidence-based criteria for tracking the target development level (TDL) of human proteins, which indicates a substantial knowledge deficit for approximately one out of three proteins in the human proteome. We then present spotlights on the TDL categories as well as key drug target classes, including G protein-coupled receptors, protein kinases and ion channels, which illustrate the nature of the unexplored opportunities for biomedical research and therapeutic development.

17.
Expert Opin Drug Discov ; 12(8): 757-767, 2017 08.
Artigo em Inglês | MEDLINE | ID: mdl-28602100

RESUMO

INTRODUCTION: ChEMBL is a manually curated database of bioactivity data on small drug-like molecules, used by drug discovery scientists. Among many access methods, a REST API provides programmatic access, allowing the remote retrieval of ChEMBL data and its integration into other applications. This approach allows scientists to move from a world where they go to the ChEMBL web site to search for relevant data, to one where ChEMBL data can be simply integrated into their everyday tools and work environment. Areas covered: This review highlights some of the audiences who may benefit from using the ChEMBL API, and the goals they can address, through the description of several use cases. The examples cover a team communication tool (Slack), a data analytics platform (KNIME), batch job management software (Luigi) and Rich Internet Applications. Expert opinion: The advent of web technologies, cloud computing and micro services oriented architectures have made REST APIs an essential ingredient of modern software development models. The widespread availability of tools consuming RESTful resources have made them useful for many groups of users. The ChEMBL API is a valuable resource of drug discovery bioactivity data for professional chemists, chemistry students, data scientists, scientific and web developers.


Assuntos
Bases de Dados de Compostos Químicos , Descoberta de Drogas/métodos , Preparações Farmacêuticas/química , Computação em Nuvem , Humanos , Internet , Software
18.
Sci Transl Med ; 9(383)2017 03 29.
Artigo em Inglês | MEDLINE | ID: mdl-28356508

RESUMO

Target identification (determining the correct drug targets for a disease) and target validation (demonstrating an effect of target perturbation on disease biomarkers and disease end points) are important steps in drug development. Clinically relevant associations of variants in genes encoding drug targets model the effect of modifying the same targets pharmacologically. To delineate drug development (including repurposing) opportunities arising from this paradigm, we connected complex disease- and biomarker-associated loci from genome-wide association studies to an updated set of genes encoding druggable human proteins, to agents with bioactivity against these targets, and, where there were licensed drugs, to clinical indications. We used this set of genes to inform the design of a new genotyping array, which will enable association studies of druggable genes for drug target selection and validation in human disease.


Assuntos
Descoberta de Drogas , Genoma Humano , Terapia de Alvo Molecular , Reposicionamento de Medicamentos , Loci Gênicos , Estudo de Associação Genômica Ampla , Humanos , Desequilíbrio de Ligação/genética , Fenótipo , Polimorfismo de Nucleotídeo Único/genética , Reprodutibilidade dos Testes , Pesquisa Translacional Biomédica
19.
Nat Rev Drug Discov ; 16(1): 19-34, 2017 01.
Artigo em Inglês | MEDLINE | ID: mdl-27910877

RESUMO

The success of mechanism-based drug discovery depends on the definition of the drug target. This definition becomes even more important as we try to link drug response to genetic variation, understand stratified clinical efficacy and safety, rationalize the differences between drugs in the same therapeutic class and predict drug utility in patient subgroups. However, drug targets are often poorly defined in the literature, both for launched drugs and for potential therapeutic agents in discovery and development. Here, we present an updated comprehensive map of molecular targets of approved drugs. We curate a total of 893 human and pathogen-derived biomolecules through which 1,578 US FDA-approved drugs act. These biomolecules include 667 human-genome-derived proteins targeted by drugs for human disease. Analysis of these drug targets indicates the continued dominance of privileged target families across disease areas, but also the growth of novel first-in-class mechanisms, particularly in oncology. We explore the relationships between bioactivity class and clinical success, as well as the presence of orthologues between human and animal models and between pathogen and human genomes. Through the collaboration of three independent teams, we highlight some of the ongoing challenges in accurately defining the targets of molecular therapeutics and present conventions for deconvoluting the complexities of molecular pharmacology and drug efficacy.


Assuntos
Sistemas de Liberação de Medicamentos/tendências , Descoberta de Drogas/tendências , Farmacogenética/tendências , Bases de Dados de Produtos Farmacêuticos , Aprovação de Drogas , Prescrições de Medicamentos/estatística & dados numéricos , Variação Genética , Genoma Humano , Humanos , Estados Unidos , United States Food and Drug Administration
20.
Nucleic Acids Res ; 45(D1): D945-D954, 2017 01 04.
Artigo em Inglês | MEDLINE | ID: mdl-27899562

RESUMO

ChEMBL is an open large-scale bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012 and 2014 Nucleic Acids Research Database Issues. Since then, alongside the continued extraction of data from the medicinal chemistry literature, new sources of bioactivity data have also been added to the database. These include: deposited data sets from neglected disease screening; crop protection data; drug metabolism and disposition data and bioactivity data from patents. A number of improvements and new features have also been incorporated. These include the annotation of assays and targets using ontologies, the inclusion of targets and indications for clinical candidates, addition of metabolic pathways for drugs and calculation of structural alerts. The ChEMBL data can be accessed via a web-interface, RDF distribution, data downloads and RESTful web-services.


Assuntos
Bases de Dados de Compostos Químicos , Bases de Dados de Ácidos Nucleicos , Ferramenta de Busca , Biologia Computacional/métodos , Proteção de Cultivos , Descoberta de Drogas , Ontologia Genética , Humanos , Anotação de Sequência Molecular , Farmacologia/métodos , Interface Usuário-Computador , Navegador
SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA