RESUMO
ChEMBL (https://www.ebi.ac.uk/chembl/) is a manually curated, high-quality, large-scale, open, FAIR and Global Core Biodata Resource of bioactive molecules with drug-like properties, previously described in the 2012, 2014, 2017 and 2019 Nucleic Acids Research Database Issues. Since its introduction in 2009, ChEMBL's content has changed dramatically in size and diversity of data types. Through incorporation of multiple new datasets from depositors since the 2019 update, ChEMBL now contains slightly more bioactivity data from deposited data vs data extracted from literature. In collaboration with the EUbOPEN consortium, chemical probe data is now regularly deposited into ChEMBL. Release 27 made curated data available for compounds screened for potential anti-SARS-CoV-2 activity from several large-scale drug repurposing screens. In addition, new patent bioactivity data have been added to the latest ChEMBL releases, and various new features have been incorporated, including a Natural Product likeness score, updated flags for Natural Products, a new flag for Chemical Probes, and the initial annotation of the action type for â¼270 000 bioactivity measurements.
Assuntos
Descoberta de Drogas , Bases de Dados Factuais , Fatores de TempoRESUMO
Posttraumatic stress disorder (PTSD) is a significant public health concern, with only a third of patients recovering within a year of treatment. While PTSD often disrupts the sense of body ownership and sense of agency (SA), attention to the SA in trauma has been lacking. This perspective paper explores the loss of the SA in PTSD and its relevance in the development of symptoms. Trauma is viewed as a breakdown of the SA, related to a freeze response, with peritraumatic dissociation increasing the risk of PTSD. Drawing from embodied cognition, we propose an enactive perspective of PTSD, suggesting therapies that restore the SA through direct engagement with the body and environment. We discuss the potential of agency-based therapies and innovative technologies such as gesture sonification, which translates body movements into sounds to enhance the SA. Gesture sonification offers a screen-free, noninvasive approach that could complement existing trauma-focused therapies. We emphasize the need for interdisciplinary collaboration and clinical research to further explore these approaches in preventing and treating PTSD.
Assuntos
Transtornos de Estresse Pós-Traumáticos , Humanos , Transtornos de Estresse Pós-Traumáticos/terapia , Transtornos de Estresse Pós-Traumáticos/psicologia , GestosRESUMO
The safety of marketed drugs is an ongoing concern, with some of the more frequently prescribed medicines resulting in serious or life-threatening adverse effects in some patients. Safety-related information for approved drugs has been curated to include the assignment of toxicity class(es) based on their withdrawn status and/or black box warning information described on medicinal product labels. The ChEMBL resource contains a wide range of bioactivity data types, from early "Discovery" stage preclinical data for individual compounds through to postclinical data on marketed drugs; the inclusion of the curated drug safety data set within this framework can support a wide range of safety-related drug discovery questions. The curated drug safety data set will be made freely available through ChEMBL and updated in future database releases.
Assuntos
Preparações Farmacêuticas/química , Curadoria de Dados , Aprovação de Drogas , Efeitos Colaterais e Reações Adversas Relacionados a Medicamentos , Humanos , Modelos MolecularesRESUMO
ChEMBL is a large, open-access bioactivity database (https://www.ebi.ac.uk/chembl), previously described in the 2012, 2014 and 2017 Nucleic Acids Research Database Issues. In the last two years, several important improvements have been made to the database and are described here. These include more robust capture and representation of assay details; a new data deposition system, allowing updating of data sets and deposition of supplementary data; and a completely redesigned web interface, with enhanced search and filtering capabilities.
Assuntos
Bases de Dados de Produtos Farmacêuticos , Descoberta de Drogas , Bioensaio , Publicações Periódicas como Assunto , Interface Usuário-ComputadorRESUMO
BACKGROUND: Compound selectivity is an important issue when developing a new drug. In many instances, a lack of selectivity can translate to increased toxicity. Protein kinases are particularly concerned with this issue because they share high sequence and structural similarity. However, selectivity may be assessed early on using data generated from protein kinase profiling panels. RESULTS: To guide lead optimization in drug discovery projects, we propose herein two new selectivity metrics, namely window score (WS) and ranking score (RS). These metrics can be applied to standard in vitro data-including intrinsic enzyme activity/affinity (Ki, IC50 or percentage of inhibition), cell-based potency (percentage of effect, EC50) or even kinetics data (Kd, Kon and Koff). They are both easy to compute and offer different viewpoints from which to consider compound selectivity. CONCLUSIONS: We performed a comparative analysis of their respective performance on several data sets against already published selectivity metrics and analyzed how they might influence compound selection. Our results showed that the two new metrics bring additional information to prioritize compound selection. Two novel metrics were developed to better estimate selectivity of compounds screened on multiple proteins.
Assuntos
Biologia Computacional/métodos , Inibidores de Proteínas Quinases/farmacologia , Proteínas Quinases/metabolismo , Linhagem Celular Tumoral , Bases de Dados Factuais , Descoberta de Drogas , Humanos , Modelos TeóricosRESUMO
So far, 518 protein kinases have been identified in the human genome. They share a common mechanism of protein phosphorylation and are involved in many critical biological processes of eukaryotic cells. Deregulation of the kinase phosphorylation function induces severe illnesses such as cancer, diabetes, or inflammatory diseases. Many actors in the pharmaceutical domain have made significant efforts to design potent and selective protein kinase inhibitors as new potential drugs. Because the ATP binding site is highly conserved in the protein kinase family, the design of selective inhibitors remains a challenge and has negatively impacted the progression of drug candidates to late-stage clinical development. The work presented here adopts a 2.5D kinochemometrics (KCM) approach, derived from proteochemometrics (PCM), in which protein kinases are depicted by a novel 3D descriptor and the ligands by 2D fingerprints. We demonstrate in two examples that the protein descriptor successfully classified protein kinases based on their group membership and their Asp-Phe-Gly (DFG) conformation. We also compared the performance of our models with those obtained from a full 2D KCM model and QSAR models. In both cases, the internal validation of the models demonstrated good capabilities to distinguish "active" from "inactive" protein kinase-ligand pairs. However, the external validation performed on two independent data sets showed that the two statistical models tended to overestimate the number of "inactive" pairs.
Assuntos
Biologia Computacional/métodos , Proteínas Quinases/metabolismo , Ligantes , Modelos Moleculares , Ligação Proteica , Conformação Proteica , Inibidores de Proteínas Quinases/química , Inibidores de Proteínas Quinases/metabolismo , Inibidores de Proteínas Quinases/farmacologia , Proteínas Quinases/química , Relação Quantitativa Estrutura-AtividadeRESUMO
Given the difficulties to identify chemical probes that can modulate protein-protein interactions (PPIs), actors in the field have started to agree on the necessity to use PPI-tailored screening chemical collections. However, which type of scaffolds may promote the binding of compounds to PPI targets remains unclear. In this big data analysis, we have identified a list of privileged chemical substructures that are most often observed within inhibitors of PPIs. Using molecular frameworks as a way to perceive chemical substructures with the combination of an experimental and a machine-learning based predicted data set of iPPI compounds, we propose a list of privileged substructures in the form of scaffolds and chemical moieties that can be substantially chemically functionalized and do not present any toxicophore nor pan-assay interference (PAINS) alerts. We think that such chemical guidance will be valuable for medicinal chemists in their attempt to identify initial quality chemical probes on PPI targets.
Assuntos
Modelos Químicos , Proteínas/química , Aprendizado de Máquina , Estrutura Molecular , Bibliotecas de Moléculas PequenasRESUMO
Schizophrenia is a severe, chronic, and heterogeneous mental disorder that affects approximately 1% of the world population. Ongoing research aims at clustering schizophrenia heterogeneity into various "biotypes" to identify subgroups of individuals displaying homogeneous symptoms, etiopathogenesis, prognosis, and treatment response. The present study is in line with this approach and focuses on a biotype partly characterized by a specific membrane lipid composition. We have examined clinical and biological data of patients with stabilized schizophrenia, including the fatty acid content of their erythrocyte membranes, in particular the omega-3 docosahexaenoic acid (DHA). Two groups of patients of similar size were identified: the DHA- group (N = 19) with a lower proportion of membrane DHA as compared to the norm in the general population, and the DHAn group (N = 18) with a normal proportion of DHA. Compared to DHAn, DHA- patients had a higher number of hospitalizations and a lower quality of life in terms of perceived health and physical health. They also exhibited significant higher interleukin-6 and cortisol blood levels. These results emphasize the importance of measuring membrane lipid and immunoinflammatory biomarkers in stabilized patients to identify a specific subgroup and optimize non-pharmacological interventions. It could also guide future research aimed at proposing specific pharmacological treatments.
RESUMO
Efforts to tackle malaria must continue for a disease that threatens half of the global population. Parasite resistance to current therapies requires new chemotypes that are able to demonstrate effectiveness and safety. Previously, we developed a machine-learning-based approach to predict compound antimalarial activity, which was trained on the compound collections of several organizations. The resulting prediction platform, MAIP, was made freely available to the scientific community and offers a solution to prioritize molecules of interest in virtual screening and hit-to-lead optimization. Here, we experimentally validate MAIP and demonstrate how the approach was used in combination with a robust compound selection workflow and a recently introduced innovative high-throughput screening (HTS) cascade to select and purchase compounds from a public library for subsequent experimental screening. We observed a 12-fold enrichment compared with a randomly selected set of molecules, and the eight hits we ultimately selected exhibit good potency and absorption, distribution, metabolism, and excretion (ADME) profiles.
RESUMO
Pre-competitive data sharing can offer the pharmaceutical industry significant benefits in terms of reducing the time and costs involved in getting a new drug to market through more informed testing strategies and knowledge gained by pooling data. If sufficient data is shared and can be co-analyzed, then it can also offer the potential for reduced animal usage and improvements in the in silico prediction of toxicological effects. Data sharing benefits can be further enhanced by applying the FAIR Guiding Principles, reducing time spent curating, transforming and aggregating datasets and allowing more time for data mining and analysis. We hope to facilitate data sharing by other organizations and initiatives by describing lessons learned as part of the Enhancing TRANslational SAFEty Assessment through Integrative Knowledge Management (eTRANSAFE) project, an Innovative Medicines Initiative (IMI) partnership which aims to integrate publicly available data sources with proprietary preclinical and clinical data donated by pharmaceutical organizations. Methods to foster trust and overcome non-technical barriers to data sharing such as legal and IPR (intellectual property rights) are described, including the security requirements that pharmaceutical organizations generally expect to be met. We share the consensus achieved among pharmaceutical partners on decision criteria to be included in internal clearance procedures used to decide if data can be shared. We also report on the consensus achieved on specific data fields to be excluded from sharing for sensitive preclinical safety and pharmacology data that could otherwise not be shared.
Assuntos
Mineração de Dados , Disseminação de Informação , Animais , Simulação por Computador , Indústria FarmacêuticaRESUMO
Malaria is a disease affecting hundreds of millions of people across the world, mainly in developing countries and especially in sub-Saharan Africa. It is the cause of hundreds of thousands of deaths each year and there is an ever-present need to identify and develop effective new therapies to tackle the disease and overcome increasing drug resistance. Here, we extend a previous study in which a number of partners collaborated to develop a consensus in silico model that can be used to identify novel molecules that may have antimalarial properties. The performance of machine learning methods generally improves with the number of data points available for training. One practical challenge in building large training sets is that the data are often proprietary and cannot be straightforwardly integrated. Here, this was addressed by sharing QSAR models, each built on a private data set. We describe the development of an open-source software platform for creating such models, a comprehensive evaluation of methods to create a single consensus model and a web platform called MAIP available at https://www.ebi.ac.uk/chembl/maip/ . MAIP is freely available for the wider community to make large-scale predictions of potential malaria inhibiting compounds. This project also highlights some of the practical challenges in reproducing published computational methods and the opportunities that open-source software can offer to the community.
RESUMO
Protein-protein interactions (PPIs) mediate nearly every cellular process and represent attractive targets for modulating disease states but are challenging to target with small molecules. Despite this, several PPI inhibitors (iPPIs) have entered clinical trials, and a growing number of PPIs have become validated drug targets. However, high-throughput screening efforts still endure low hit rates mainly because of the use of unsuitable screening libraries. Here, we describe the collective effort of a French consortium to build, select, and store in plates a unique chemical library dedicated to the inhibition of PPIs. Using two independent predictive models and two updated databases of experimentally confirmed PPI inhibitors developed by members of the consortium, we built models based on different training sets, molecular descriptors, and machine learning methods. Independent statistical models were used to select putative PPI inhibitors from large commercial compound collections showing great complementarity. Medicinal chemistry filters were applied to remove undesirable structures from this set (such as PAINS, frequent hitters, and toxic compounds) and to improve drug likeness. The remaining compounds were subjected to a clustering procedure to reduce the final size of the library while maintaining its chemical diversity. In practice, the library showed a 46-fold activity rate enhancement when compared to a non-iPPI-enriched diversity library in high-throughput screening against the CD47-SIRPα PPI. The Fr-PPIChem library is plated in 384-well plates and will be distributed on demand to the scientific community as a powerful tool for discovering new chemical probes and early hits for the development of potential therapeutic drugs.
Assuntos
Bases de Dados de Compostos Químicos , Ensaios de Triagem em Larga Escala/métodos , Mapas de Interação de Proteínas , Bibliotecas de Moléculas Pequenas/química , Descoberta de Drogas , Modelos Químicos , Reprodutibilidade dos TestesRESUMO
Uncovering cellular responses from heterogeneous genomic data is crucial for molecular medicine in particular for drug safety. This can be realized by integrating the molecular activities in networks of interacting proteins. As proof-of-concept we challenge network modeling with time-resolved proteome, transcriptome and methylome measurements in iPSC-derived human 3D cardiac microtissues to elucidate adverse mechanisms of anthracycline cardiotoxicity measured with four different drugs (doxorubicin, epirubicin, idarubicin and daunorubicin). Dynamic molecular analysis at in vivo drug exposure levels reveal a network of 175 disease-associated proteins and identify common modules of anthracycline cardiotoxicity in vitro, related to mitochondrial and sarcomere function as well as remodeling of extracellular matrix. These in vitro-identified modules are transferable and are evaluated with biopsies of cardiomyopathy patients. This to our knowledge most comprehensive study on anthracycline cardiotoxicity demonstrates a reproducible workflow for molecular medicine and serves as a template for detecting adverse drug responses from complex omics data.
Assuntos
Metaboloma , Modelos Biológicos , Proteoma , Transcriptoma , Epigênese Genética , Perfilação da Expressão Gênica/métodos , Regulação da Expressão Gênica , Redes Reguladoras de Genes , Humanos , Metabolômica/métodos , Mitocôndrias/genética , Mitocôndrias/metabolismo , Proteômica/métodos , Sarcômeros/genética , Sarcômeros/metabolismo , Transdução de SinaisRESUMO
In response to Krstajic's letter to the editor concerning our published paper, we here take the opportunity to reply, to re-iterate that no errors in our work were identified, to provide further details, and to re-emphasise the outputs of our study. Moreover, we highlight that all of the data are freely available for the wider scientific community (including the aforementioned correspondent) to undertake follow-on studies and comparisons.
RESUMO
Structure-activity relationship modelling is frequently used in the early stage of drug discovery to assess the activity of a compound on one or several targets, and can also be used to assess the interaction of compounds with liability targets. QSAR models have been used for these and related applications over many years, with good success. Conformal prediction is a relatively new QSAR approach that provides information on the certainty of a prediction, and so helps in decision-making. However, it is not always clear how best to make use of this additional information. In this article, we describe a case study that directly compares conformal prediction with traditional QSAR methods for large-scale predictions of target-ligand binding. The ChEMBL database was used to extract a data set comprising data from 550 human protein targets with different bioactivity profiles. For each target, a QSAR model and a conformal predictor were trained and their results compared. The models were then evaluated on new data published since the original models were built to simulate a "real world" application. The comparative study highlights the similarities between the two techniques but also some differences that it is important to bear in mind when the methods are used in practical drug discovery applications.
RESUMO
ChEMBL is a large-scale, open-access drug discovery resource containing bioactivity information primarily extracted from scientific literature. A substantial dataset of more than 135,000 in vivo assays has been collated as a key resource of animal models for translational medicine within drug discovery. To improve the utility of the in vivo data, an extensive data curation task has been undertaken that allows the assays to be grouped by animal disease model or phenotypic endpoint. The dataset contains previously unavailable information about compounds or drugs tested in animal models and, in conjunction with assay data on protein targets or cell- or tissue- based systems, allows the investigation of the effects of compounds at differing levels of biological complexity. Equally, it enables researchers to identify compounds that have been investigated for a group of disease-, pharmacology- or toxicity-relevant assays.
Assuntos
Bioensaio , Bases de Dados de Compostos Químicos , Descoberta de Drogas/métodos , Animais , Avaliação Pré-Clínica de Medicamentos , Modelos AnimaisRESUMO
Because of the success of imatinib, the first type-II kinase inhibitor approved by the FDA in 2001, sustained efforts have been made by the pharmaceutical industry to discover novel compounds stabilizing the inactive conformation of protein kinases. On the seven type-II inhibitors having reached the market, four were released in 2012, suggesting an acceleration of the research of such a class of compounds. Still, they represent less than a third of the protein kinase inhibitors available to patients today. The identification of key residues involved in the binding of this type of ligands in the kinase active site might ease the design of potent and selective type-II inhibitors. In order to identify those discriminant residues, we have developed a proteometric approach combining residue descriptors of protein kinase sequences and biological activities of various type-II kinase inhibitors. We applied Partial Least Squares (PLS) regression to identify 29 key residues that influence the binding of four type-II inhibitors to most proteins of the kinome. The gatekeeper residue was found to be the most relevant, confirming an essential role in ligand binding as well as in protein kinase conformational changes. Using the newly developed proteometric model, we predicted the propensity of each protein kinase to be inhibited by type-II ligands. The model was further validated using an external data set of protein/ligand activity pairs. Other residues present in the kinase domain, and more specifically in the binding site, have been highlighted by this approach, but their role in biological mechanisms is still unknown.