RESUMEN
Alzheimer's disease (AD), a prevalent neurodegenerative disorder, presents significant challenges in drug development due to its multifactorial nature and complex pathophysiology. The AlzyFinder Platform, introduced in this study, addresses these challenges by providing a comprehensive, free web-based tool for parallel ligand-based virtual screening and network pharmacology, specifically targeting over 85 key proteins implicated in AD. This innovative approach is designed to enhance the identification and analysis of potential multitarget ligands, thereby accelerating the development of effective therapeutic strategies against AD. AlzyFinder Platform incorporates machine learning models to facilitate the ligand-based virtual screening process. These models, built with the XGBoost algorithm and optimized through Optuna, were meticulously trained and validated using robust methodologies to ensure high predictive accuracy. Validation included extensive testing with active, inactive, and decoy molecules, demonstrating the platform's efficacy in distinguishing active compounds. The models are evaluated based on balanced accuracy, precision, and F1 score metrics. A unique soft-voting ensemble approach is utilized to refine the classification process, integrating the strengths of individual models. This methodological framework enables a comprehensive analysis of interaction data, which is presented in multiple formats such as tables, heat maps, and interactive Ligand-Protein Interaction networks, thus enhancing the visualization and analysis of drug-protein interactions. AlzyFinder was applied to screen five molecules recently reported (and not used to train or validate the ML models) as active compounds against five key AD targets. The platform demonstrated its efficacy by accurately predicting all five molecules as true positives with a probability greater than 0.70. This result underscores the platform's capability in identifying potential therapeutic compounds with high precision. In conclusion, AlzyFinder's innovative approach extends beyond traditional virtual screening by incorporating network pharmacology analysis, thus providing insights into the systemic actions of drug candidates. This feature allows for the exploration of ligand-protein and protein-protein interactions and their extensions, offering a comprehensive view of potential therapeutic impacts. As the first open-access platform of its kind, AlzyFinder stands as a valuable resource for the AD research community, available at http://www.alzyfinder-platform.udec.cl with supporting data and scripts accessible via GitHub https://github.com/ramirezlab/AlzyFinder.
RESUMEN
Designing and developing inhibitors against the epigenetic target DNA methyltransferase (DNMT) is an attractive strategy in epigenetic drug discovery. DNMT1 is one of the epigenetic enzymes with significant clinical relevance. Structure-based de novo design is a drug discovery strategy that was used in combination with similarity searching to identify a novel DNMT inhibitor with a novel chemical scaffold and warrants further exploration. This study aimed to continue exploring the potential of de novo design to build epigenetic-focused libraries targeted toward DNMT1. Herein, we report the results of an in-depth and critical comparison of ligand- and structure-based de novo design of screening libraries focused on DNMT1. The newly designed chemical libraries focused on DNMT1 are freely available on GitHub.
Asunto(s)
ADN (Citosina-5-)-Metiltransferasa 1 , Diseño de Fármacos , Inhibidores Enzimáticos , Ligandos , ADN (Citosina-5-)-Metiltransferasa 1/antagonistas & inhibidores , ADN (Citosina-5-)-Metiltransferasa 1/metabolismo , Inhibidores Enzimáticos/química , Inhibidores Enzimáticos/farmacología , Humanos , Bibliotecas de Moléculas Pequeñas/farmacología , Bibliotecas de Moléculas Pequeñas/química , Relación Estructura-ActividadRESUMEN
Virtual small molecule libraries are valuable resources for identifying bioactive compounds in virtual screening campaigns and improving the quality of libraries in terms of physicochemical properties, complexity, and structural diversity. In this context, the computational-aided design of libraries focused against antidiabetic targets can provide novel alternatives for treating type II diabetes mellitus (T2DM). In this work, we integrated the information generated to date on compounds with antidiabetic activity, advances in computational methods, and knowledge of chemical transformations available in the literature to design multi-target compound libraries focused on T2DM. We evaluated the novelty and diversity of the newly generated library by comparing it with antidiabetic compounds approved for clinical use, natural products, and multi-target compounds tested in vivo in experimental antidiabetic models. The designed libraries are freely available and are a valuable starting point for drug design, chemical synthesis, and biological evaluation or further computational filtering. Also, the compendium of 280 transformation rules identified in a medicinal chemistry context is made available in the linear notation SMIRKS for use in other chemical library enumeration or hit optimization approaches.
RESUMEN
Science and art have been connected for centuries. With the development of new computational methods, new scientific disciplines have emerged, such as computational chemistry, and related fields, such as cheminformatics. Chemoinformatics is grounded on the chemical space concept: a multi-descriptor space in which chemical structures are described. In several practical applications, visual representations of the chemical space of compound datasets are low-dimensional plots helpful in identifying patterns. However, the authors propose that the plots can also be used as artistic expressions. This manuscript introduces an approach to merging art with chemoinformatics through visual and artistic representations of chemical space. As case studies, we portray the chemical space of food chemicals and other compounds to generate visually appealing graphs with twofold benefits: sharing chemical knowledge and developing pieces of art driven by chemoinformatics. The art driven by chemical space visualization will help increase the application of chemistry and art and contribute to general education and dissemination of chemoinformatics and chemistry through artistic expressions. All the code and data sets to reproduce the visual representation of the chemical space presented in the manuscript are freely available at https://github.com/DIFACQUIM/Art-Driven-by-Visual-Representations-of-Chemical-Space- . Scientific contribution: Chemical space as a concept to create digital art and as a tool to train and introduce students to cheminformatics.
RESUMEN
We report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries. During the meeting, applications, challenges, and opportunities in drug discovery, de novo drug design, ADME-Tox (absorption, distribution, metabolism, excretion and toxicity) property predictions, organic chemistry, peptides, and antibiotic resistance were discussed. The program along with the recordings of all sessions are freely available at https://www.difacquim.com/english/events/2022-colloquium/ .
RESUMEN
Technological advances and practical applications of the chemical space concept in drug discovery, natural product research, and other research areas have attracted the scientific community's attention. The large- and ultra-large chemical spaces are associated with the significant increase in the number of compounds that can potentially be made and exist and the increasing number of experimental and calculated descriptors, that are emerging that encode the molecular structure and/or property aspects of the molecules. Due to the importance and continued evolution of compound libraries, herein, we discuss definitions proposed in the literature for chemical space and emphasize the convenience, discussed in the literature to use complementary descriptors to obtain a comprehensive view of the chemical space of compound data sets. In this regard, we introduce the term chemical multiverse to refer to the comprehensive analysis of compound data sets through several chemical spaces, each defined by a different set of chemical representations. The chemical multiverse is contrasted with a related idea: consensus chemical space.
Asunto(s)
Productos Biológicos , Bibliotecas de Moléculas Pequeñas , Bibliotecas de Moléculas Pequeñas/química , Relación Estructura-Actividad , Estructura Molecular , Descubrimiento de DrogasRESUMEN
INTRODUCTION: Chemical space is a general conceptual framework that addresses the diversity of molecules and it has various applications. Moreover, chemical space is a cornerstone of chemoinformatics. In response to the increase in the set of chemical compounds in databases, generators of chemical structures, and tools to calculate molecular descriptors, novel approaches to generate visual representations of chemical space are emerging and evolving. AREAS COVERED: The current state of chemical space in drug design and discovery is reviewed. The topics discussed herein include advances for efficient navigation in chemical space, the use of this concept in assessing the diversity of different data sets, exploring structure-property/activity relationships for one or multiple endpoints, and compound library design. Recent advances in methodologies for generating visual representations of chemical space have been highlighted, thereby emphasizing open-source methods. EXPERT OPINION: Quantitative and qualitative generation and analysis of chemical space require novel approaches for handling the increasing number of molecules and their information available in chemical databases (including emerging ultra-large libraries). Chemical space is a conceptual framework that goes beyond visual representation in low dimensions. However, the graphical representation of chemical space has several practical applications in drug discovery and beyond.
Asunto(s)
Bases de Datos de Compuestos Químicos , Descubrimiento de Drogas , Bases de Datos Factuales , Diseño de Fármacos , Descubrimiento de Drogas/métodos , Humanos , Relación Estructura-ActividadRESUMEN
Acquired immunodeficiency syndrome (AIDS) caused by the human immunodeficiency virus (HIV) continues to be a public health problem. In 2020, 680,000 people died from HIV-related causes, and 1.5 million people were infected. Antiretrovirals are a way to control HIV infection but not to cure AIDS. As such, effective treatment must be developed to control AIDS. Developing a drug is not an easy task, and there is an enormous amount of work and economic resources invested. For this reason, it is highly convenient to employ computer-aided drug design methods, which can help generate and identify novel molecules. Using the de novo design, novel molecules can be developed using fragments as building blocks. In this work, we develop a virtual focused compound library of HIV-1 viral protease inhibitors from natural product fragments. Natural products are characterized by a large diversity of functional groups, many sp3 atoms, and chiral centers. Pseudo-natural products are a combination of natural products fragments that keep the desired structural characteristics from different natural products. An interactive version of chemical space visualization of virtual compounds focused on HIV-1 viral protease inhibitors from natural product fragments is freely available in the supplementary material.
Asunto(s)
Productos Biológicos/síntesis química , Inhibidores de la Proteasa del VIH/síntesis química , VIH-1/enzimología , Síndrome de Inmunodeficiencia Adquirida/tratamiento farmacológico , Síndrome de Inmunodeficiencia Adquirida/virología , Productos Biológicos/química , Productos Biológicos/farmacología , Computadores , Bases de Datos Farmacéuticas , Diseño de Fármacos , Inhibidores de la Proteasa del VIH/química , Inhibidores de la Proteasa del VIH/farmacología , VIH-1/efectos de los fármacos , Humanos , Estructura Molecular , Relación Estructura-ActividadRESUMEN
Virtual compound libraries are increasingly being used in computer-assisted drug discovery applications and have led to numerous successful cases. This paper aims to examine the fundamental concepts of library design and describe how to enumerate virtual libraries using open source tools. To exemplify the enumeration of chemical libraries, we emphasize the use of pre-validated or reported reactions and accessible chemical reagents. This tutorial shows a step-by-step procedure for anyone interested in designing and building chemical libraries with or without chemoinformatics experience. The aim is to explore various methodologies proposed by synthetic organic chemists and explore affordable chemical space using open-access chemoinformatics tools. As part of the tutorial, we discuss three examples of design: a Diversity-Oriented-Synthesis library based on lactams, a bis-heterocyclic combinatorial library, and a set of target-oriented molecules: isoindolinone based compounds as potential acetylcholinesterase inhibitors. This manuscript also seeks to contribute to the critical task of teaching and learning chemoinformatics.
RESUMEN
Natural products have a significant role in drug discovery. Natural products have distinctive chemical structures that have contributed to identifying and developing drugs for different therapeutic areas. Moreover, natural products are significant sources of inspiration or starting points to develop new therapeutic agents. Natural products such as peptides and macrocycles, and other compounds with unique features represent attractive sources to address complex diseases. Computational approaches that use chemoinformatics and molecular modeling methods contribute to speed up natural product-based drug discovery. Several research groups have recently used computational methodologies to organize data, interpret results, generate and test hypotheses, filter large chemical databases before the experimental screening, and design experiments. This review discusses a broad range of chemoinformatics applications to support natural product-based drug discovery. We emphasize profiling natural product data sets in terms of diversity; complexity; acid/base; absorption, distribution, metabolism, excretion, and toxicity (ADME/Tox) properties; and fragment analysis. Novel techniques for the visual representation of the chemical space are also discussed.
Asunto(s)
Productos Biológicos/química , Productos Biológicos/farmacología , Quimioinformática/métodos , Bases de Datos Factuales , Descubrimiento de Drogas/métodos , Animales , Productos Biológicos/análisis , Humanos , Modelos MolecularesRESUMEN
Small molecule libraries for virtual screening are becoming a well-established tool for the identification of new hit compounds. As for experimental assays, the library quality, defined in terms of structural complexity and diversity, is crucial to increase the chance of a successful outcome in the screening campaign. In this context, Diversity-Oriented Synthesis has proven to be very effective, as the compounds generated are structurally complex and differ not only for the appendages, but also for the molecular scaffold. In this work, we automated the design of a library of lactams by applying a Diversity-Oriented Synthesis strategy called Build/Couple/Pair. We evaluated the novelty and diversity of these compounds by comparing them with lactam moieties contained in approved drugs, natural products, and bioactive compounds from ChEMBL. Finally, depending on their scaffold we classified them into ß-, γ-, δ-, ε-, and isolated, fused, bridged and spirolactam groups and we assessed their drug-like and lead-like properties, thus providing the value of this novel in silico designed library for medicinal chemistry applications.
Asunto(s)
Diseño de Fármacos , Lactamas/química , Bibliotecas de Moléculas Pequeñas/química , Productos Biológicos/química , Técnicas Químicas Combinatorias , Lactamas/metabolismo , Bibliotecas de Moléculas Pequeñas/metabolismoRESUMEN
Compound databases of natural products have a major impact on drug discovery projects and other areas of research. The number of databases in the public domain with compounds with natural origins is increasing. Several countries, Brazil, France, Panama and, recently, Vietnam, have initiatives in place to construct and maintain compound databases that are representative of their diversity. In this proof-of-concept study, we discuss the first version of BIOFACQUIM, a novel compound database with natural products isolated and characterized in Mexico. We discuss its construction, curation, and a complete chemoinformatic characterization of the content and coverage in chemical space. The profile of physicochemical properties, scaffold content, and diversity, as well as structural diversity based on molecular fingerprints is reported. BIOFACQUIM is available for free.
Asunto(s)
Productos Biológicos/química , Bases de Datos de Compuestos Químicos , Descubrimiento de Drogas , MéxicoRESUMEN
Natural products (NPs) have been shown to be an extraordinary source of bioactive compounds and three-dimensional complex frameworks that can be useful to produce high-value molecular collections that are able to address "undruggable" and difficult therapeutic targets. Bicyclic acetals are particularly relevant for these purposes, given their key role in several biological interactions and the structural and stereochemical diversity that comes from the many possible ring combinations. To investigate this topological diversity, we have systematically characterized in a systematic and detailed manner fused, spiro and bridged bicyclic acetals in a large set of NPs, highlighting the great potential of bicyclic acetals in Diversity-Oriented Synthesis (DOS). Accordingly, a summary of some recent efforts on the development of acetal-containing small molecule collections through DOS approaches is herein reported.
Asunto(s)
Acetales/química , Productos Biológicos/química , Compuestos Bicíclicos con Puentes/química , Productos Biológicos/síntesis química , Bibliotecas de Moléculas Pequeñas/química , EstereoisomerismoRESUMEN
NuBBEDB is the first library of natural products of Brazilian biodiversity. It includes a large variety of classes of compounds and structural types of secondary metabolites of plants, fungi, insects, marine organisms, and bacteria. So far the chemical diversity and complexity of NuBBEDB have not been characterized in a systematic and detailed manner. Herein, we report a comprehensive chemoinformatic analysis of the most current version of NuBBEDB. As part of the characterization, NuBBEDB was compared with several databases of natural products in terms of structural diversity and complexity. Results of the analysis showed that NuBBEDB is diverse in terms of structural fingerprints, distribution of chemical scaffolds, and molecular properties. In addition, the results of the visualization of chemical space support quantitatively that NUBBEDB is a promising source of molecules for drug discovery and medicinal chemistry.
Asunto(s)
Productos Biológicos/química , Bases de Datos de Compuestos Químicos , Animales , Descubrimiento de DrogasRESUMEN
Lactams are a class of compounds important for drug design, due to their great variety of potential therapeutic applications, spanning cancer, diabetes, and infectious diseases. So far, the biological profile and chemical diversity of lactams have not been characterized in a systematic and detailed manner. In this work, we report the chemoinformatic analysis of beta-, gamma-, delta- and epsilon-lactams present in databases of approved drugs, natural products, and bioactive compounds from the large public database ChEMBL. We identified the main biological targets in which the lactams have been evaluated according to their chemical classification. We also identified the most frequent scaffolds and those that can be prioritized in chemical synthesis, since they are scaffolds with potential biological activity but with few reported analogs. Results of the biological and chemoinformatic analysis of lactams indicate that spiro- and bridged-lactams belong to classes with the lowest number of compounds and unique scaffolds, and some showing activity against specific targets. Information obtained from this analysis allows focusing the design of new chemical structures in less explored spaces and with increased possibilities of success.
RESUMEN
Naturally occurring small molecules include a large variety of natural products from different sources that have confirmed activity against epigenetic targets. In this work we review chemoinformatic, molecular modeling, and other computational approaches that have been used to uncover natural products as inhibitors of DNA methyltransferases, a major family of epigenetic targets with therapeutic interest. Examples of computational approaches surveyed in this work are docking, similarity-based virtual screening, and pharmacophore modeling. It is also discussed the chemoinformatic-guided exploration of the chemical space of naturally occurring compounds as epigenetic modulators which may have significant implications in epigenetic drug discovery and nutriepigenetics.