Your browser doesn't support javascript.
loading
: 20 | 50 | 100
1 - 17 de 17
2.
J Cheminform ; 14(1): 82, 2022 Dec 02.
Article En | MEDLINE | ID: mdl-36461094

We report the main conclusions of the first Chemoinformatics and Artificial Intelligence Colloquium, Mexico City, June 15-17, 2022. Fifteen lectures were presented during a virtual public event with speakers from industry, academia, and non-for-profit organizations. Twelve hundred and ninety students and academics from more than 60 countries. During the meeting, applications, challenges, and opportunities in drug discovery, de novo drug design, ADME-Tox (absorption, distribution, metabolism, excretion and toxicity) property predictions, organic chemistry, peptides, and antibiotic resistance were discussed. The program along with the recordings of all sessions are freely available at https://www.difacquim.com/english/events/2022-colloquium/ .

4.
RSC Adv ; 12(11): 6783-6790, 2022 Feb 22.
Article En | MEDLINE | ID: mdl-35424595

Transcription factors associated with quorum sensing in P. aeruginosa are promising targets for discovering new adjuvants against infection with this pathogen. Regulation of these transcription factors offers the possibility of controlling multiple virulence factors related to them as biofilm development, proteases, hydrogen cyanide, among others. Numerous molecules have been tested against these targets, however, the keys responsible for antagonistic activity are still unknown. In this work, the structure-activity relationships of active molecules tested against LasR, PqsR, and RhlR transcription factors are analyzed in order to establish the structural characteristics associated. As part of the study, molecular complexity, scaffold, activity cliffs, and chemical space visualization analyses were conducted to find out characteristics associated with biological activity. In this study, several structural features were identified as significant for antagonist activity, highlighting molecular size and hydrogen bond acceptors.

5.
J Comput Aided Mol Des ; 36(5): 341-354, 2022 05.
Article En | MEDLINE | ID: mdl-34143323

The concept of chemical space is a cornerstone in chemoinformatics, and it has broad conceptual and practical applicability in many areas of chemistry, including drug design and discovery. One of the most considerable impacts is in the study of structure-property relationships where the property can be a biological activity or any other characteristic of interest to a particular chemistry discipline. The chemical space is highly dependent on the molecular representation that is also a cornerstone concept in computational chemistry. Herein, we discuss the recent progress on chemoinformatic tools developed to expand and characterize the chemical space of compound data sets using different types of molecular representations, generate visual representations of such spaces, and explore structure-property relationships in the context of chemical spaces. We emphasize the development of methods and freely available tools focusing on drug discovery applications. We also comment on the general advantages and shortcomings of using freely available and easy-to-use tools and discuss the value of using such open resources for research, education, and scientific dissemination.


Cheminformatics , Drug Discovery , Drug Design , Drug Discovery/methods
6.
J Med Chem ; 64(12): 8208-8220, 2021 06 24.
Article En | MEDLINE | ID: mdl-33770434

Epigenetic targets are of significant importance in drug discovery research, as demonstrated by the eight approved epigenetic drugs for treatment of cancer and the increasing availability of chemogenomic data related to epigenetics. This data represents many structure-activity relationships that have not been exploited thus far to develop predictive models to support medicinal chemistry efforts. Herein, we report the first large-scale study of 26 318 compounds with a quantitative measure of biological activity for 55 protein targets with epigenetic activity. We built predictive models with high accuracy for small molecules' epigenetic target profiling through a systematic comparison of the machine learning models trained on different molecular fingerprints. The models were thoroughly validated, showing mean precisions of up to 0.952 for the epigenetic target prediction task. Our results indicate that the models reported herein have considerable potential to identify small molecules with epigenetic activity. Therefore, our results were implemented as a freely accessible web application.


Drug Discovery/methods , Epigenomics/methods , Machine Learning , Organic Chemicals/chemistry , Databases, Chemical/statistics & numerical data , Histone Deacetylases/metabolism , Molecular Structure , Organic Chemicals/metabolism , Proof of Concept Study , Structure-Activity Relationship , Transcription Factors/metabolism
7.
J Chem Inf Model ; 61(4): 1550-1554, 2021 04 26.
Article En | MEDLINE | ID: mdl-33729791

The identification of protein targets of small molecules is essential for drug discovery. With the increasing amount of chemogenomic data in the public domain, multiple ligand-based models for target prediction have emerged. However, these models are generally biased by the number of known ligands for different targets, which involves an under-representation of epigenetic targets, and despite the increasing importance of epigenetic targets in drug discovery, there are no open tools for epigenetic target prediction. In this work, we introduce Epigenetic Target Profiler (ETP), a freely accessible and easy-to-use web application for the prediction of epigenetic targets of small molecules. For a query compound, ETP predicts its bioactivity profile over a panel of 55 different epigenetic targets. To that aim, ETP uses a consensus model based on two binary classification models for each target, relying on support vector machines and built on molecular fingerprints of different design. A distance-to-model parameter related to the reliability of the predictions is included to facilitate their interpretability and assist in the identification of small molecules with potential epigenetic activity. Epigenetic Target Profiler is freely available at http://www.epigenetictargetprofiler.com.


Computers , Proteins , Epigenesis, Genetic , Internet , Ligands , Reproducibility of Results , Software
8.
Bioinformatics ; 37(10): 1376-1382, 2021 06 16.
Article En | MEDLINE | ID: mdl-33226061

MOTIVATION: Machine-learning scoring functions (SFs) have been found to outperform standard SFs for binding affinity prediction of protein-ligand complexes. A plethora of reports focus on the implementation of increasingly complex algorithms, while the chemical description of the system has not been fully exploited. RESULTS: Herein, we introduce Extended Connectivity Interaction Features (ECIF) to describe protein-ligand complexes and build machine-learning SFs with improved predictions of binding affinity. ECIF are a set of protein-ligand atom-type pair counts that take into account each atom's connectivity to describe it and thus define the pair types. ECIF were used to build different machine-learning models to predict protein-ligand affinities (pKd/pKi). The models were evaluated in terms of 'scoring power' on the Comparative Assessment of Scoring Functions 2016. The best models built on ECIF achieved Pearson correlation coefficients of 0.857 when used on its own, and 0.866 when used in combination with ligand descriptors, demonstrating ECIF descriptive power. AVAILABILITY AND IMPLEMENTATION: Data and code to reproduce all the results are freely available at https://github.com/DIFACQUIM/ECIF. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.


Machine Learning , Proteins , Algorithms , Ligands , Protein Binding , Proteins/metabolism
9.
Pharmaceuticals (Basel) ; 14(1)2020 Dec 27.
Article En | MEDLINE | ID: mdl-33375520

Inhibitors of DNA methyltransferases (DNMTs) are attractive compounds for epigenetic drug discovery. They are also chemical tools to understand the biochemistry of epigenetic processes. Herein, we report five distinct inhibitors of DNMT1 characterized in enzymatic inhibition assays that did not show activity with DNMT3B. It was concluded that the dietary component theaflavin is an inhibitor of DNMT1. Two additional novel inhibitors of DNMT1 are the approved drugs glyburide and panobinostat. The DNMT1 enzymatic inhibitory activity of panobinostat, a known pan inhibitor of histone deacetylases, agrees with experimental reports of its ability to reduce DNMT1 activity in liver cancer cell lines. Molecular docking of the active compounds with DNMT1, and re-scoring with the recently developed extended connectivity interaction features approach, led to an excellent agreement between the experimental IC50 values and docking scores.

10.
Biomolecules ; 10(11)2020 11 06.
Article En | MEDLINE | ID: mdl-33172012

Natural products and semi-synthetic compounds continue to be a significant source of drug candidates for a broad range of diseases, including coronavirus disease 2019 (COVID-19), which is causing the current pandemic. Besides being attractive sources of bioactive compounds for further development or optimization, natural products are excellent substrates of unique substructures for fragment-based drug discovery. To this end, fragment libraries should be incorporated into automated drug design pipelines. However, public fragment libraries based on extensive collections of natural products are still limited. Herein, we report the generation and analysis of a fragment library of natural products derived from a database with more than 400,000 compounds. We also report fragment libraries of a large food chemical database and other compound datasets of interest in drug discovery, including compound libraries relevant for COVID-19 drug discovery. The fragment libraries were characterized in terms of content and diversity.


Biological Products/chemistry , Drug Discovery , Algorithms , Betacoronavirus/isolation & purification , Biological Products/therapeutic use , COVID-19 , Coronavirus Infections/drug therapy , Coronavirus Infections/virology , Databases, Chemical , Humans , Pandemics , Pneumonia, Viral/drug therapy , Pneumonia, Viral/virology , SARS-CoV-2 , Small Molecule Libraries/chemistry , Small Molecule Libraries/therapeutic use
11.
Drug Discov Today ; 25(12): 2268-2276, 2020 12.
Article En | MEDLINE | ID: mdl-33010481

The ability of epigenetic markers to affect genome function has enabled transformative changes in drug discovery, especially in cancer and other emerging therapeutic areas. Concordant with the introduction of the term 'epi-informatics', the size of the epigenetically relevant chemical space has grown substantially and so did the number of applications of cheminformatic methods to epigenetics. Recent progress in epi-informatics has improved our understanding of the structure-epigenetic activity relationships and boosted the development of models predicting novel epigenetic agents. Herein, we review the advances in computational approaches to drug discovery of small molecules with epigenetic modulation profiles, summarize the current chemogenomics data available for epigenetic targets, and provide a perspective on the greater utility of biomedical knowledge mining as a means to advance the epigenetic drug discovery.


Cheminformatics , Drug Discovery , Epigenesis, Genetic , Databases, Factual , Humans
12.
Int J Mol Sci ; 21(14)2020 Jul 15.
Article En | MEDLINE | ID: mdl-32679731

Propolis is a natural resinous material produced by bees and has been used in folk medicines since ancient times. Due to it possessing a broad spectrum of biological activities, it has gained significant scientific and commercial interest over the last two decades. As a result of searching 122 publications reported up to the end of 2019, we assembled a unique compound database consisting of 578 components isolated from both honey bee propolis and stingless bee propolis, and analyzed the chemical space and chemical diversity of these compounds. The results demonstrated that both honey bee propolis and stingless bee propolis are valuable sources for pharmaceutical and nutraceutical development.


Propolis/chemistry , Propolis/pharmacology , Animals , Bees , Cheminformatics , Drug Discovery , Honey/analysis , Medicine, Traditional , Phenols/analysis , Phenols/pharmacology , Terpenes/analysis , Terpenes/pharmacology
13.
Mol Inform ; 39(11): e2000050, 2020 11.
Article En | MEDLINE | ID: mdl-32302465

We report a comprehensive fragment library with 205,903 fragments derived from the recently published Collection of Open Natural Products (COCONUT) data set with more than 400,000 non-redundant natural products. The natural products-based fragment library was compared with other two fragment libraries herein generated from ChEMBL (biologically relevant compounds) and Enamine-REAL (a large on-demand collection of synthetic compounds), both used as reference data sets with relevance in drug discovery. It was found that there is a large diversity of unique fragments derived from natural products and that the entire structures and fragments derived from natural products are more diverse and structurally complex than the two reference compound collections. During this work we introduced a novel visual representation of the chemical space based on the recently published concept of statistical-based database fingerprint. The compounds and fragments libraries from natural products generated and analyzed in this work are freely available.


Biological Products/analysis , Cheminformatics , Small Molecule Libraries/analysis , Databases as Topic
14.
RSC Adv ; 10(42): 25089-25099, 2020 Jun 29.
Article En | MEDLINE | ID: mdl-35517466

The pandemic caused by SARS-CoV-2 (COVID-19 disease) has claimed more than 500 000 lives worldwide, and more than nine million people are infected. Unfortunately, an effective drug or vaccine for its treatment is yet to be found. The increasing information available on critical molecular targets of SARS-CoV-2 and active compounds against related coronaviruses facilitates the proposal (or repurposing) of drug candidates for the treatment of COVID-19, with the aid of in silico methods. As part of a global effort to fight the COVID-19 pandemic, herein we report a consensus virtual screening of extensive collections of food chemicals and compounds known as dark chemical matter. The rationale is to contribute to global efforts with a description of currently underexplored chemical space regions. The consensus approach included combining similarity searching with various queries and fingerprints, molecular docking with two docking protocols, and ADMETox profiling. We propose compounds commercially available for experimental testing. The full list of virtual screening hits is disclosed.

15.
F1000Res ; 82019.
Article En | MEDLINE | ID: mdl-32047598

Background: Natural product databases are important in drug discovery and other research areas. An analysis of its structural content, as well as functional group occurrence, provides a useful overview, as well as a means of comparison with related databases. BIOFACQUIM is an emerging database of natural products characterized and isolated in Mexico. Herein, we discuss the results of a first systematic functional group analysis and global diversity of an updated version of BIOFACQUIM. Methods: BIOFACQUIM was augmented through a literature search and data curation. A structural content analysis of the dataset was performed. This involved a functional group analysis with a novel algorithm to automatically identify all functional groups in a molecule and an assessment of the global diversity using consensus diversity plots. To this end, BIOFACQUIM was compared to two major and large databases: ChEMBL 25, and a herein assembled collection of natural products with 169,839 unique compounds. Results: The structural content analysis showed that 15.7% of compounds and 11.6% of scaffolds present in the current version of BIOFACQUIM have not been reported in the other large reference datasets. It also gave a diversity increase in terms of scaffolds and molecular fingerprints regarding the previous version of the dataset, as well as a higher similarity to the assembled collection of natural products than to ChEMBL 25, in terms of diversity and frequent functional groups. Conclusions: A total of 148 natural products were added to BIOFACQUIM, which meant a diversity increase in terms of scaffolds and fingerprints. Regardless of its relatively small size, there are a significant number of compounds and scaffolds that are not present in the reference datasets, showing that curated databases of natural products, such as BIOFACQUIM, can serve as a starting point to increase the biologically relevant chemical space.


Biological Products , Databases, Chemical , Drug Discovery , Mexico
16.
J Cheminform ; 10(1): 55, 2018 Nov 22.
Article En | MEDLINE | ID: mdl-30467740

BACKGROUND: Simplified representation of compound databases has several applications in cheminformatics. Herein, we introduce an alternative and general method to build single fingerprint representations of compound databases. The approach is inspired on the previously published modal fingerprints that are aimed to capture the most significant bits of a fingerprint representation for a compound data set. The novelty of the herein proposed statistical-based database fingerprint (SB-DFP) is that it is generated based on binomial proportions comparisons taking as reference the distribution of "1" bits on a large representative set of the chemical space. RESULTS: To illustrate the Method, SB-DFPs were constructed for 28 epigenetic target data sets retrieved from a recently published epigenomics database of interest in probe and drug discovery. For each target data set, the SB-DFPs were built based on two representative fingerprints of different design using as reference a data set with more than 15 million compounds from ZINC. The application of SB-DFP was illustrated and compared to other methods through association relationships of the 28 epigenetic data sets and similarity searching. It was found that SB-DFPs captured overall, the common features between data sets and the distinct features of each set. In similarity searching SB-DFP equaled or outperformed other approaches for at least 20 out of the 28 sets. CONCLUSIONS: SB-DFP is a general approach based on binomial proportion comparisons to represent a compound data set with a single fingerprint. SB-DFP can be developed, at least in principle, based on any fingerprint and reference data set. SB-DFP is a good alternative for exploration of relationships between targets through its associated compound data sets and performing similarity searching.

17.
Front Pharmacol ; 9: 1144, 2018.
Article En | MEDLINE | ID: mdl-30364171

Naturally occurring small molecules include a large variety of natural products from different sources that have confirmed activity against epigenetic targets. In this work we review chemoinformatic, molecular modeling, and other computational approaches that have been used to uncover natural products as inhibitors of DNA methyltransferases, a major family of epigenetic targets with therapeutic interest. Examples of computational approaches surveyed in this work are docking, similarity-based virtual screening, and pharmacophore modeling. It is also discussed the chemoinformatic-guided exploration of the chemical space of naturally occurring compounds as epigenetic modulators which may have significant implications in epigenetic drug discovery and nutriepigenetics.

...