Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 20 de 49
Filter
1.
Int J Mol Sci ; 25(11)2024 May 25.
Article in English | MEDLINE | ID: mdl-38891933

ABSTRACT

The role of the gut microbiota and its interplay with host metabolic health, particularly in the context of type 2 diabetes mellitus (T2DM) management, is garnering increasing attention. Dipeptidyl peptidase 4 (DPP4) inhibitors, commonly known as gliptins, constitute a class of drugs extensively used in T2DM treatment. However, their potential interactions with gut microbiota remain poorly understood. In this study, we employed computational methodologies to investigate the binding affinities of various gliptins to DPP4-like homologs produced by intestinal bacteria. The 3D structures of DPP4 homologs from gut microbiota species, including Segatella copri, Phocaeicola vulgatus, Bacteroides uniformis, Parabacteroides merdae, and Alistipes sp., were predicted using computational modeling techniques. Subsequently, molecular dynamics simulations were conducted for 200 ns to ensure the stability of the predicted structures. Stable structures were then utilized to predict the binding interactions with known gliptins through molecular docking algorithms. Our results revealed binding similarities of gliptins toward bacterial DPP4 homologs compared to human DPP4. Specifically, certain gliptins exhibited similar binding scores to bacterial DPP4 homologs as they did with human DPP4, suggesting a potential interaction of these drugs with gut microbiota. These findings could help in understanding the interplay between gliptins and gut microbiota DPP4 homologs, considering the intricate relationship between the host metabolism and microbial communities in the gut.


Subject(s)
Diabetes Mellitus, Type 2 , Dipeptidyl Peptidase 4 , Dipeptidyl-Peptidase IV Inhibitors , Gastrointestinal Microbiome , Humans , Bacteria/metabolism , Bacterial Proteins/metabolism , Bacterial Proteins/chemistry , Binding Sites , Diabetes Mellitus, Type 2/metabolism , Diabetes Mellitus, Type 2/drug therapy , Dipeptidyl Peptidase 4/metabolism , Dipeptidyl Peptidase 4/chemistry , Dipeptidyl-Peptidase IV Inhibitors/pharmacology , Molecular Docking Simulation , Molecular Dynamics Simulation , Protein Binding
2.
Chem Res Toxicol ; 37(4): 580-589, 2024 Apr 15.
Article in English | MEDLINE | ID: mdl-38501392

ABSTRACT

The desirable pharmacological properties and a broad number of therapeutic activities have made peptides promising drugs over small organic molecules and antibody drugs. Nevertheless, toxic effects, such as hemolysis, have hampered the development of such promising drugs. Hence, a reliable computational tool to predict peptide hemolytic toxicity is enormously useful before synthesis and experimental evaluation. Currently, four web servers that predict hemolytic activity using machine learning (ML) algorithms are available; however, they exhibit some limitations, such as the need for a reliable negative set and limited application domain. Hence, we developed a robust model based on a novel theoretical approach that combines network science and a multiquery similarity searching (MQSS) method. A total of 1152 initial models were constructed from 144 scaffolds generated in a previous report. These were evaluated on external data sets, and the best models were fused and improved. Our best MQSS model I1 outperformed all state-of-the-art ML-based models and was used to characterize the prevalence of hemolytic toxicity on therapeutic peptides. Based on our model's estimation, the number of hemolytic peptides might be 3.9-fold higher than the reported.


Subject(s)
Hemolysis , Peptides , Humans , Amino Acid Sequence , Peptides/pharmacology , Peptides/chemistry , Algorithms , Machine Learning
3.
J Comput Aided Mol Des ; 38(1): 9, 2024 Feb 14.
Article in English | MEDLINE | ID: mdl-38351144

ABSTRACT

Notwithstanding the wide adoption of the OECD principles (or best practices) for QSAR modeling, disparities between in silico predictions and experimental results are frequent, suggesting that model predictions are often too optimistic. Of these OECD principles, the applicability domain (AD) estimation has been recognized in several reports in the literature to be one of the most challenging, implying that the actual reliability measures of model predictions are often unreliable. Applying tree-based error analysis workflows on 5 QSAR models reported in the literature and available in the QsarDB repository, i.e., androgen receptor bioactivity (agonists, antagonists, and binders, respectively) and membrane permeability (highest membrane permeability and the intrinsic permeability), we demonstrate that predictions erroneously tagged as reliable (AD prediction errors) overwhelmingly correspond to instances in subspaces (cohorts) with the highest prediction error rates, highlighting the inhomogeneity of the AD space. In this sense, we call for more stringent AD analysis guidelines which require the incorporation of model error analysis schemes, to provide critical insight on the reliability of underlying AD algorithms. Additionally, any selected AD method should be rigorously validated to demonstrate its suitability for the model space over which it is applied. These steps will ultimately contribute to more accurate estimations of the reliability of model predictions. Finally, error analysis may also be useful in "rational" model refinement in that data expansion efforts and model retraining are focused on cohorts with the highest error rates.


Subject(s)
Algorithms , Quantitative Structure-Activity Relationship , Reproducibility of Results
4.
Mol Divers ; 2023 Apr 05.
Article in English | MEDLINE | ID: mdl-37017875

ABSTRACT

Ubiquitin-proteasome system (UPS) is a highly regulated mechanism of intracellular protein degradation and turnover. The UPS is involved in different biological activities, such as the regulation of gene transcription and cell cycle. Several researchers have applied cheminformatics and artificial intelligence methods to study the inhibition of proteasomes, including the prediction of UPP inhibitors. Following this idea, we applied a new tool for obtaining molecular descriptors (MDs) for modeling proteasome Inhibition in terms of EC50 (µmol/L), in which a set of new MDs called atomic weighted vectors (AWV) and several prediction algorithms were used in cheminformatics studies. In the manuscript, a set of descriptors based on AWV are presented as datasets for training different machine learning techniques, such as linear regression, multiple linear regression (MLR), random forest (RF), K-nearest neighbors (IBK), multi-layer perceptron, best-first search, and genetic algorithm. The results suggest that these atomic descriptors allow adequate modeling of proteasome inhibitors despite artificial intelligence techniques, as a variant to build efficient models for the prediction of inhibitory activity.

5.
Int J Mol Sci ; 23(3)2022 Jan 31.
Article in English | MEDLINE | ID: mdl-35163573

ABSTRACT

Inflammasomes are multiprotein complexes that represent critical elements of the inflammatory response. The dysregulation of the best-characterized complex, the NLRP3 inflammasome, has been linked to the pathogenesis of diseases such as multiple sclerosis, type 2 diabetes mellitus, Alzheimer's disease, and cancer. While there exist molecular inhibitors specific for the various components of inflammasome complexes, no currently reported inhibitors specifically target NLRP3PYD homo-oligomerization. In the present study, we describe the identification of QM380 and QM381 as NLRP3PYD homo-oligomerization inhibitors after screening small molecules from the MyriaScreen library using a split-luciferase complementation assay. Our results demonstrate that these NLRP3PYD inhibitors interfere with ASC speck formation, inhibit pro-inflammatory cytokine IL1-ß release, and decrease pyroptotic cell death. We employed spectroscopic techniques and computational docking analyses with QM380 and QM381 and the PYD domain to confirm the experimental results and predict possible mechanisms underlying the inhibition of NLRP3PYD homo-interactions.


Subject(s)
Anti-Inflammatory Agents , NLR Family, Pyrin Domain-Containing 3 Protein , Protein Multimerization/drug effects , Pyroptosis/drug effects , Anti-Inflammatory Agents/chemistry , Anti-Inflammatory Agents/pharmacology , HEK293 Cells , Humans , NLR Family, Pyrin Domain-Containing 3 Protein/antagonists & inhibitors , NLR Family, Pyrin Domain-Containing 3 Protein/chemistry , NLR Family, Pyrin Domain-Containing 3 Protein/genetics , NLR Family, Pyrin Domain-Containing 3 Protein/metabolism
6.
Mol Divers ; 25(3): 1425-1438, 2021 Aug.
Article in English | MEDLINE | ID: mdl-34258685

ABSTRACT

Scientific and consumer interest in healthy foods (also known as functional foods), nutraceuticals and cosmeceuticals has increased in the recent years, leading to an increased presence of these products in the market. However, the regulations across different countries that define the type of claims that may be made, and the degree of evidence required to support these claims, are rather inconsistent. Moreover, there is also controversy on the effectiveness and biological mode of action of many of these products, which should undergo an exhaustive approval process to guarantee the consumer rights. Computational approaches constitute invaluable tools to facilitate the discovery of bioactive molecules and provide biological plausibility on the mode of action of these products. Indeed, methodologies like QSAR, docking or molecular dynamics have been used in drug discovery protocols for decades and can now aid in the discovery of bioactive food components. Thanks to these approaches, it is possible to search for new functions in food constituents, which may be part of our daily diet, and help to prevent disorders like diabetes, hypercholesterolemia or obesity. In the present manuscript, computational studies applied to this field are reviewed to illustrate the potential of these approaches to guide the first screening steps and the mechanistic studies of nutraceutical, cosmeceutical and functional foods.


Subject(s)
Cheminformatics/methods , Cosmeceuticals/chemistry , Dietary Supplements/analysis , Functional Food/analysis , Models, Molecular , Quantitative Structure-Activity Relationship , Algorithms , Cosmeceuticals/pharmacology , Databases, Chemical , Humans , Machine Learning , Molecular Docking Simulation , Molecular Dynamics Simulation
7.
Environ Toxicol Pharmacol ; 87: 103688, 2021 Oct.
Article in English | MEDLINE | ID: mdl-34119701

ABSTRACT

Multiple substances are considered endocrine disrupting chemicals (EDCs). However, there is a significant gap in the early prioritization of EDC's effects. In this work, in silico and in vitro methods were used to model estrogenicity. Two Quantitative Structure-Activity Relationship (QSAR) models based on Logistic Regression and REPTree algorithms were built using a large and diverse database of estrogen receptor (ESR) agonism. A 10-fold external validation demonstrated their robustness and predictive capacity. Mechanistic interpretations of the molecular descriptors (C-026, nArOH,PW5, B06[Br-Br]) used for modelling suggested that the heteroatomic fragments, aromatic hydroxyls, and bromines, and the relative bond accessibility areas of molecules, are structural determinants in estrogenicity. As validation of the QSARs, ESR transactivity of thirteen persistent organic pollutants (POPs) and suspected EDCs was tested in vitro using the MMV-Luc cell line. A good correspondence between predictions and experimental bioassays demonstrated the value of the QSARs for prioritization of ESR agonist compounds.


Subject(s)
Endocrine Disruptors/toxicity , Estrogens/toxicity , Receptors, Estrogen/metabolism , Algorithms , Cell Line, Tumor , Cell Survival/drug effects , Computer Simulation , Endocrine Disruptors/chemistry , Endocrine Disruptors/classification , Estrogens/chemistry , Estrogens/classification , Humans , Models, Chemical , Quantitative Structure-Activity Relationship , Receptors, Estrogen/antagonists & inhibitors
8.
Future Med Chem ; 13(11): 993-1000, 2021 06.
Article in English | MEDLINE | ID: mdl-33890502

ABSTRACT

Background: There is currently no effective dengue virus (DENV) therapeutic. We aim to develop a genetic algorithm-based framework for the design of peptides with possible DENV inhibitory activity. Methods & results: A Python-based tool (denominated AutoPepGEN) based on a DENV support vector machine classifier as the objective function was implemented. AutoPepGEN was applied to the design of three- to seven-amino acid sequences and ten peptides were selected. Peptide-protease (DENV) docking and Molecular Mechanics-Generalized Born Surface Area calculations were performed for the selected sequences and favorable binding energies were observed. Conclusion: It is hoped that AutoPepGEN will serve as an in silico alternative to the experimental design of positional scanning combinatorial libraries, known to be prone to a combinatorial explosion. AutoPepGEN is available at: https://github.com/sjbarigye/AutoPepGEN.


Subject(s)
Algorithms , Antiviral Agents/pharmacology , Dengue Virus/drug effects , Peptides/pharmacology , Amino Acid Sequence , Antiviral Agents/chemical synthesis , Antiviral Agents/chemistry , Microbial Sensitivity Tests , Peptides/chemical synthesis , Peptides/chemistry
9.
Proteins ; 89(2): 174-184, 2021 02.
Article in English | MEDLINE | ID: mdl-32881068

ABSTRACT

We present a novel Java-based program denominated PeptiDesCalculator for computing peptide descriptors. These descriptors include: redefinitions of known protein parameters to suite the peptide domain, generalization schemes for the global descriptions of peptide characteristics, as well as empirical descriptors based on experimental evidence on peptide stability and interaction propensity. The PeptiDesCalculator software provides a user-friendly Graphical User Interface (GUI) and is parallelized to maximize the use of computational resources available in current work stations. The PeptiDesCalculator indices are employed in modeling 8 peptide bioactivity endpoints demonstrating satisfactory behavior. Moreover, we compare the performance of a support vector machine (SVM) classifier built using 15 PeptiDesCalculator indices with that of a recently reported deep neural network (DNN) antimicrobial activity classifier, demonstrating comparable test set performance notwithstanding the remarkably lower degree of freedom for the former. This software will facilitate the development of in silico models for the prediction of peptide properties.


Subject(s)
Peptides/chemistry , Peptides/pharmacology , Software , Support Vector Machine , Anti-Bacterial Agents/chemistry , Anti-Bacterial Agents/pharmacology , Antifungal Agents/chemistry , Antifungal Agents/pharmacology , Antineoplastic Agents/chemistry , Antineoplastic Agents/pharmacology , Antiviral Agents/chemistry , Antiviral Agents/pharmacology , Candida albicans/drug effects , HIV Infections/drug therapy , Hepatitis C/drug therapy , Humans , Listeria monocytogenes/drug effects , Neoplasms/drug therapy , Neural Networks, Computer , Peptide Mapping , Peptides/genetics , Peptides/metabolism , Protein Stability , Pseudomonas aeruginosa/drug effects
10.
Eur J Med Chem ; 207: 112777, 2020 Dec 01.
Article in English | MEDLINE | ID: mdl-32971427

ABSTRACT

The aryl hydrocarbon receptor (AhR) is a chemical sensor upregulating the transcription of responsive genes associated with endocrine homeostasis, oxidative balance and diverse metabolic, immunological and inflammatory processes, which have raised the pharmacological interest on its modulation. Herein, a novel set of 32 unsymmetrical triarylmethane (TAM) class of structures has been synthesized, characterized and their AhR transcriptional activity evaluated using a cell-based assay. Eight of the assayed TAM compounds (14, 15, 18, 19, 21, 22, 25, 28) exhibited AhR agonism but none of them showed antagonist effects. TAMs bearing benzotrifluoride, naphthol or heteroaromatic (indole, quinoline or thiophene) rings seem to be prone to AhR activation unlike phenyl substituted or benzotriazole derivatives. A molecular docking analysis with the AhR ligand binding domain (LBD) showed similarities in the binding mode and in the interactions of the most potent TAM identified 4-(pyridin-2-yl (thiophen-2-yl)methyl)phenol (22) compared to the endogenous AhR agonist 5,11-dihydroindolo[3,2-b]carbazole-12-carbaldehyde (FICZ). Finally, in silico predictions of physicochemical and biopharmaceutical properties for the most potent agonistic compounds were performed and these exhibited acceptable druglikeness and good ADME profiles. To our knowledge, this is the first study assessing the AhR modulatory effects of unsymmetrical TAM class of compounds.


Subject(s)
Methane/chemistry , Methane/pharmacology , Receptors, Aryl Hydrocarbon/metabolism , Hep G2 Cells , Humans , Methane/chemical synthesis , Methane/metabolism , Molecular Docking Simulation , Molecular Targeted Therapy , Protein Binding , Receptors, Aryl Hydrocarbon/agonists , Receptors, Aryl Hydrocarbon/chemistry , Transcriptional Activation/drug effects
11.
J Chem Inf Model ; 60(7): 3534-3545, 2020 07 27.
Article in English | MEDLINE | ID: mdl-32589419

ABSTRACT

Over the past few decades, virtual high-throughput screening (vHTS) and molecular dynamics simulations have become effective and widely used tools in the initial stages of drug discovery efforts. These methods allow a great number of druglike molecules to be screened quickly and inexpensively. Unfortunately, however, the accuracies of both these methods rely on the quality of the underlying molecular mechanics force fields (FFs), which are often poor. This major weakness originates from the reliance of FFs on a finite list of specific parameters, called atom types, which have low transferability between molecules. In particular, the torsional energy barriers of druglike molecules are notoriously difficult to predict. Continuing our endeavor to understand factors affecting the torsional energy barriers of small molecules and quantify them, we showed that descriptors calculated using the extended-Hückel method could be used to rapidly assign accurate torsion parameters for conjugated molecules. This method, called H-TEQ 4.5, was developed using a set of 684 conjugated molecules. It was subsequently validated on a test set of 200 diverse molecules and produced an average root-mean-square error (rmse) of 1.01 kcal·mol-1, with respect to the reference quantum mechanic torsional profiles. For comparison, GAFF2, MMFF94, and MAB produced average rmse's of 3.49, 1.50, and 1.77 kcal·mol-1, respectively. H-TEQ 4.5 is also computationally inexpensive, running just under 0.25 ms for a biphenyl molecule on a home computer, allowing it to be used for vHTS of large libraries of compounds. Overall, H-TEQ 4.5 solved the problems associated with the transferability of torsion parameters for conjugated molecules. This method was incorporated into the Molecular Operating Environment and will be available for a wide variety of applications.


Subject(s)
Molecular Dynamics Simulation , Quantum Theory , Physical Phenomena , Static Electricity , Thermodynamics
12.
Mol Inform ; 39(10): e2000086, 2020 10.
Article in English | MEDLINE | ID: mdl-32558335

ABSTRACT

In the present report we evaluate the possible utility of the Generative Adversarial Networks (GANs) in mapping the chemical structural space for molecular property profiles, with the goal of subsequently yielding synthetic (artificial) samples for ligand-based molecular modeling. Two case studies are considered: BACE-1 (ß-Secretase 1) and DENV (Dengue Virus) inhibitory activities, with the former focused on data populating and the latter on data balancing tasks. We train GANs using subsamples extracted from datasets for each bioactivity endpoint, and apply the trained networks in generating synthetic examples from the respective bioactivity chemical spaces. Original and synthetic samples are pooled together and employed to build BACE-1 and DENV inhibitory activity classifiers and their performance evaluated over tenfold external validation sets. In both case studies, the obtained classifiers demonstrate satisfactory predictivity with the former yielding accuracy (ACC) and Mathew's correlation coefficient (MCC) values of 0.80 and 0.59, while the latter produces balanced accuracy(BACC) and MCC values of 0.81 and 0.70, respectively. Moreover, the statistics of these classifiers are compared with those of other models in the literature demonstrating comparable to better performance. These results suggest that GANs may be useful in mapping the chemical space for molecular property profiles of interest, and thus allow for the extraction of synthetic examples for computational modeling.


Subject(s)
Amyloid Precursor Protein Secretases/chemistry , Aspartic Acid Endopeptidases/chemistry , Computational Biology/methods , Dengue Virus/drug effects , Small Molecule Libraries/pharmacology , Amyloid Precursor Protein Secretases/antagonists & inhibitors , Antiviral Agents/chemistry , Antiviral Agents/pharmacology , Aspartic Acid Endopeptidases/antagonists & inhibitors , Computer Simulation , Drug Evaluation, Preclinical , Enzyme Inhibitors/chemistry , Enzyme Inhibitors/pharmacology , Humans , Models, Molecular , Neural Networks, Computer , Small Molecule Libraries/chemistry , Support Vector Machine
13.
Chemosphere ; 256: 127068, 2020 Oct.
Article in English | MEDLINE | ID: mdl-32447110

ABSTRACT

The aryl hydrocarbon receptor (AhR) plays a key role in the regulation of gene expression in metabolic machinery and detoxification systems. In the recent years, this receptor has attracted interest as a therapeutic target for immunological, oncogenic and inflammatory conditions. In the present report, in silico and in vitro approaches were combined to study the activation of the AhR. To this end, a large database of chemical compounds with known AhR agonistic activity was employed to build 5 classifiers based on the Adaboost (AdB), Gradient Boosting (GB), Random Forest (RF), Multilayer Perceptron (MLP) and Support Vector Machine (SVM) algorithms, respectively. The built classifiers were examined, following a 10-fold external validation procedure, demonstrating adequate robustness and predictivity. These models were integrated into a majority vote based ensemble, subsequently used to screen an in-house library of compounds from which 40 compounds were selected for prospective in vitro experimental validation. The general correspondence between the ensemble predictions and the in vitro results suggests that the constructed ensemble may be useful in predicting the AhR agonistic activity, both in a toxicological and pharmacological context. A preliminary structure-activity analysis of the evaluated compounds revealed that all structures bearing a benzothiazole moiety induced AhR expression while diverse activity profiles were exhibited by phenolic derivatives.


Subject(s)
Receptors, Aryl Hydrocarbon/metabolism , Algorithms , Animals , Basic Helix-Loop-Helix Transcription Factors , Benzothiazoles , Computer Simulation , Humans , Neural Networks, Computer , Phenols , Prospective Studies , Support Vector Machine
14.
Sci Rep ; 10(1): 5285, 2020 03 24.
Article in English | MEDLINE | ID: mdl-32210335

ABSTRACT

Breast cancer (BC) is the leading cause of cancer-related death among women and the most commonly diagnosed cancer worldwide. Although in recent years large-scale efforts have focused on identifying new therapeutic targets, a better understanding of BC molecular processes is required. Here we focused on elucidating the molecular hallmarks of BC heterogeneity and the oncogenic mutations involved in precision medicine that remains poorly defined. To fill this gap, we established an OncoOmics strategy that consists of analyzing genomic alterations, signaling pathways, protein-protein interactome network, protein expression, dependency maps in cell lines and patient-derived xenografts in 230 previously prioritized genes to reveal essential genes in breast cancer. As results, the OncoOmics BC essential genes were rationally filtered to 140. mRNA up-regulation was the most prevalent genomic alteration. The most altered signaling pathways were associated with basal-like and Her2-enriched molecular subtypes. RAC1, AKT1, CCND1, PIK3CA, ERBB2, CDH1, MAPK14, TP53, MAPK1, SRC, RAC3, BCL2, CTNNB1, EGFR, CDK2, GRB2, MED1 and GATA3 were essential genes in at least three OncoOmics approaches. Drugs with the highest amount of clinical trials in phases 3 and 4 were paclitaxel, docetaxel, trastuzumab, tamoxifen and doxorubicin. Lastly, we collected ~3,500 somatic and germline oncogenic variants associated with 50 essential genes, which in turn had therapeutic connectivity with 73 drugs. In conclusion, the OncoOmics strategy reveals essential genes capable of accelerating the development of targeted therapies for precision oncology.


Subject(s)
Biomarkers, Tumor/genetics , Breast Neoplasms/genetics , Breast Neoplasms/pathology , Gene Expression Regulation, Neoplastic , Genes, Essential , Mutation , Precision Medicine , Animals , Biomarkers, Tumor/metabolism , Breast Neoplasms/metabolism , Female , Gene Regulatory Networks , High-Throughput Nucleotide Sequencing , Humans , Mice , Prognosis , Protein Interaction Maps , Proteome , Tumor Cells, Cultured , Xenograft Model Antitumor Assays
15.
Mol Divers ; 24(4): 913-932, 2020 Nov.
Article in English | MEDLINE | ID: mdl-31659696

ABSTRACT

In this report, we introduce a set of aggregation operators (AOs) to calculate global and local (group and atom type) molecular descriptors (MDs) as a generalization of the classical approach of molecular encoding using the sum of the atomic (or fragment) contributions. These AOs are implemented in a new and free software denominated MD-LOVIs ( http://tomocomd.com/md-lovis ), which allows for the calculation of MDs from atomic weights vector and LOVIs (local vertex invariants). This software was developed in Java programming language and employed the Chemical Development Kit (CDK) library for handling chemical structures and the calculation of atomic weights. An analysis of the complexities of the algorithms presented herein demonstrates that these aspects were efficiently implemented. The calculation speed experiments show that the MD-LOVIs software has satisfactory behavior when compared to software such as Padel, CDKDescriptor, DRAGON and Bluecal software. Shannon's entropy (SE)-based variability studies demonstrate that MD-LOVIs yields indices with greater information content when compared to those of popular academic and commercial software. A principal component analysis reveals that our approach captures chemical information orthogonal to that codified by the DRAGON, Padel and Mold2 software, as a result of the several generalizations in MD-LOVIs not used in other programs. Lastly, three QSARs were built using multiple linear regression with genetic algorithms, and the statistical parameters of these models demonstrate that the MD-LOVIs indices obtained with AOs yield better performance than those obtained when the summation operator is used exclusively. Moreover, it is also revealed that the MD-LOVIs indices yield models with comparable to superior performance when compared to other QSAR methodologies reported in the literature, despite their simplicity. The studies performed herein collectively demonstrated that MD-LOVIs software generates indices as simple as possible, but not simpler and that use of AOs enhances the diversity of the chemical information codified, which consequently improves the performance of traditional MDs.


Subject(s)
Models, Chemical , Small Molecule Libraries/chemistry , Algorithms , Linear Models , Multivariate Analysis , Quantitative Structure-Activity Relationship , Software
16.
J Comput Aided Mol Des ; 33(11): 997-1008, 2019 11.
Article in English | MEDLINE | ID: mdl-31773464

ABSTRACT

Imbalanced datasets, comprising of more inactive compounds relative to the active ones, are a common challenge in ligand-based model building workflows for drug discovery. This is particularly true for neglected tropical diseases since efforts to identify therapeutics for these diseases are often limited. In this report, we analyze the performance of several undersampling strategies in modeling the Dengue Virus 2 (DENV2) inhibitory activity, as well as the anti-flaviviral activities for the West Nile (WNV) and Zika (ZIKV) viruses. To this end, we build datasets comprising of 1218 (159 actives and 1059 inactives), 1044 (132 actives and 912 inactives) and 302 (75 actives and 227 inactives) molecules with known DENV2, WNV and ZIKV inhibitory activity profiles, respectively. We develop ensemble classifiers for these endpoints and compare the performance of the different undersampling algorithms on external sets. It is observed that data pruning algorithms yield superior performance relative to data selection algorithms. The best overall performance is provided by the one-sided selection algorithm with test set balanced accuracy (BACC) values of 0.84, 0.74 and 0.77 for the DENV2, WNV and ZIKV inhibitory activities, respectively. For the model building, we use the recently proposed GT-STAF information indices, and compare the predictivity of 3 molecular fragmentation approaches: connected subgraphs, substructure and alogp atom types, which are observed to show comparable performance. On the other hand, a combination of indices based on these fragmentation strategies enhances the predictivity of the built ensembles. The built models could be useful for screening new molecules with possible DENV, WNV and ZIKV inhibitory activities. ADMET modelers are encouraged to adopt undersampling algorithms in their workflows when dealing with imbalanced datasets.


Subject(s)
Antiviral Agents/pharmacology , Drug Discovery/methods , Flaviviridae/drug effects , Support Vector Machine , Antiviral Agents/chemistry , Dengue Virus/drug effects , Flaviviridae Infections/drug therapy , Humans , West Nile virus/drug effects , Zika Virus/drug effects
17.
J Chem Inf Model ; 59(11): 4750-4763, 2019 11 25.
Article in English | MEDLINE | ID: mdl-31589815

ABSTRACT

Applications of computational methods to predict binding affinities for protein/drug complexes are routinely used in structure-based drug discovery. Applications of these methods often rely on empirical force fields (FFs) and their associated parameter sets and atom types. However, it is widely accepted that FFs cannot accurately cover the entire chemical space of drug-like molecules, due to the restrictive cost of parametrization and the poor transferability of existing parameters. To address these limitations, initiatives have been carried out to develop more transferable methods, in order to allow for more rigorous descriptions of any drug-like molecule. We have previously reported H-TEQ, a method which does not rely on atom types and incorporates well established chemical principles to assign parameters to organic molecules. The previous implementation of H-TEQ (a torsional barrier prediction method) only covered saturated and lone pair containing molecules; here, we report our efforts to incorporate conjugated systems into our model. The next step was the evaluation of the introduction of unsaturations. The developed model (H-TEQ3.0) has been validated on a wide variety of molecules containing heteroaromatic groups, alkyls, and fused ring systems. Our method performs on par with one of the most commonly used FFs (GAFF2), without relying on atom types or any prior parametrization.


Subject(s)
Allyl Compounds/chemistry , Benzene Derivatives/chemistry , Drug Discovery , Molecular Conformation , Molecular Dynamics Simulation , Pharmaceutical Preparations/chemistry , Quantum Theory , Thermodynamics
18.
J Chem Inf Model ; 59(11): 4764-4777, 2019 11 25.
Article in English | MEDLINE | ID: mdl-31430147

ABSTRACT

Biaryl molecules are ubiquitous pharmacophores found in natural products and pharmaceuticals. In spite of this, existing molecular mechanics force fields are unable to accurately reproduce their torsional energy profiles, except for a few well-parametrized cases. This effectively limits the ability of structure-based drug design methods to correctly identify hits involving biaryls with confidence (e.g., during virtual screening, employing docking and/or molecular dynamics simulations). Continuing in our endeavor to quantify organic chemistry principles, we showed that the torsional energy profile of biaryl compounds could be computed on-the-fly based on the electron richness/deficiency of the aromatic rings. This method, called H-TEQ 4.0, was developed using a set of 131 biaryls. It was subsequently validated on a separate set of 100 diverse biaryls, including multisubstituted, bicyclic and tricyclic druglike molecules, and produced an average root-mean-square error (RMSE) of 0.95 kcal·mol-1. For comparison, GAFF2 produced an RMSE of 3.88 kcal·mol-1, owing to problems associated with the transferability of torsion parameters. The success of H-TEQ 4.0 provided further evidence that force fields could transition to become atom-type independent, providing that the correct chemical principles are used. Overall, this method solved the problem of transferability of biaryl torsion parameters, while simultaneously improving the overall accuracy of the force field.


Subject(s)
Hydrocarbons, Aromatic/chemistry , Pharmaceutical Preparations/chemistry , Drug Design , Electrons , Models, Chemical , Quantum Theory , Static Electricity , Thermodynamics
19.
Mol Inform ; 38(7): e1900024, 2019 07.
Article in English | MEDLINE | ID: mdl-31131991

ABSTRACT

A lot of research initiatives in the last decades have been focused on the search of new strategies to treat depression. However, despite the availability of various antidepressants, current treatment is still far from ideal. Unwanted side effects, modest response rates and the slow onset of action are the main shortcomings. As a strategy to improve symptomatic relief and response rates, the dual modulation of the serotonin transporter and the histamine H3 receptor by a single chemical entity has been proposed in the literature. Accordingly, this work aims to elucidate key structural features responsible for the dual inhibitory activity of the hexahydro-pyrrolo-isoquinoline derivatives. For this purpose, two approaches were employed, four-dimensional quantitative structure-activity relationship (4D-QSAR) and molecular docking. The 4D-QSAR models for both receptors allowed the identification of the pharmacophore groups critical for the modelled biological activity, whereas the binding mode of this class of compounds to the human serotonin transporter was assessed by molecular docking. The findings can be applicable to design new antidepressants.


Subject(s)
Antidepressive Agents/chemistry , Depression/drug therapy , Molecular Docking Simulation , Antidepressive Agents/therapeutic use , Humans , Molecular Structure , Quantitative Structure-Activity Relationship
20.
Sci Rep ; 8(1): 16679, 2018 11 12.
Article in English | MEDLINE | ID: mdl-30420728

ABSTRACT

Consensus strategy was proved to be highly efficient in the recognition of gene-disease association. Therefore, the main objective of this study was to apply theoretical approaches to explore genes and communities directly involved in breast cancer (BC) pathogenesis. We evaluated the consensus between 8 prioritization strategies for the early recognition of pathogenic genes. A communality analysis in the protein-protein interaction (PPi) network of previously selected genes was enriched with gene ontology, metabolic pathways, as well as oncogenomics validation with the OncoPPi and DRIVE projects. The consensus genes were rationally filtered to 1842 genes. The communality analysis showed an enrichment of 14 communities specially connected with ERBB, PI3K-AKT, mTOR, FOXO, p53, HIF-1, VEGF, MAPK and prolactin signaling pathways. Genes with highest ranking were TP53, ESR1, BRCA2, BRCA1 and ERBB2. Genes with highest connectivity degree were TP53, AKT1, SRC, CREBBP and EP300. The connectivity degree allowed to establish a significant correlation between the OncoPPi network and our BC integrated network conformed by 51 genes and 62 PPi. In addition, CCND1, RAD51, CDC42, YAP1 and RPA1 were functional genes with significant sensitivity score in BC cell lines. In conclusion, the consensus strategy identifies both well-known pathogenic genes and prioritized genes that need to be further explored.


Subject(s)
Algorithms , Breast Neoplasms/metabolism , Female , Gene Expression Regulation, Neoplastic/genetics , Gene Expression Regulation, Neoplastic/physiology , Gene Regulatory Networks/genetics , Gene Regulatory Networks/physiology , Humans , Metabolic Networks and Pathways/genetics , Metabolic Networks and Pathways/physiology , Protein Binding , Signal Transduction/genetics , Signal Transduction/physiology
SELECTION OF CITATIONS
SEARCH DETAIL