Your browser doesn't support javascript.
loading
Show: 20 | 50 | 100
Results 1 - 5 de 5
Filter
Add more filters










Database
Language
Publication year range
1.
J Phys Chem B ; 128(20): 4943-4951, 2024 May 23.
Article in English | MEDLINE | ID: mdl-38733335

ABSTRACT

Options to improve the extrapolation power of the neural network designed using the SchNetPack package with respect to top docking scores prediction are presented. It is shown that hyperparameter tuning of the atomistic model representation (in the schnetpack.representation) improves the prediction of the top scoring compounds, which have characteristically a low incidence in randomized data sets for training of machine learning models. The prediction robustness is evaluated according to the mean square error (MSE) and the entropy of the average loss landscape decrease. Admittedly, the improvement of the top scoring compounds' prediction accuracy comes with the penalty of worsening the overall prediction power. It is revealed that the most impactful hyperparameter is the cutoff (5 Å is reported as the optimal choice). Other parameters (e.g., number of radial basis functions, number of interaction layers of the neural network, feature vector size or its batch size) are found to not affect the prediction robustness of the top scoring compounds in any comparable way relative to the cutoff. The MSE of the best docking score prediction (below -13 kcal/mol) improves from ca. 3.5 to 0.9 kcal/mol, while the prediction of less potent compounds (-13 to -11 kcal/mol) shows a lesser improvement, i.e., a decrease of MSE from 1.6 to 1.3 kcal/mol. Additionally, oversampling and undersampling of the training set with respect to the top scoring compounds' abundance is presented. The results indicate that the cutoff choice performs better than over- or undersampling of the training set, with undersampling performing better than oversampling.

2.
Biophys Chem ; 288: 106854, 2022 09.
Article in English | MEDLINE | ID: mdl-35810518

ABSTRACT

Molecular docking of 234 unique compounds identified in the softwood bark (W set) is presented with a focus on their inhibition potential to the main protease of the SARS-CoV-2 virus 3CLpro (6WQF). The docking results are compared with the docking results of 866 COVID19-related compounds (S set). Furthermore, machine learning (ML) prediction of docking scores of the W set is presented using the S set trained TensorFlow, XGBoost, and SchNetPack ML approaches. Docking scores are evaluated with the Autodock 4.2.6 software. Four compounds in the W set achieve a docking score below -13 kcal/mol, with (+)-lariciresinol 9'-p-coumarate (CID 11497085) achieving the best docking score (-15 kcal/mol) within the W and S sets. In addition, 50% of W set docking scores are found below -8 kcal/mol and 25% below -10 kcal/mol. Therefore, the compounds identified in the softwood bark, show potential for antiviral activity upon extraction or further derivatization. The W set molecular docking studies are validated by means of molecular dynamics (five best compounds). The solubility (Log S, ESOL) and druglikeness of the best docking compounds in S and W sets are compared to evaluate the pharmacological potential of compounds identified in softwood bark.


Subject(s)
COVID-19 , SARS-CoV-2 , Antiviral Agents/pharmacology , Machine Learning , Molecular Docking Simulation , Molecular Dynamics Simulation , Peptide Hydrolases , Plant Bark , Protease Inhibitors/pharmacology
3.
Comput Biol Chem ; 98: 107656, 2022 Jun.
Article in English | MEDLINE | ID: mdl-35288359

ABSTRACT

Molecular docking results of two training sets containing 866 and 8,696 compounds were used to train three different machine learning (ML) approaches. Neural network approaches according to Keras and TensorFlow libraries and the gradient boosted decision trees approach of XGBoost were used with DScribe's Smooth Overlap of Atomic Positions molecular descriptors. In addition, neural networks using the SchNetPack library and descriptors were used. The ML performance was tested on three different sets, including compounds for future organic synthesis. The final evaluation of the ML predicted docking scores was based on the ZINC in vivo set, from which 1,200 compounds were randomly selected with respect to their size. The results obtained showed a consistent ML prediction capability of docking scores, and even though compounds with more than 60 atoms were found slightly overestimated they remain valid for a subsequent evaluation of their drug repurposing suitability.


Subject(s)
COVID-19 , SARS-CoV-2 , Antiviral Agents/therapeutic use , Humans , Machine Learning , Molecular Docking Simulation , Protease Inhibitors
4.
Sci Rep ; 11(1): 19456, 2021 09 30.
Article in English | MEDLINE | ID: mdl-34593871

ABSTRACT

Coronavirus disease 2019 (COVID-19) pandemic caused by severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) emerges to scientific research and monitoring of wastewaters to predict the spread of the virus in the community. Our study investigated the COVID-19 disease in Bratislava, based on wastewater monitoring from September 2020 until March 2021. Samples were analyzed from two wastewater treatment plants of the city with reaching 0.6 million monitored inhabitants. Obtained results from the wastewater analysis suggest significant statistical dependence. High correlations between the number of viral particles in wastewater and the number of reported positive nasopharyngeal RT-qPCR tests of infected individuals with a time lag of 2 weeks/12 days (R2 = 83.78%/R2 = 52.65%) as well as with a reported number of death cases with a time lag of 4 weeks/27 days (R2 = 83.21%/R2 = 61.89%) was observed. The obtained results and subsequent mathematical modeling will serve in the future as an early warning system for the occurrence of a local site of infection and, at the same time, predict the load on the health system up to two weeks in advance.


Subject(s)
COVID-19/epidemiology , SARS-CoV-2/genetics , Wastewater/analysis , Wastewater/virology , COVID-19/mortality , Disease Outbreaks/prevention & control , Humans , Models, Theoretical , RNA, Viral/isolation & purification , Real-Time Polymerase Chain Reaction , Slovakia/epidemiology , Wastewater/chemistry , Wastewater-Based Epidemiological Monitoring , Water Purification
5.
J Comput Chem ; 41(7): 698-714, 2020 Mar 15.
Article in English | MEDLINE | ID: mdl-31804728

ABSTRACT

Geometry optimization, energetics, electronic structure, and topology of electron density of dicopper (I) and dichromium (II) tetrakis(µ-acetato)-diaqua complexes are studied focusing on the metal-metal interactions. The performance of broken symmetry (BS) single-determinant ab initio (Hartree-Fock, Møller-Plesset perturbation theory to the second and third orders, coupled clusters singles and doubles) and density functional theory (BLYP, B3LYP, B3LYP-D3, B2PLYP, MPW2PLYP) methods is compared to multideterminant ab initio (CASSCF, NEVPT2) methods as well as to the multipole model of charge density from a single-crystal X-ray diffraction experiment (Herich et al., Acta Cryst. 2018, B74, 681-692). In vacuo DFT geometry optimizations (improper axial water ligand orientation) are compared against the periodic ones. The singlet state is found to be energetically preferred. J coupling of (I) becomes underestimated for all ab initio methods used, when compared to experiment. It is concluded that the strength of the direct M─M interactions correlates closely with the J coupling magnitude at a given level of theory. The double potential well character of (II) and of the dehydrated form of (II) are considered with respect to the Cr─Cr distance. The physical effective bond order of the metal-metal interaction is small (below 0.1 e) in (I) and moderate (0.4 e) in (II). The CASSCF results overestimate the electron density of the metal-metal bond critical point by 20% and 50% in (I) and (II), respectively, when compared to the multipole model. © 2019 Wiley Periodicals, Inc.

SELECTION OF CITATIONS
SEARCH DETAIL
...