Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 145
Filtrar
1.
J Chem Inf Model ; 64(6): 1907-1918, 2024 Mar 25.
Artigo em Inglês | MEDLINE | ID: mdl-38470995

RESUMO

The protein-ligand binding free energy is a central quantity in structure-based computational drug discovery efforts. Although popular alchemical methods provide sound statistical means of computing the binding free energy of a large breadth of systems, they are generally too costly to be applied at the same frequency as end point or ligand-based methods. By contrast, these data-driven approaches are typically fast enough to address thousands of systems but with reduced transferability to unseen systems. We introduce DrΔG-Net (or simply Dragnet), an equivariant graph neural network that can blend ligand-based and protein-ligand data-driven approaches. It is based on a 3D fingerprint representation of the ligand alone and in complex with the protein target. Dragnet is a global scoring function to predict the binding affinity of arbitrary protein-ligand complexes, but can be easily tuned via transfer learning to specific systems or end points, performing similarly to common 2D ligand-based approaches in these tasks. Dragnet is evaluated on a total of 28 validation proteins with a set of congeneric ligands derived from the Binding DB and one custom set extracted from the ChEMBL Database. In general, a handful of experimental binding affinities are sufficient to optimize the scoring function for a particular protein and ligand scaffold. When not available, predictions from physics-based methods such as absolute free energy perturbation can be used for the transfer learning tuning of Dragnet. Furthermore, we use our data to illustrate the present limitations of data-driven modeling of binding free energy predictions.


Assuntos
Redes Neurais de Computação , Proteínas , Ligantes , Proteínas/química , Entropia , Ligação Proteica
2.
J Chem Phys ; 160(24)2024 Jun 28.
Artigo em Inglês | MEDLINE | ID: mdl-38916266

RESUMO

Access to accurate force-field parameters for small molecules is crucial for computational studies of their interactions with proteins. Although a number of general force fields for small molecules exist, e.g., CGenFF, GAFF, and OPLS, they do not cover all common chemical groups and their combinations. The Force Field Toolkit (ffTK) provides a comprehensive graphical interface that streamlines the development of classical parameters for small molecules directly from quantum mechanical (QM) calculations, allowing for force-field generation for almost any chemical group and validation of the fit relative to the target data. ffTK relies on supported external software for the QM calculations, but it can generate the necessary QM input files and parse and analyze the QM output. In previous ffTK versions, support for Gaussian and ORCA QM packages was implemented. Here, we add support for Psi4, an open-source QM package free for all users, thereby broadening user access to ffTK. We also compare the parameter sets obtained with the new ffTK version using Gaussian, ORCA, and Psi4 for three molecules: pyrrolidine, n-propylammonium cation, and chlorobenzene. Despite minor differences between the resulting parameter sets for each compound, most prominently in the dihedral and improper terms, we show that conformational distributions sampled in molecular dynamics simulations using these parameter sets are quite comparable.

3.
J Chem Inf Model ; 63(10): 3150-3157, 2023 05 22.
Artigo em Inglês | MEDLINE | ID: mdl-37125692

RESUMO

High-order quantum chemistry is applied to hydrogen-bonded natural DNA nucleobase pairs [adenine:thymine (A:T) and guanine:cytosine (G:C)] and non-natural Hachimoji nucleobase pairs [isoguanine:1-methylcytosine (B:S) and 2-aminoimidazo[1,2a][1,3,5]triazin-4(1H)-one:6-amino-5-nitropyridin-2-one (P:Z)] to see how the intermolecular interaction energies and their energetic components (electrostatics, exchange-repulsion, induction/polarization, and London dispersion interactions) vary among the base pairs. We examined the Hoogsteen (HG) geometries in addition to the traditional Watson-Crick (WC) geometries. Coupled-cluster theory through perturbative triples [CCSD(T)] extrapolated to the complete basis set (CBS) limit and high-order symmetry-adapted perturbation theory (SAPT) at the SAPT2+(3)(CCD)δMP2/aug-cc-pVTZ level are used to estimate highly accurate noncovalent interaction energies. Electrostatic interactions are the most attractive component of the interaction energies, but the sum of induction/polarization and London dispersion is nearly as large, for all base pairs and geometries considered. Interestingly, the non-natural Hachimoji base pairs interact more strongly than the corresponding natural base pairs, by -21.8 (B:S) and -0.3 (P:Z) kcal mol-1 in the WC geometries, according to CCSD(T)/CBS. This is consistent with the H-bond distances being generally shorter in the non-natural base pairs. The natural base pairs are energetically more stabilized in their Hoogsteen geometries than in their WC geometries. The Hoogsteen geometry makes the A:T base pair slightly more stable, by -0.8 kcal mol-1, and it greatly stabilizes the G:C+ base pair, by -15.3 kcal mol-1. The G:C+ stabilization is mainly due to the fact that C has typically added a proton when found in Hoogsteen geometries. By contrast, Hoogsteen geometries are substantially less favorable than WC geometries for non-natural Hachimoji base pairs, by 17.3 (B:S) and 13.8 (P:Z) kcal mol-1.


Assuntos
DNA , Timina , Pareamento de Bases , Ligação de Hidrogênio , DNA/química , Timina/química , Adenina/química
4.
J Chem Phys ; 158(9): 094110, 2023 Mar 07.
Artigo em Inglês | MEDLINE | ID: mdl-36889937

RESUMO

To study the contribution of three-body dispersion to crystal lattice energies, we compute the three-body contributions to the lattice energies for crystalline benzene, carbon dioxide, and triazine using various computational methods. We show that these contributions converge quickly as the intermolecular distances between the monomers grow. In particular, the smallest value among the three pairwise intermonomer closest-contact distances, Rmin, shows a strong correlation with the three-body contribution to the lattice energy, and, here, the largest of the closest-contact distances, Rmax, serves as a cutoff criterion to limit the number of trimers to be considered. We considered all trimers up to Rmax=15Å. The trimers with Rmin<4Å contribute 90.4%, 90.6%, and 93.9% of the total three-body contributions for crystalline benzene, carbon dioxide, and triazine, respectively, for the coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] method. For trimers with Rmin>4Å, the second-order Møller-Plesset perturbation theory (MP2) supplemented with the Axilrod-Teller-Muto (ATM) three-body dispersion correction reproduces the CCSD(T) values for the cumulative three-body contributions with errors of less than 0.1 kJ mol-1. Moreover, three-body contributions are converged within 0.15 kJ mol-1 by Rmax=10Å. From these results, it appears that in molecular crystals where dispersion dominates the three-body contribution to the lattice energy, the trimers with Rmin>4Å can be computed with the MP2+ATM method to reduce the computational cost, and those with Rmax>10Å appear to be basically negligible.

5.
J Chem Phys ; 159(9)2023 Sep 07.
Artigo em Inglês | MEDLINE | ID: mdl-37655773

RESUMO

The focal-point approximation can be used to estimate a high-accuracy, slow quantum chemistry computation by combining several lower-accuracy, faster computations. We examine the performance of focal-point methods by combining second-order Møller-Plesset perturbation theory (MP2) with coupled-cluster singles, doubles, and perturbative triples [CCSD(T)] for the calculation of harmonic frequencies and that of fundamental frequencies using second-order vibrational perturbation theory (VPT2). In contrast to standard CCSD(T), the focal-point CCSD(T) method approaches the complete basis set (CBS) limit with only triple-ζ basis sets for the coupled-cluster portion of the computation. The predicted harmonic and fundamental frequencies were compared with the experimental values for a set of 20 molecules containing up to six atoms. The focal-point method combining CCSD(T)/aug-cc-pV(T + d)Z with CBS-extrapolated MP2 has mean absolute errors vs experiment of only 7.3 cm-1 for the fundamental frequencies, which are essentially the same as the mean absolute error for CCSD(T) extrapolated to the CBS limit using the aug-cc-pV(Q + d)Z and aug-cc-pV(5 + d)Z basis sets. However, for H2O, the focal-point procedure requires only 3% of the computation time as the extrapolated CCSD(T) result, and the cost savings will grow for larger molecules.

6.
J Chem Phys ; 158(5): 054112, 2023 Feb 07.
Artigo em Inglês | MEDLINE | ID: mdl-36754814

RESUMO

Using the many-body expansion to predict crystal lattice energies (CLEs), a pleasantly parallel process, allows for flexibility in the choice of theoretical methods. Benchmark-level two-body contributions to CLEs of 23 molecular crystals have been computed using interaction energies of dimers with minimum inter-monomer separations (i.e., closest contact distances) up to 30 Å. In a search for ways to reduce the computational expense of calculating accurate CLEs, we have computed these two-body contributions with 15 different quantum chemical levels of theory and compared these energies to those computed with coupled-cluster in the complete basis set (CBS) limit. Interaction energies of the more distant dimers are easier to compute accurately and several of the methods tested are suitable as replacements for coupled-cluster through perturbative triples for all but the closest dimers. For our dataset, sub-kJ mol-1 accuracy can be obtained when calculating two-body interaction energies of dimers with separations shorter than 4 Å with coupled-cluster with single, double, and perturbative triple excitations/CBS and dimers with separations longer than 4 Å with MP2.5/aug-cc-pVDZ, among other schemes, reducing the number of dimers to be computed with coupled-cluster by as much as 98%.

7.
J Chem Phys ; 158(23)2023 Jun 21.
Artigo em Inglês | MEDLINE | ID: mdl-37318167

RESUMO

The many-body expansion (MBE) is promising for the efficient, parallel computation of lattice energies in organic crystals. Very high accuracy should be achievable by employing coupled-cluster singles, doubles, and perturbative triples at the complete basis set limit [CCSD(T)/CBS] for the dimers, trimers, and potentially tetramers resulting from the MBE, but such a brute-force approach seems impractical for crystals of all but the smallest molecules. Here, we investigate hybrid or multi-level approaches that employ CCSD(T)/CBS only for the closest dimers and trimers and utilize much faster methods like Møller-Plesset perturbation theory (MP2) for more distant dimers and trimers. For trimers, MP2 is supplemented with the Axilrod-Teller-Muto (ATM) model of three-body dispersion. MP2(+ATM) is shown to be a very effective replacement for CCSD(T)/CBS for all but the closest dimers and trimers. A limited investigation of tetramers using CCSD(T)/CBS suggests that the four-body contribution is entirely negligible. The large set of CCSD(T)/CBS dimer and trimer data should be valuable in benchmarking approximate methods for molecular crystals and allows us to see that a literature estimate of the core-valence contribution of the closest dimers to the lattice energy using just MP2 was overbinding by 0.5 kJ mol-1, and an estimate of the three-body contribution from the closest trimers using the T0 approximation in local CCSD(T) was underbinding by 0.7 kJ mol-1. Our CCSD(T)/CBS best estimate of the 0 K lattice energy is -54.01 kJ mol-1, compared to an estimated experimental value of -55.3 ± 2.2 kJ mol-1.

8.
J Chem Phys ; 158(24)2023 Jun 28.
Artigo em Inglês | MEDLINE | ID: mdl-37352421

RESUMO

Dimer interaction energies have been well studied in computational chemistry, but they can offer an incomplete understanding of molecular binding depending on the system. In the current study, we present a dataset of focal-point coupled-cluster interaction and deformation energies (summing to binding energies, De) of 28 organic molecular dimers. We use these highly accurate energies to evaluate ten density functional approximations for their accuracy. The best performing method (with a double-ζ basis set), B97M-D3BJ, is then used to calculate the binding energies of 104 organic dimers, and we analyze the influence of the nature and strength of interaction on deformation energies. Deformation energies can be as large as 50% of the dimer interaction energy, especially when hydrogen bonding is present. In most cases, two or more hydrogen bonds present in a dimer correspond to an interaction energy of -10 to -25 kcal mol-1, allowing a deformation energy above 1 kcal mol-1 (and up to 9.5 kcal mol-1). A lack of hydrogen bonding usually restricts the deformation energy to below 1 kcal mol-1 due to the weaker interaction energy.


Assuntos
Termodinâmica , Fenômenos Físicos , Ligação de Hidrogênio
9.
Nucleic Acids Res ; 49(1): 79-89, 2021 01 11.
Artigo em Inglês | MEDLINE | ID: mdl-33300028

RESUMO

The helical structures of DNA and RNA were originally revealed by experimental data. Likewise, the development of programs for modeling these natural polymers was guided by known structures. These nucleic acid polymers represent only two members of a potentially vast class of polymers with similar structural features, but that differ from DNA and RNA in the backbone or nucleobases. Xeno nucleic acids (XNAs) incorporate alternative backbones that affect the conformational, chemical, and thermodynamic properties of XNAs. Given the vast chemical space of possible XNAs, computational modeling of alternative nucleic acids can accelerate the search for plausible nucleic acid analogs and guide their rational design. Additionally, a tool for the modeling of nucleic acids could help reveal what nucleic acid polymers may have existed before RNA in the early evolution of life. To aid the development of novel XNA polymers and the search for possible pre-RNA candidates, this article presents the proto-Nucleic Acid Builder (https://github.com/GT-NucleicAcids/pnab), an open-source program for modeling nucleic acid analogs with alternative backbones and nucleobases. The torsion-driven conformation search procedure implemented here predicts structures with good accuracy compared to experimental structures, and correctly demonstrates the correlation between the helical structure and the backbone conformation in DNA and RNA.


Assuntos
Algoritmos , Modelos Químicos , Ácidos Nucleicos/química , Software , DNA/química , Desoxirribose/química , Estrutura Molecular , Conformação de Ácido Nucleico , RNA/química
10.
J Chem Phys ; 157(8): 084503, 2022 Aug 28.
Artigo em Inglês | MEDLINE | ID: mdl-36050028

RESUMO

Routinely assessing the stability of molecular crystals with high accuracy remains an open challenge in the computational sciences. The many-body expansion decomposes computation of the crystal lattice energy into an embarrassingly parallel collection of computations over molecular dimers, trimers, and so forth, making quantum chemistry techniques tractable for many crystals of small organic molecules. By examining the range-dependence of different types of energetic contributions to the crystal lattice energy, we can glean qualitative understanding of solid-state intermolecular interactions as well as practical, exploitable reductions in the number of computations required for accurate energies. Here, we assess the range-dependent character of two-body interactions of 24 small organic molecular crystals by using the physically interpretable components from symmetry-adapted perturbation theory (electrostatics, exchange-repulsion, induction/polarization, and London dispersion). We also examine correlations between the convergence rates of electrostatics and London dispersion terms with molecular dipole moments and polarizabilities, to provide guidance for estimating convergence rates in other molecular crystals.


Assuntos
Teoria Quântica , Eletricidade Estática , Termodinâmica
11.
J Chem Phys ; 157(2): 024801, 2022 Jul 14.
Artigo em Inglês | MEDLINE | ID: mdl-35840394

RESUMO

We report the implementation of a symmetry-adapted perturbation theory algorithm based on a density functional theory [SAPT(DFT)] description of monomers. The implementation adopts a density-fitting treatment of hybrid exchange-correlation kernels to enable the description of monomers with hybrid functionals, as in the algorithm by Bukowski, Podeszwa, and Szalewicz [Chem. Phys. Lett. 414, 111 (2005)]. We have improved the algorithm by increasing numerical stability with QR factorization and optimized the computation of the exchange-correlation kernel with its 2-index density-fitted representation. The algorithm scales as O(N5) formally and is usable for systems with up to ∼3000 basis functions, as demonstrated for the C60-buckycatcher complex with the aug-cc-pVDZ basis set. The hybrid-kernel-based SAPT(DFT) algorithm is shown to be as accurate as SAPT(DFT) implementations based on local effective exact exchange potentials obtained from the local Hartree-Fock (LHF) method while avoiding the lower-scaling [O(N4)] but iterative and sometimes hard-to-converge LHF process. The hybrid-kernel algorithm outperforms Hartree-Fock-based SAPT (SAPT0) for the S66 test set, and its accuracy is comparable to the many-body perturbation theory based SAPT2+ approach, which scales as O(N7), although SAPT2+ exhibits a more narrow distribution of errors.

12.
J Chem Phys ; 156(19): 194306, 2022 May 21.
Artigo em Inglês | MEDLINE | ID: mdl-35597646

RESUMO

High-level quantum chemical computations have provided significant insight into the fundamental physical nature of non-covalent interactions. These studies have focused primarily on gas-phase computations of small van der Waals dimers; however, these interactions frequently take place in complex chemical environments, such as proteins, solutions, or solids. To better understand how the chemical environment affects non-covalent interactions, we have undertaken a quantum chemical study of π-π interactions in an aqueous solution, as exemplified by T-shaped benzene dimers surrounded by 28 or 50 explicit water molecules. We report interaction energies (IEs) using second-order Møller-Plesset perturbation theory, and we apply the intramolecular and functional-group partitioning extensions of symmetry-adapted perturbation theory (ISAPT and F-SAPT, respectively) to analyze how the solvent molecules tune the π-π interactions of the solute. For complexes containing neutral monomers, even 50 explicit waters (constituting a first and partial second solvation shell) change total SAPT IEs between the two solute molecules by only tenths of a kcal mol-1, while significant changes of up to 3 kcal mol-1 of the electrostatic component are seen for the cationic pyridinium-benzene dimer. This difference between charged and neutral solutes is attributed to large non-additive three-body interactions within solvated ion-containing complexes. Overall, except for charged solutes, our quantum computations indicate that nearby solvent molecules cause very little "tuning" of the direct solute-solute interactions. This indicates that differences in binding energies between the gas phase and solution phase are primarily indirect effects of the competition between solute-solute and solute-solvent interactions.


Assuntos
Benzeno , Água , Benzeno/química , Soluções , Solventes , Eletricidade Estática , Água/química
13.
J Am Chem Soc ; 143(47): 19824-19833, 2021 12 01.
Artigo em Inglês | MEDLINE | ID: mdl-34783562

RESUMO

Nucleobase mimicking small molecules able to reconfigure DNA are a recently discovered strategy that promises to extend the structural and functional diversity of nucleic acids. However, only simple, unfunctionalized molecules such as cyanuric acid and melamine have so far been used in this approach. In this work, we show that the addition of substituted cyanuric acid molecules can successfully program polyadenine strands to assemble into supramolecular fibers. Unlike conventional DNA nanostructure functionalization, which typically end-labels DNA strands, our approach incorporates functional groups into DNA with high density using small molecules and results in new DNA triple helices coated with alkylamine or alcohol units that grow into micrometer-long fibers. We find that small changes in the small molecule functional group can result in large structural and energetic variation in the overall assembly. A combination of circular dichroism, atomic force microscopy, molecular dynamics simulations, and a new thermodynamic method, transient equilibrium mapping, elucidated the molecular factors behind these large changes. In particular, we identify substantial DNA sugar and phosphate group deformations to accommodate a hydrogen bond between the phosphate and the small-molecule functional groups, as well as a critical chain length of the functional group which switches this interaction from intra- to interfiber. These parameters allow the controlled formation of hierarchical, hybrid DNA assemblies simply through the addition and variation of small, functionalized molecules.


Assuntos
DNA/química , Ligação de Hidrogênio , Simulação de Dinâmica Molecular , Conformação de Ácido Nucleico , Polimerização , Eletricidade Estática , Triazinas/química
14.
J Am Chem Soc ; 143(16): 6079-6094, 2021 04 28.
Artigo em Inglês | MEDLINE | ID: mdl-33852800

RESUMO

Aqueous solutions of equimolar mixtures of 2,4,6-triaminopyrimidine (TAP) and carboxylic acid substituted cyanuric acid (CyCo6 or R-4MeCyCo6) monomers self-assemble into gel-forming supramolecular polymers. Macroscopic fibers drawn from these mixtures were analyzed by X-ray diffraction to determine their molecular structures. Computational methods were used to explore the intrinsic intermolecular interactions that contribute to the structure and stability of these assemblies. Both polymers are formed by the stacking of hexameric rosettes, (TAP/CyCo6)3 or (TAP/R-4MeCyCo6)3, respectively, into long, stiff, twisted stacks of essentially planar rosettes. Chiral, left-handed supramolecular polymers with a helical twist angle of -26.7° per hexad are formed when the pure enantiomer R-4MeCyCo6 is used. These hexad stacks pack into bundles with a hexagonal crystalline lattice organization perpendicular to the axis of the macroscopic fiber. Polymers formed from TAP and CyCo6, both of which are achiral, assemble into macroscopic domains that are packed as a centered rectangular lattice. Within these domains, the individual polymers exist as either right-handed or left-handed helical stacks, with twist angles of +15° or -15° per hexad, respectively. The remarkable ability of TAP and cyanuric acid derivatives to self-assemble in water, and the structural features of their supramolecular polymers reported here, provide additional support for the proposal that these heterocycles could have served as recognition units for an early form of nucleic acids, before the emergence of RNA.


Assuntos
Polímeros/química , Prebióticos/análise , Água/química , Géis/química , Ligação de Hidrogênio , Conformação Molecular , Simulação de Dinâmica Molecular , Pirimidinas/química , Teoria Quântica , Sódio/química , Estereoisomerismo , Triazinas/química , Difração de Raios X
15.
Chemistry ; 27(12): 4043-4052, 2021 Feb 24.
Artigo em Inglês | MEDLINE | ID: mdl-33174300

RESUMO

Cyanuric acid (CA), a triazine heterocycle, is extensively utilized for noncovalent self-assembly. The association between poly(adenine) and CA into micron-length fibers was a remarkable observation made by Sleiman and co-workers, who proposed that adenine and CA adopt a hexameric rosette configuration in analogy with previously reported structures for CA assemblies. However, recent experimental observations from the Krishnamurthy group led to a reevaluation of the hexameric rosette model, wherein they have proposed a hydrogen-bonded helicene model as an alternative. Our molecular dynamics simulations show that the hexad model is indeed unlikely and that this novel noncovalent helicene geometry, where the adenine and CA bases form an extended helical hydrogen-bond network across the system, is a more probable structural motif. The existence of noncovalent helicene compounds may have wide-ranging applications in DNA nanotechnology and helicene chemistry.


Assuntos
Ácidos Nucleicos , Humanos , Ligação de Hidrogênio , Compostos Policíclicos , Triazinas
16.
J Chem Inf Model ; 61(1): 115-122, 2021 01 25.
Artigo em Inglês | MEDLINE | ID: mdl-33326247

RESUMO

Atomic charges are critical quantities in molecular mechanics and molecular dynamics, but obtaining these quantities requires heuristic choices based on atom typing or relatively expensive quantum mechanical computations to generate a density to be partitioned. Most machine learning efforts in this domain ignore total molecular charges, relying on overfitting and arbitrary rescaling in order to match the total system charge. Here, we introduce the electron-passing neural network (EPNN), a fast, accurate neural network atomic charge partitioning model that conserves total molecular charge by construction. EPNNs predict atomic charges very similar to those obtained by partitioning quantum mechanical densities but at such a small fraction of the cost that they can be easily computed for large biomolecules. Charges from this method may be used directly for molecular mechanics, as features for cheminformatics, or as input to any neural network potential.


Assuntos
Elétrons , Redes Neurais de Computação , Quimioinformática , Aprendizado de Máquina , Simulação de Dinâmica Molecular
17.
J Chem Phys ; 154(22): 224103, 2021 Jun 14.
Artigo em Inglês | MEDLINE | ID: mdl-34241239

RESUMO

The message passing neural network (MPNN) framework is a promising tool for modeling atomic properties but is, until recently, incompatible with directional properties, such as Cartesian tensors. We propose a modified Cartesian MPNN (CMPNN) suitable for predicting atom-centered multipoles, an essential component of ab initio force fields. The efficacy of this model is demonstrated on a newly developed dataset consisting of 46 623 chemical structures and corresponding high-quality atomic multipoles, which was deposited into the publicly available Molecular Sciences Software Institute QCArchive server. We show that the CMPNN accurately predicts atom-centered charges, dipoles, and quadrupoles and that errors in the predicted atomic multipoles have a negligible effect on multipole-multipole electrostatic energies. The CMPNN is accurate enough to model conformational dependencies of a molecule's electronic structure. This opens up the possibility of recomputing atomic multipoles on the fly throughout a simulation in which they might exhibit strong conformational dependence.

18.
J Chem Phys ; 154(18): 184110, 2021 May 14.
Artigo em Inglês | MEDLINE | ID: mdl-34241025

RESUMO

Computation of intermolecular interactions is a challenge in drug discovery because accurate ab initio techniques are too computationally expensive to be routinely applied to drug-protein models. Classical force fields are more computationally feasible, and force fields designed to match symmetry adapted perturbation theory (SAPT) interaction energies can remain accurate in this context. Unfortunately, the application of such force fields is complicated by the laborious parameterization required for computations on new molecules. Here, we introduce the component-based machine-learned intermolecular force field (CLIFF), which combines accurate, physics-based equations for intermolecular interaction energies with machine-learning models to enable automatic parameterization. The CLIFF uses functional forms corresponding to electrostatic, exchange-repulsion, induction/polarization, and London dispersion components in SAPT. Molecule-independent parameters are fit with respect to SAPT2+(3)δMP2/aug-cc-pVTZ, and molecule-dependent atomic parameters (atomic widths, atomic multipoles, and Hirshfeld ratios) are obtained from machine learning models developed for C, N, O, H, S, F, Cl, and Br. The CLIFF achieves mean absolute errors (MAEs) no worse than 0.70 kcal mol-1 in both total and component energies across a diverse dimer test set. For the side chain-side chain interaction database derived from protein fragments, the CLIFF produces total interaction energies with an MAE of 0.27 kcal mol-1 with respect to reference data, outperforming similar and even more expensive methods. In applications to a set of model drug-protein interactions, the CLIFF is able to accurately rank-order ligand binding strengths and achieves less than 10% error with respect to SAPT reference values for most complexes.

19.
J Chem Phys ; 154(23): 234107, 2021 Jun 21.
Artigo em Inglês | MEDLINE | ID: mdl-34241276

RESUMO

Symmetry-adapted perturbation theory (SAPT) has become an invaluable tool for studying the fundamental nature of non-covalent interactions by directly computing the electrostatics, exchange (steric) repulsion, induction (polarization), and London dispersion contributions to the interaction energy using quantum mechanics. Further application of SAPT is primarily limited by its computational expense, where even its most affordable variant (SAPT0) scales as the fifth power of system size [O(N5)] due to the dispersion terms. The algorithmic scaling of SAPT0 is reduced from O(N5)→O(N4) by replacing these terms with the empirical D3 dispersion correction of Grimme and co-workers, forming a method that may be termed SAPT0-D3. Here, we optimize the damping parameters for the -D3 terms in SAPT0-D3 using a much larger training set than has previously been considered, namely, 8299 interaction energies computed at the complete-basis-set limit of coupled cluster through perturbative triples [CCSD(T)/CBS]. Perhaps surprisingly, with only three fitted parameters, SAPT0-D3 improves on the accuracy of SAPT0, reducing mean absolute errors from 0.61 to 0.49 kcal mol-1 over the full set of complexes. Additionally, SAPT0-D3 exhibits a nearly 2.5× speedup over conventional SAPT0 for systems with ∼300 atoms and is applied here to systems with up to 459 atoms. Finally, we have also implemented a functional group partitioning of the approach (F-SAPT0-D3) and applied it to determine important contacts in the binding of salbutamol to G-protein coupled ß1-adrenergic receptor in both active and inactive forms. SAPT0-D3 capabilities have been added to the open-source Psi4 software.


Assuntos
Teoria Quântica , Algoritmos , Eletricidade Estática
20.
J Chem Phys ; 155(20): 204801, 2021 Nov 28.
Artigo em Inglês | MEDLINE | ID: mdl-34852489

RESUMO

Community efforts in the computational molecular sciences (CMS) are evolving toward modular, open, and interoperable interfaces that work with existing community codes to provide more functionality and composability than could be achieved with a single program. The Quantum Chemistry Common Driver and Databases (QCDB) project provides such capability through an application programming interface (API) that facilitates interoperability across multiple quantum chemistry software packages. In tandem with the Molecular Sciences Software Institute and their Quantum Chemistry Archive ecosystem, the unique functionalities of several CMS programs are integrated, including CFOUR, GAMESS, NWChem, OpenMM, Psi4, Qcore, TeraChem, and Turbomole, to provide common computational functions, i.e., energy, gradient, and Hessian computations as well as molecular properties such as atomic charges and vibrational frequency analysis. Both standard users and power users benefit from adopting these APIs as they lower the language barrier of input styles and enable a standard layout of variables and data. These designs allow end-to-end interoperable programming of complex computations and provide best practices options by default.

SELEÇÃO DE REFERÊNCIAS
DETALHE DA PESQUISA