Búsqueda | BVS CLAP/SMR-OPS/OMS

1.

Efficient approximation of molecular kinetics using random Fourier features.

Nüske, Feliks; Klus, Stefan.

J Chem Phys ; 159(7)2023 Aug 21.

Artículo en Inglés | MEDLINE | ID: mdl-37594070

RESUMEN

Slow kinetic processes in molecular systems can be analyzed by computing the dominant eigenpairs of the Koopman operator or its generator. In this context, the Variational Approach to Markov Processes (VAMP) provides a rigorous way of discerning the quality of different approximate models. Kernel methods have been shown to provide accurate and robust estimates for slow kinetic processes, but they are sensitive to hyper-parameter selection and require the solution of large-scale generalized eigenvalue problems, which can easily become computationally demanding for large data sizes. In this contribution, we employ a stochastic approximation of the kernel based on random Fourier features (RFFs) to derive a small-scale dual eigenvalue problem that can be easily solved. We provide an interpretation of this procedure in terms of a finite, randomly generated basis set. By combining the RFF approach and model selection by means of the VAMP score, we show that kernel parameters can be efficiently tuned and accurate estimates of slow molecular kinetics can be obtained for several benchmarking systems, such as deca alanine and the NTL9 protein.

2.

Spectral Properties of Effective Dynamics from Conditional Expectations.

Nüske, Feliks; Koltai, Péter; Boninsegna, Lorenzo; Clementi, Cecilia.

Entropy (Basel) ; 23(2)2021 Jan 21.

Artículo en Inglés | MEDLINE | ID: mdl-33494443

RESUMEN

The reduction of high-dimensional systems to effective models on a smaller set of variables is an essential task in many areas of science. For stochastic dynamics governed by diffusion processes, a general procedure to find effective equations is the conditioning approach. In this paper, we are interested in the spectrum of the generator of the resulting effective dynamics, and how it compares to the spectrum of the full generator. We prove a new relative error bound in terms of the eigenfunction approximation error for reversible systems. We also present numerical examples indicating that, if Kramers-Moyal (KM) type approximations are used to compute the spectrum of the reduced generator, it seems largely insensitive to the time window used for the KM estimators. We analyze the implications of these observations for systems driven by underdamped Langevin dynamics, and show how meaningful effective dynamics can be defined in this setting.

3.

Kernel-Based Approximation of the Koopman Generator and Schrödinger Operator.

Klus, Stefan; Nüske, Feliks; Hamzi, Boumediene.

Entropy (Basel) ; 22(7)2020 Jun 30.

Artículo en Inglés | MEDLINE | ID: mdl-33286494

RESUMEN

Many dimensionality and model reduction techniques rely on estimating dominant eigenfunctions of associated dynamical operators from data. Important examples include the Koopman operator and its generator, but also the Schrödinger operator. We propose a kernel-based method for the approximation of differential operators in reproducing kernel Hilbert spaces and show how eigenfunctions can be estimated by solving auxiliary matrix eigenvalue problems. The resulting algorithms are applied to molecular dynamics and quantum chemistry examples. Furthermore, we exploit that, under certain conditions, the Schrödinger operator can be transformed into a Kolmogorov backward operator corresponding to a drift-diffusion process and vice versa. This allows us to apply methods developed for the analysis of high-dimensional stochastic differential equations to quantum mechanical systems.

4.

Coarse-graining molecular systems by spectral matching.

Nüske, Feliks; Boninsegna, Lorenzo; Clementi, Cecilia.

J Chem Phys ; 151(4): 044116, 2019 Jul 28.

Artículo en Inglés | MEDLINE | ID: mdl-31370528

RESUMEN

Coarse-graining has become an area of tremendous importance within many different research fields. For molecular simulation, coarse-graining bears the promise of finding simplified models such that long-time simulations of large-scale systems become computationally tractable. While significant progress has been made in tuning thermodynamic properties of reduced models, it remains a key challenge to ensure that relevant kinetic properties are retained by coarse-grained dynamical systems. In this study, we focus on data-driven methods to preserve the rare-event kinetics of the original system and make use of their close connection to the low-lying spectrum of the system's generator. Building on work by Crommelin and Vanden-Eijnden [Multiscale Model. Simul. 9, 1588 (2011)], we present a general framework, called spectral matching, which directly targets the generator's leading eigenvalue equations when learning parameters for coarse-grained models. We discuss different parametric models for effective dynamics and derive the resulting data-based regression problems. We show that spectral matching can be used to learn effective potentials which retain the slow dynamics but also to correct the dynamics induced by existing techniques, such as force matching.

5.

Sparse learning of stochastic dynamical equations.

Boninsegna, Lorenzo; Nüske, Feliks; Clementi, Cecilia.

J Chem Phys ; 148(24): 241723, 2018 Jun 28.

Artículo en Inglés | MEDLINE | ID: mdl-29960307

RESUMEN

With the rapid increase of available data for complex systems, there is great interest in the extraction of physically relevant information from massive datasets. Recently, a framework called Sparse Identification of Nonlinear Dynamics (SINDy) has been introduced to identify the governing equations of dynamical systems from simulation data. In this study, we extend SINDy to stochastic dynamical systems which are frequently used to model biophysical processes. We prove the asymptotic correctness of stochastic SINDy in the infinite data limit, both in the original and projected variables. We discuss algorithms to solve the sparse regression problem arising from the practical implementation of SINDy and show that cross validation is an essential tool to determine the right level of sparsity. We demonstrate the proposed methodology on two test systems, namely, the diffusion in a one-dimensional potential and the projected dynamics of a two-dimensional diffusion process.

6.

Quantitative comparison of adaptive sampling methods for protein dynamics.

Hruska, Eugen; Abella, Jayvee R; Nüske, Feliks; Kavraki, Lydia E; Clementi, Cecilia.

J Chem Phys ; 149(24): 244119, 2018 Dec 28.

Artículo en Inglés | MEDLINE | ID: mdl-30599712

RESUMEN

Adaptive sampling methods, often used in combination with Markov state models, are becoming increasingly popular for speeding up rare events in simulation such as molecular dynamics (MD) without biasing the system dynamics. Several adaptive sampling strategies have been proposed, but it is not clear which methods perform better for different physical systems. In this work, we present a systematic evaluation of selected adaptive sampling strategies on a wide selection of fast folding proteins. The adaptive sampling strategies were emulated using models constructed on already existing MD trajectories. We provide theoretical limits for the sampling speed-up and compare the performance of different strategies with and without using some a priori knowledge of the system. The results show that for different goals, different adaptive sampling strategies are optimal. In order to sample slow dynamical processes such as protein folding without a priori knowledge of the system, a strategy based on the identification of a set of metastable regions is consistently the most efficient, while a strategy based on the identification of microstates performs better if the goal is to explore newer regions of the conformational space. Interestingly, the maximum speed-up achievable for the adaptive sampling of slow processes increases for proteins with longer folding times, encouraging the application of these methods for the characterization of slower processes, beyond the fast-folding proteins considered here.

Asunto(s)

Simulación de Dinámica Molecular , Proteínas/química , Conformación Proteica , Pliegue de Proteína

7.

Variational Koopman models: Slow collective variables and molecular kinetics from short off-equilibrium simulations.

Wu, Hao; Nüske, Feliks; Paul, Fabian; Klus, Stefan; Koltai, Péter; Noé, Frank.

J Chem Phys ; 146(15): 154104, 2017 Apr 21.

Artículo en Inglés | MEDLINE | ID: mdl-28433026

RESUMEN

Markov state models (MSMs) and master equation models are popular approaches to approximate molecular kinetics, equilibria, metastable states, and reaction coordinates in terms of a state space discretization usually obtained by clustering. Recently, a powerful generalization of MSMs has been introduced, the variational approach conformation dynamics/molecular kinetics (VAC) and its special case the time-lagged independent component analysis (TICA), which allow us to approximate slow collective variables and molecular kinetics by linear combinations of smooth basis functions or order parameters. While it is known how to estimate MSMs from trajectories whose starting points are not sampled from an equilibrium ensemble, this has not yet been the case for TICA and the VAC. Previous estimates from short trajectories have been strongly biased and thus not variationally optimal. Here, we employ the Koopman operator theory and the ideas from dynamic mode decomposition to extend the VAC and TICA to non-equilibrium data. The main insight is that the VAC and TICA provide a coefficient matrix that we call Koopman model, as it approximates the underlying dynamical (Koopman) operator in conjunction with the basis set used. This Koopman model can be used to compute a stationary vector to reweight the data to equilibrium. From such a Koopman-reweighted sample, equilibrium expectation values and variationally optimal reversible Koopman models can be constructed even with short simulations. The Koopman model can be used to propagate densities, and its eigenvalue decomposition provides estimates of relaxation time scales and slow collective variables for dimension reduction. Koopman models are generalizations of Markov state models, TICA, and the linear VAC and allow molecular kinetics to be described without a cluster discretization.

8.

Variational tensor approach for approximating the rare-event kinetics of macromolecular systems.

Nüske, Feliks; Schneider, Reinhold; Vitalini, Francesca; Noé, Frank.

J Chem Phys ; 144(5): 054105, 2016 Feb 07.

Artículo en Inglés | MEDLINE | ID: mdl-26851906

RESUMEN

Essential information about the stationary and slow kinetic properties of macromolecules is contained in the eigenvalues and eigenfunctions of the dynamical operator of the molecular dynamics. A recent variational formulation allows to optimally approximate these eigenvalues and eigenfunctions when a basis set for the eigenfunctions is provided. In this study, we propose that a suitable choice of basis functions is given by products of one-coordinate basis functions, which describe changes along internal molecular coordinates, such as dihedral angles or distances. A sparse tensor product approach is employed in order to avoid a combinatorial explosion of products, i.e., of the basis set size. Our results suggest that the high-dimensional eigenfunctions can be well approximated with relatively small basis set sizes.

9.

Slicing and Dicing: Optimal Coarse-Grained Representation to Preserve Molecular Kinetics.

Yang, Wangfei; Templeton, Clark; Rosenberger, David; Bittracher, Andreas; Nüske, Feliks; Noé, Frank; Clementi, Cecilia.

ACS Cent Sci ; 9(2): 186-196, 2023 Feb 22.

Artículo en Inglés | MEDLINE | ID: mdl-36844497

RESUMEN

The aim of molecular coarse-graining approaches is to recover relevant physical properties of the molecular system via a lower-resolution model that can be more efficiently simulated. Ideally, the lower resolution still accounts for the degrees of freedom necessary to recover the correct physical behavior. The selection of these degrees of freedom has often relied on the scientist's chemical and physical intuition. In this article, we make the argument that in soft matter contexts desirable coarse-grained models accurately reproduce the long-time dynamics of a system by correctly capturing the rare-event transitions. We propose a bottom-up coarse-graining scheme that correctly preserves the relevant slow degrees of freedom, and we test this idea for three systems of increasing complexity. We show that in contrast to this method existing coarse-graining schemes such as those from information theory or structure-based approaches are not able to recapitulate the slow time scales of the system.

10.

Rapid Calculation of Molecular Kinetics Using Compressed Sensing.

Litzinger, Florian; Boninsegna, Lorenzo; Wu, Hao; Nüske, Feliks; Patel, Raajen; Baraniuk, Richard; Noé, Frank; Clementi, Cecilia.

J Chem Theory Comput ; 14(5): 2771-2783, 2018 May 08.

Artículo en Inglés | MEDLINE | ID: mdl-29660273

RESUMEN

Recent methods for the analysis of molecular kinetics from massive molecular dynamics (MD) data rely on the solution of very large eigenvalue problems. Here we build upon recent results from the field of compressed sensing and develop the spectral oASIS method, a highly efficient approach to approximate the leading eigenvalues and eigenvectors of large generalized eigenvalue problems without ever having to evaluate the full matrices. The approach is demonstrated to reduce the dimensionality of the problem by 1 or 2 orders of magnitude, directly leading to corresponding savings in the computation and storage of the necessary matrices and a speedup of 2 to 4 orders of magnitude in solving the eigenvalue problem. We demonstrate the method on extensive data sets of protein conformational changes and protein-ligand binding using the variational approach to conformation dynamics (VAC) and time-lagged independent component analysis (TICA). Our approach can also be applied to kernel formulations of VAC, TICA, and extended dynamic mode decomposition (EDMD).

11.

Variational Approach to Molecular Kinetics.

Nüske, Feliks; Keller, Bettina G; Pérez-Hernández, Guillermo; Mey, Antonia S J S; Noé, Frank.

J Chem Theory Comput ; 10(4): 1739-52, 2014 Apr 08.

Artículo en Inglés | MEDLINE | ID: mdl-26580382

RESUMEN

The eigenvalues and eigenvectors of the molecular dynamics propagator (or transfer operator) contain the essential information about the molecular thermodynamics and kinetics. This includes the stationary distribution, the metastable states, and state-to-state transition rates. Here, we present a variational approach for computing these dominant eigenvalues and eigenvectors. This approach is analogous to the variational approach used for computing stationary states in quantum mechanics. A corresponding method of linear variation is formulated. It is shown that the matrices needed for the linear variation method are correlation matrices that can be estimated from simple MD simulations for a given basis set. The method proposed here is thus to first define a basis set able to capture the relevant conformational transitions, then compute the respective correlation matrices, and then to compute their dominant eigenvalues and eigenvectors, thus obtaining the key ingredients of the slow kinetics.

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

Asunto(s)

RESUMEN

RESUMEN

RESUMEN

RESUMEN

RESUMEN

ENVIAR RESULTADO:

SELECCIÓN DE REFERENCIAS

DETALLE DE LA BÚSQUEDA