RESUMEN
The development of new materials typically involves a process of trial and error, guided by insights from past experimental and theoretical findings. The inverse design approach for soft-matter systems has the potential to optimize specific physical parameters, such as particle interactions, particle shape, or composition and packing fraction. This optimization aims to facilitate the spontaneous formation of specific target structures through self-assembly. In this study, we expand upon a recently introduced inverse design protocol for monodisperse systems to identify the required conditions and interactions for assembling crystal and quasicrystal phases within a binary mixture of two distinct species. This method utilizes an evolution algorithm to identify the optimal state point and interaction parameters, enabling the self-assembly of the desired structure. In addition, we employ a convolutional neural network (CNN) that classifies different phases based on their diffraction patterns, serving as a fitness function for the desired structure. Using our protocol, we successfully inverse design two-dimensional crystalline structures, including a hexagonal lattice and a dodecagonal quasicrystal, within a non-additive binary mixture of hard disks. Finally, we introduce a symmetry-based order parameter that leverages the encoded symmetry within the diffraction pattern. This order parameter circumvents the need for training a CNN and is used as a fitness function to inverse design an octagonal quasicrystal.
RESUMEN
Hoogsteen (HG) base pairing is characterized by a 180° rotation of the purine base with respect to the Watson-Crick-Franklin (WCF) motif. Recently, it has been found that both conformations coexist in a dynamical equilibrium and that several biological functions require HG pairs. This relevance has motivated experimental and computational investigations of the base-pairing transition. However, a systematic simulation of sequence variations has remained out of reach. Here, we employ advanced path-based methods to perform unprecedented free-energy calculations. Our methodology enables us to study the different mechanisms of purine rotation, either remaining inside or after flipping outside of the double helix. We study seven different sequences, which are neighbor variations of a well-studied Aâ T pair in A6-DNA. We observe the known effect of Aâ T steps favoring HG stability, and find evidence of triple-hydrogen-bonded neighbors hindering the inside transition. More importantly, we identify a dominant factor: the direction of the A rotation, with the 6-ring pointing either towards the longer or shorter segment of the chain, respectively relating to a lower or higher barrier. This highlights the role of DNA's relative flexibility as a modulator of the WCF/HG dynamic equilibrium. Additionally, we provide a robust methodology for future HG proclivity studies.
Asunto(s)
ADN , Purinas , Emparejamiento Base , ADN/química , ADN/genética , Enlace de Hidrógeno , Conformación Molecular , Conformación de Ácido Nucleico , TermodinámicaRESUMEN
DNA predominantly contains Watson-Crick (WC) base pairs, but a non-negligible fraction of base pairs are in the Hoogsteen (HG) hydrogen bonding motif at any time. In HG, the purine is rotated â¼180° relative to the WC motif. The transitions between WC and HG may play a role in recognition and replication, but are difficult to investigate experimentally because they occur quickly, but only rarely. To gain insight into the mechanisms for this process, we performed transition path sampling simulations on a model nucleotide sequence in which an AT pair changes from WC to HG. This transition can occur in two ways, both starting with loss of hydrogen bonds in the base pair, followed by rotation around the glycosidic bond. In one route the adenine base converts from WC to HG geometry while remaining entirely within the double helix. The other route involves the adenine leaving the confines of the double helix and interacting with water. Our results indicate that this outside route is more probable. We used transition interface sampling to compute rate constants and relative free energies for the transitions between WC and HG. Our results agree with experiments, and provide highly detailed insights into the mechanisms of this important process.
Asunto(s)
Emparejamiento Base , Secuencia de Bases , ADN/química , Enlace de Hidrógeno , TermodinámicaRESUMEN
With the continual improvement of computing hardware and algorithms, simulations have become a powerful tool for understanding all sorts of (bio)molecular processes. To handle the large simulation data sets and to accelerate slow, activated transitions, a condensed set of descriptors, or collective variables (CVs), is needed to discern the relevant dynamics that describes the molecular process of interest. However, proposing an adequate set of CVs that can capture the intrinsic reaction coordinate of the molecular transition is often extremely difficult. Here, we present a framework to find an optimal set of CVs from a pool of candidates using a combination of artificial neural networks and genetic algorithms. The approach effectively replaces the encoder of an autoencoder network with genes to represent the latent space, i.e., the CVs. Given a selection of CVs as input, the network is trained to recover the atom coordinates underlying the CV values at points along the transition. The network performance is used as an estimator of the fitness of the input CVs. Two genetic algorithms optimize the CV selection and the neural network architecture. The successful retrieval of optimal CVs by this framework is illustrated at the hand of two case studies: the well-known conformational change in the alanine dipeptide molecule and the more intricate transition of a base pair in B-DNA from the classic Watson-Crick pairing to the alternative Hoogsteen pairing. Key advantages of our framework include the following: optimal interpretable CVs, avoiding costly calculation of committor or time-correlation functions, and automatic hyperparameter optimization. In addition, we show that applying a time-delay between the network input and output allows for enhanced selection of slow variables. Moreover, the network can also be used to generate molecular configurations of unexplored microstates, for example, for augmentation of the simulation data.
Asunto(s)
Alanina/química , Algoritmos , Dipéptidos/química , Redes Neurales de la Computación , Alanina/genética , Dipéptidos/genéticaRESUMEN
The biological functions of natural polyelectrolytes are strongly influenced by the presence of ions, which bind to the polymer chains and thereby modify their properties. Although the biological impact of such modifications is well recognized, a detailed molecular picture of the binding process and of the mechanisms that drive the subsequent structural changes in the polymer is lacking. Here, we study the molecular mechanism of the condensation of calcium, a divalent cation, on hyaluronan, a ubiquitous polymer in human tissues. By combining two-dimensional infrared spectroscopy experiments with molecular dynamics simulations, we find that calcium specifically binds to hyaluronan at millimolar concentrations. Because of its large size and charge, the calcium cation can bind simultaneously to the negatively charged carboxylate group and the amide group of adjacent saccharide units. Molecular dynamics simulations and single-chain force spectroscopy measurements provide evidence that the binding of the calcium ions weakens the intramolecular hydrogen-bond network of hyaluronan, increasing the flexibility of the polymer chain. We also observe that the binding of calcium to hyaluronan saturates at a maximum binding fraction of â¼10-15 mol %. This saturation indicates that the binding of Ca2+ strongly reduces the probability of subsequent binding of Ca2+ at neighboring binding sites, possibly as a result of enhanced conformational fluctuations and/or electrostatic repulsion effects. Our findings provide a detailed molecular picture of ion condensation and reveal the severe effect of a few, selective and localized electrostatic interactions on the rigidity of a polyelectrolyte chain.
RESUMEN
This perspective article highlights the challenges in the theoretical description of photoreceptor proteins using multiscale modeling, as discussed at the CECAM workshop in Tel Aviv, Israel. The participants have identified grand challenges and discussed the development of new tools to address them. Recent progress in understanding representative proteins such as green fluorescent protein, photoactive yellow protein, phytochrome, and rhodopsin is presented, along with methodological developments.
Asunto(s)
Proteínas Bacterianas/química , Proteínas Fluorescentes Verdes/química , Modelos Moleculares , Fotorreceptores Microbianos/química , Fitocromo/química , Rodopsina/química , Distribución de Poisson , Teoría Cuántica , Electricidad EstáticaRESUMEN
The carboxyl (COOH) side chain groups of amino acids, such as aspartic acid, play an important role in biochemical processes, including enzymatic proton transport. In many theoretical studies, it was found that the (bio)chemical reactivity of the carboxyl group strongly depends on the conformation of this group. Interestingly, up to now there has been no experimental investigation of the geometry and the stability of different COOH conformers under biorelevant conditions. Here, we investigate the conformational isomerism of the side chain COOH group of N-acetyl aspartic acid amide using polarization-resolved two-dimensional infrared spectroscopy. We find that the carboxyl group shows two distinct near-planar conformers (syn and anti) when dissolved in water at room temperature. Both conformers are significantly populated in aqueous solution (75 ± 10% and 25 ± 10% for syn and anti, respectively). Molecular dynamics simulations show that the anti conformer interacts more strongly with water molecules than the syn conformer, explaining why this conformer is significantly present in aqueous solution.
Asunto(s)
Péptidos/química , Amidas/química , Isomerismo , Modelos Moleculares , Conformación Molecular , Espectrofotometría InfrarrojaRESUMEN
In the past decade, great progress has been made in the development of enhanced sampling methods, aimed at overcoming the time-scale limitations of molecular dynamics (MD) simulations. Many sampling schemes rely on adding an external bias to favor the sampling of transitions and to estimate the underlying free energy landscape. Nevertheless, sampling molecular processes described by many order parameters, or collective variables (CVs), such as complex biomolecular transitions, remains often very challenging. The computational cost has a prohibitive scaling with the dimensionality of the CV-space. Inspiration can be taken from methods that focus on localizing transition pathways: the CV-space can be projected onto a path-CV that connects two stable states, and a bias can be exerted onto a one-dimensional parameter that captures the progress of the transition along the path-CV. In principle, such a sampling scheme can handle an arbitrarily large number of CVs. A standard enhanced sampling technique combined with an adaptive path-CV can then locate the mean transition pathway and obtain the free energy profile along the path. In this chapter, we discuss the adaptive path-CV formalism and its numerical implementation. We apply the path-CV with several enhanced sampling methods-steered MD, metadynamics, and umbrella sampling-to a biologically relevant process: the Watson-Crick to Hoogsteen base-pairing transition in double-stranded DNA. A practical guide is provided on how to recognize and circumvent possible pitfalls during the calculation of a free energy landscape that contains multiple pathways. Examples are presented on how to perform enhanced sampling simulations using PLUMED, a versatile plugin that can work with many popular MD engines.