ABSTRACT
Modeling the rate at which adaptive phenotypes appear in a population is a key to predicting evolutionary processes. Given random mutations, should this rate be modeled by a simple Poisson process, or is a more complex dynamics needed? Here we use analytic calculations and simulations of evolving populations on explicit genotype-phenotype maps to show that the introduction of novel phenotypes can be "bursty" or overdispersed. In other words, a novel phenotype either appears multiple times in quick succession or not at all for many generations. These bursts are fundamentally caused by statistical fluctuations and other structure in the map from genotypes to phenotypes. Their strength depends on population parameters, being highest for "monomorphic" populations with low mutation rates. They can also be enhanced by additional inhomogeneities in the mapping from genotypes to phenotypes. We mainly investigate the effect of bursts using the well-studied genotype-phenotype map for RNA secondary structure, but find similar behavior in a lattice protein model and in Richard Dawkins's biomorphs model of morphological development. Bursts can profoundly affect adaptive dynamics. Most notably, they imply that fitness differences play a smaller role in determining which phenotype fixes than would be the case for a Poisson process without bursts.
Subject(s)
Models, Genetic , Phenotype , Genotype , Computer Simulation , Adaptation, Physiological/genetics , Evolution, Molecular , Mutation , Biological Evolution , Poisson Distribution , RNA/genetics , Adaptation, Biological/geneticsABSTRACT
Biomorphs, Richard Dawkins's iconic model of morphological evolution, are traditionally used to demonstrate the power of natural selection to generate biological order from random mutations. Here we show that biomorphs can also be used to illustrate how developmental bias shapes adaptive evolutionary outcomes. In particular, we find that biomorphs exhibit phenotype bias, a type of developmental bias where certain phenotypes can be many orders of magnitude more likely than others to appear through random mutations. Moreover, this bias exhibits a strong preference for simpler phenotypes with low descriptional complexity. Such bias towards simplicity is formalised by an information-theoretic principle that can be intuitively understood from a picture of evolution randomly searching in the space of algorithms. By using population genetics simulations, we demonstrate how moderately adaptive phenotypic variation that appears more frequently upon random mutations can fix at the expense of more highly adaptive biomorph phenotypes that are less frequent. This result, as well as many other patterns found in the structure of variation for the biomorphs, such as high mutational robustness and a positive correlation between phenotype evolvability and robustness, closely resemble findings in molecular genotype-phenotype maps. Many of these patterns can be explained with an analytic model based on constrained and unconstrained sections of the genome. We postulate that the phenotype bias towards simplicity and other patterns biomorphs share with molecular genotype-phenotype maps may hold more widely for developmental systems.
Subject(s)
Genetics, Population , Selection, Genetic , Genotype , Phenotype , Mutation , Biological Evolution , Evolution, Molecular , Models, GeneticABSTRACT
SignificanceWhy does evolution favor symmetric structures when they only represent a minute subset of all possible forms? Just as monkeys randomly typing into a computer language will preferentially produce outputs that can be generated by shorter algorithms, so the coding theorem from algorithmic information theory predicts that random mutations, when decoded by the process of development, preferentially produce phenotypes with shorter algorithmic descriptions. Since symmetric structures need less information to encode, they are much more likely to appear as potential variation. Combined with an arrival-of-the-frequent mechanism, this algorithmic bias predicts a much higher prevalence of low-complexity (high-symmetry) phenotypes than follows from natural selection alone and also explains patterns observed in protein complexes, RNA secondary structures, and a gene regulatory network.
Subject(s)
Biological Evolution , Information Theory , Selection, Genetic , Algorithms , Gene Regulatory Networks , PhenotypeABSTRACT
We introduce oxNA, a new model for the simulation of DNA-RNA hybrids that is based on two previously developed coarse-grained models-oxDNA and oxRNA. The model naturally reproduces the physical properties of hybrid duplexes, including their structure, persistence length, and force-extension characteristics. By parameterizing the DNA-RNA hydrogen bonding interaction, we fit the model's thermodynamic properties to experimental data using both average-sequence and sequence-dependent parameters. To demonstrate the model's applicability, we provide three examples of its use-calculating the free energy profiles of hybrid strand displacement reactions, studying the resolution of a short R-loop, and simulating RNA-scaffolded wireframe origami.
Subject(s)
DNA , RNA , RNA/chemistry , Nucleic Acid Conformation , DNA/chemistry , Molecular Dynamics Simulation , SoftwareABSTRACT
Thymine dimers are a major mutagenic photoproduct induced by UV radiation. While they have been the subject of extensive theoretical and experimental investigations, questions of how DNA supercoiling affects local defect properties, or, conversely, how the presence of such defects changes global supercoiled structure, are largely unexplored. Here, we introduce a model of thymine dimers in the oxDNA forcefield, parametrized by comparison to melting experiments and structural measurements of the thymine dimer induced bend angle. We performed extensive molecular dynamics simulations of double-stranded DNA as a function of external twist and force. Compared to undamaged DNA, the presence of a thymine dimer lowers the supercoiling densities at which plectonemes and bubbles occur. For biologically relevant supercoiling densities and forces, thymine dimers can preferentially segregate to the tips of the plectonemes, where they enhance the probability of a localized tip-bubble. This mechanism increases the probability of highly bent and denatured states at the thymine dimer site, which may facilitate repair enzyme binding. Thymine dimer-induced tip-bubbles also pin plectonemes, which may help repair enzymes to locate damage. We hypothesize that the interplay of supercoiling and local defects plays an important role for a wider set of DNA damage repair systems.
Subject(s)
DNA, Superhelical/chemistry , Pyrimidine Dimers , Thymine , DNA Damage , DNA Repair , Nucleic Acid Conformation , Pyrimidine Dimers/chemistry , Ultraviolet RaysABSTRACT
Arguments inspired by algorithmic information theory predict an inverse relation between the probability and complexity of output patterns in a wide range of input-output maps. This phenomenon is known as simplicity bias. By viewing the parameters of dynamical systems as inputs, and the resulting (digitised) trajectories as outputs, we study simplicity bias in the logistic map, Gauss map, sine map, Bernoulli map, and tent map. We find that the logistic map, Gauss map, and sine map all exhibit simplicity bias upon sampling of map initial values and parameter values, but the Bernoulli map and tent map do not. The simplicity bias upper bound on the output pattern probability is used to make a priori predictions regarding the probability of output patterns. In some cases, the predictions are surprisingly accurate, given that almost no details of the underlying dynamical systems are assumed. More generally, we argue that studying probability-complexity relationships may be a useful tool when studying patterns in dynamical systems.
ABSTRACT
Morphospaces-representations of phenotypic characteristics-are often populated unevenly, leaving large parts unoccupied. Such patterns are typically ascribed to contingency, or else to natural selection disfavoring certain parts of the morphospace. The extent to which developmental bias, the tendency of certain phenotypes to preferentially appear as potential variation, also explains these patterns is hotly debated. Here we demonstrate quantitatively that developmental bias is the primary explanation for the occupation of the morphospace of RNA secondary structure (SS) shapes. Upon random mutations, some RNA SS shapes (the frequent ones) are much more likely to appear than others. By using the RNAshapes method to define coarse-grained SS classes, we can directly compare the frequencies that noncoding RNA SS shapes appear in the RNAcentral database to frequencies obtained upon a random sampling of sequences. We show that: 1) only the most frequent structures appear in nature; the vast majority of possible structures in the morphospace have not yet been explored; 2) remarkably small numbers of random sequences are needed to produce all the RNA SS shapes found in nature so far; and 3) perhaps most surprisingly, the natural frequencies are accurately predicted, over several orders of magnitude in variation, by the likelihood that structures appear upon a uniform random sampling of sequences. The ultimate cause of these patterns is not natural selection, but rather a strong phenotype bias in the RNA genotype-phenotype map, a type of developmental bias or "findability constraint," which limits evolutionary dynamics to a hugely reduced subset of structures that are easy to "find."
Subject(s)
Biological Evolution , RNA , Mutation , Phenotype , RNA/genetics , Selection, GeneticABSTRACT
We use the oxDNA coarse-grained model to provide a detailed characterization of the fundamental structural properties of DNA origami, focussing on archetypal 2D and 3D origami. The model reproduces well the characteristic pattern of helix bending in a 2D origami, showing that it stems from the intrinsic tendency of anti-parallel four-way junctions to splay apart, a tendency that is enhanced both by less screened electrostatic interactions and by increased thermal motion. We also compare to the structure of a 3D origami whose structure has been determined by cryo-electron microscopy. The oxDNA average structure has a root-mean-square deviation from the experimental structure of 8.4 Å, which is of the order of the experimental resolution. These results illustrate that the oxDNA model is capable of providing detailed and accurate insights into the structure of DNA origami, and has the potential to be used to routinely pre-screen putative origami designs and to investigate the molecular mechanisms that regulate the properties of DNA origami.
Subject(s)
DNA, Cruciform/chemistry , DNA/ultrastructure , Nucleic Acid Conformation , Cryoelectron Microscopy , Crystallography, X-Ray , DNA/chemistry , DNA, Cruciform/genetics , DNA, Cruciform/ultrastructure , Molecular Dynamics SimulationABSTRACT
DNA self-assembly allows the construction of nanometre-scale structures and devices. Structures with thousands of unique components are routinely assembled in good yield. Experimental progress has been rapid, based largely on empirical design rules. Herein, we demonstrate a DNA origami technique designed as a model system with which to explore the mechanism of assembly. The origami fold is controlled through single-stranded loops embedded in a double-stranded DNA template and is programmed by a set of double-stranded linkers that specify pairwise interactions between loop sequences. Assembly is via T-junctions formed by hybridization of single-stranded overhangs on the linkers with the loops. The sequence of loops on the template and the set of interaction rules embodied in the linkers can be reconfigured with ease. We show that a set of just two interaction rules can be used to assemble simple T-junction origami motifs and that assembly can be performed at room temperature.
ABSTRACT
Simulations of nucleic acids at different levels of structural details are increasingly used to complement and interpret experiments in different fields, from biophysics to medicine and materials science. However, the various structural models currently available for DNA and RNA and their accompanying suites of computational tools can be very rarely used in a synergistic fashion. The tacoxDNA webserver and standalone software package presented here are a step toward a long-sought interoperability of nucleic acids models. The webserver offers a simple interface for converting various common input formats of DNA structures and setting up molecular dynamics (MD) simulations. Users can, for instance, design DNA rings with different topologies, such as knots, with and without supercoiling, by simply providing an XYZ coordinate file of the DNA centre-line. More complex DNA geometries, as designable in the cadnano, CanDo and Tiamat tools, can also be converted to all-atom or oxDNA representations, which can then be used to run MD simulations. Though the latter are currently geared toward the native and LAMMPS oxDNA representations, the open-source package is designed to be further expandable. TacoxDNA is available at http://tacoxdna.sissa.it. © 2019 Wiley Periodicals, Inc.
Subject(s)
DNA/chemistry , Internet , Molecular Dynamics Simulation , Nucleic Acid Conformation , SoftwareABSTRACT
Inspired by recent successes using single-stranded DNA tiles to produce complex structures, we develop a two-step coarse-graining approach that uses detailed thermodynamic calculations with oxDNA, a nucleotide-based model of DNA, to parametrize a coarser kinetic model that can reach the time and length scales needed to study the assembly mechanisms of these structures. We test the model by performing a detailed study of the assembly pathways for a two-dimensional target structure made up of 334 unique strands each of which are 42 nucleotides long. Without adjustable parameters, the model reproduces a critical temperature for the formation of the assembly that is close to the temperature at which assembly first occurs in experiments. Furthermore, the model allows us to investigate in detail the nucleation barriers and the distribution of critical nucleus shapes for the assembly of a single target structure. The assembly intermediates are compact and highly connected (although not maximally so), and classical nucleation theory provides a good fit to the height and shape of the nucleation barrier at temperatures close to where assembly first occurs.
Subject(s)
DNA/chemistry , Molecular Dynamics Simulation , Algorithms , Kinetics , Monte Carlo Method , ThermodynamicsABSTRACT
It is well established that gene regulation can be achieved through activator and repressor proteins that bind to DNA and switch particular genes on or off, and that complex metabolic networks determine the levels of transcription of a given gene at a given time. Using three complementary computational techniques to study the sequence-dependence of DNA denaturation within DNA minicircles, we have observed that whenever the ends of the DNA are constrained, information can be transferred over long distances directly by the transmission of mechanical stress through the DNA itself, without any requirement for external signalling factors. Our models combine atomistic molecular dynamics (MD) with coarse-grained simulations and statistical mechanical calculations to span three distinct spatial resolutions and timescale regimes. While they give a consensus view of the non-locality of sequence-dependent denaturation in highly bent and supercoiled DNA loops, each also reveals a unique aspect of long-range informational transfer that occurs as a result of restraining the DNA within the closed loop of the minicircles.
Subject(s)
Computer Simulation , DNA, Circular/chemistry , Models, Molecular , Nucleic Acid Conformation , Stress, Mechanical , Algorithms , DNA, Superhelical/chemistry , Nucleic Acid DenaturationABSTRACT
Mutational neighbourhoods in genotype-phenotype (GP) maps are widely believed to be more likely to share characteristics than expected from random chance. Such genetic correlations should strongly influence evolutionary dynamics. We explore and quantify these intuitions by comparing three GP maps-a model for RNA secondary structure, the HP model for protein tertiary structure, and the Polyomino model for protein quaternary structure-to a simple random null model that maintains the number of genotypes mapping to each phenotype, but assigns genotypes randomly. The mutational neighbourhood of a genotype in these GP maps is much more likely to contain genotypes mapping to the same phenotype than in the random null model. Such neutral correlations can be quantified by the robustness to mutations, which can be many orders of magnitude larger than that of the null model, and crucially, above the critical threshold for the formation of large neutral networks of mutationally connected genotypes which enhance the capacity for the exploration of phenotypic novelty. Thus neutral correlations increase evolvability. We also study non-neutral correlations: Compared to the null model, i) If a particular (non-neutral) phenotype is found once in the 1-mutation neighbourhood of a genotype, then the chance of finding that phenotype multiple times in this neighbourhood is larger than expected; ii) If two genotypes are connected by a single neutral mutation, then their respective non-neutral 1-mutation neighbourhoods are more likely to be similar; iii) If a genotype maps to a folding or self-assembling phenotype, then its non-neutral neighbours are less likely to be a potentially deleterious non-folding or non-assembling phenotype. Non-neutral correlations of type i) and ii) reduce the rate at which new phenotypes can be found by neutral exploration, and so may diminish evolvability, while non-neutral correlations of type iii) may instead facilitate evolutionary exploration and so increase evolvability.
Subject(s)
Evolution, Molecular , Genetics, Population , Models, Genetic , Models, Statistical , Mutation/genetics , Proteome/genetics , Animals , Computer Simulation , Genotype , HumansABSTRACT
The effect of secondary structure on DNA duplex formation is poorly understood. Using oxDNA, a nucleotide level coarse-grained model of DNA, we study how hairpins influence the rate and reaction pathways of DNA hybridzation. We compare to experimental systems studied by Gao et al. (1) and find that 3-base pair hairpins reduce the hybridization rate by a factor of 2, and 4-base pair hairpins by a factor of 10, compared to DNA with limited secondary structure, which is in good agreement with experiments. By contrast, melting rates are accelerated by factors of â¼100 and â¼2000. This surprisingly large speed-up occurs because hairpins form during the melting process, and significantly lower the free energy barrier for dissociation. These results should assist experimentalists in designing sequences to be used in DNA nanotechnology, by putting limits on the suppression of hybridization reaction rates through the use of hairpins and offering the possibility of deliberately increasing dissociation rates by incorporating hairpins into single strands.
Subject(s)
DNA/chemistry , Nucleic Acid Hybridization , Base Pairing , Kinetics , Nucleic Acid Conformation , Nucleic Acid Denaturation , ThermodynamicsABSTRACT
We study the thermodynamics and kinetics of an RNA toehold-mediated strand displacement reaction with a recently developed coarse-grained model of RNA. Strand displacement, during which a single strand displaces a different strand previously bound to a complementary substrate strand, is an essential mechanism in active nucleic acid nanotechnology and has also been hypothesized to occur in vivo. We study the rate of displacement reactions as a function of the length of the toehold and temperature and make two experimentally testable predictions: that the displacement is faster if the toehold is placed at the 5' end of the substrate; and that the displacement slows down with increasing temperature for longer toeholds.
Subject(s)
Molecular Dynamics Simulation , RNA/chemistry , Base Sequence , Molecular Sequence Data , Nucleic Acid ConformationABSTRACT
We study the behaviour of double-stranded RNA under twist and tension using oxRNA, a recently developed coarse-grained model of RNA. Introducing explicit salt-dependence into the model allows us to directly compare our results to data from recent single-molecule experiments. The model reproduces extension curves as a function of twist and stretching force, including the buckling transition and the behaviour of plectoneme structures. For negative supercoiling, we predict denaturation bubble formation in plectoneme end-loops, suggesting preferential plectoneme localisation in weak base sequences. OxRNA exhibits a positive twist-stretch coupling constant, in agreement with recent experimental observations.
Subject(s)
Models, Molecular , Nucleic Acid Conformation , RNA, Double-Stranded/chemistry , Molecular Dynamics SimulationABSTRACT
Advances in DNA nanotechnology have stimulated the search for simple motifs that can be used to control the properties of DNA nanostructures. One such motif, which has been used extensively in structures such as polyhedral cages, two-dimensional arrays, and ribbons, is a bulged duplex, that is, two helical segments that connect at a bulge loop. We use a coarse-grained model of DNA to characterize such bulged duplexes. We find that this motif can adopt structures belonging to two main classes: one where the stacking of the helices at the center of the system is preserved, the geometry is roughly straight, and the bulge is on one side of the duplex and the other where the stacking at the center is broken, thus allowing this junction to act as a hinge and increasing flexibility. Small loops favor states where stacking at the center of the duplex is preserved, with loop bases either flipped out or incorporated into the duplex. Duplexes with longer loops show more of a tendency to unstack at the bulge and adopt an open structure. The unstacking probability, however, is highest for loops of intermediate lengths, when the rigidity of single-stranded DNA is significant and the loop resists compression. The properties of this basic structural motif clearly correlate with the structural behavior of certain nano-scale objects, where the enhanced flexibility associated with larger bulges has been used to tune the self-assembly product as well as the detailed geometry of the resulting nanostructures. We further demonstrate the role of bulges in determining the structure of a "Z-tile," a basic building block for nanostructures.
Subject(s)
DNA/chemistry , Models, Molecular , Base Pairing , Base Sequence , DNA/genetics , DNA, Single-Stranded/chemistry , DNA, Single-Stranded/geneticsABSTRACT
We introduce an extended version of oxDNA, a coarse-grained model of deoxyribonucleic acid (DNA) designed to capture the thermodynamic, structural, and mechanical properties of single- and double-stranded DNA. By including explicit major and minor grooves and by slightly modifying the coaxial stacking and backbone-backbone interactions, we improve the ability of the model to treat large (kilobase-pair) structures, such as DNA origami, which are sensitive to these geometric features. Further, we extend the model, which was previously parameterised to just one salt concentration ([Na(+)] = 0.5M), so that it can be used for a range of salt concentrations including those corresponding to physiological conditions. Finally, we use new experimental data to parameterise the oxDNA potential so that consecutive adenine bases stack with a different strength to consecutive thymine bases, a feature which allows a more accurate treatment of systems where the flexibility of single-stranded regions is important. We illustrate the new possibilities opened up by the updated model, oxDNA2, by presenting results from simulations of the structure of large DNA objects and by using the model to investigate some salt-dependent properties of DNA.
Subject(s)
DNA/chemistry , Models, Genetic , Salts/chemistry , Elasticity , Fluorescence Resonance Energy Transfer , Molecular Dynamics Simulation , Nucleic Acid Conformation , Static Electricity , Thermodynamics , Transition TemperatureABSTRACT
Although the thermodynamics of DNA hybridization is generally well established, the kinetics of this classic transition is less well understood. Providing such understanding has new urgency because DNA nanotechnology often depends critically on binding rates. Here, we explore DNA oligomer hybridization kinetics using a coarse-grained model. Strand association proceeds through a complex set of intermediate states, with successful binding events initiated by a few metastable base-pairing interactions, followed by zippering of the remaining bonds. But despite reasonably strong interstrand interactions, initial contacts frequently dissociate because typical configurations in which they form differ from typical states of similar enthalpy in the double-stranded equilibrium ensemble. Initial contacts must be stabilized by two or three base pairs before full zippering is likely, resulting in negative effective activation enthalpies. Non-Arrhenius behavior arises because the number of base pairs required for nucleation increases with temperature. In addition, we observe two alternative pathways-pseudoknot and inchworm internal displacement-through which misaligned duplexes can rearrange to form duplexes. These pathways accelerate hybridization. Our results explain why experimentally observed association rates of GC-rich oligomers are higher than rates of AT- rich equivalents, and more generally demonstrate how association rates can be modulated by sequence choice.