RESUMEN
Genetic regulatory networks (GRNs) regulate the flow of genetic information from the genome to expressed messenger RNAs (mRNAs) and thus are critical to controlling the phenotypic characteristics of cells. Numerous methods exist for profiling mRNA transcript levels and identifying protein-DNA binding interactions at the genome-wide scale. These enable researchers to determine the structure and output of transcriptional regulatory networks, but uncovering the complete structure and regulatory logic of GRNs remains a challenge. The field of GRN inference aims to meet this challenge using computational modeling to derive the structure and logic of GRNs from experimental data and to encode this knowledge in Boolean networks, Bayesian networks, ordinary differential equation (ODE) models, or other modeling frameworks. However, most existing models do not incorporate dynamic transcriptional data since it has historically been less widely available in comparison to "static" transcriptional data. We report the development of an evolutionary algorithm-based ODE modeling approach (named EA) that integrates kinetic transcription data and the theory of attractor matching to infer GRN architecture and regulatory logic. Our method outperformed six leading GRN inference methods, none of which incorporate kinetic transcriptional data, in predicting regulatory connections among TFs when applied to a small-scale engineered synthetic GRN in Saccharomyces cerevisiae. Moreover, we demonstrate the potential of our method to predict unknown transcriptional profiles that would be produced upon genetic perturbation of the GRN governing a two-state cellular phenotypic switch in Candida albicans. We established an iterative refinement strategy to facilitate candidate selection for experimentation; the experimental results in turn provide validation or improvement for the model. In this way, our GRN inference approach can expedite the development of a sophisticated mathematical model that can accurately describe the structure and dynamics of the in vivo GRN.
Asunto(s)
Algoritmos , Redes Reguladoras de Genes , Teorema de Bayes , Redes Reguladoras de Genes/genética , Evolución Biológica , Candida albicans/genética , ARN MensajeroRESUMEN
SUMMARY: pystablemotifs is a Python 3 library for analyzing Boolean networks. Its non-heuristic and exhaustive attractor identification algorithm was previously presented in Rozum et al. (2021). Here, we illustrate its performance improvements over similar methods and discuss how it uses outputs of the attractor identification process to drive a system to one of its attractors from any initial state. We implement six attractor control algorithms, five of which are new in this work. By design, these algorithms can return different control strategies, allowing for synergistic use. We also give a brief overview of the other tools implemented in pystablemotifs. AVAILABILITY AND IMPLEMENTATION: The source code is on GitHub at https://github.com/jcrozum/pystablemotifs/. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Asunto(s)
Algoritmos , Programas Informáticos , Biblioteca de GenesRESUMEN
Biomolecular network dynamics are thought to operate near the critical boundary between ordered and disordered regimes, where large perturbations to a small set of elements neither die out nor spread on average. A biomolecular automaton (e.g., gene, protein) typically has high regulatory redundancy, where small subsets of regulators determine activation via collective canalization. Previous work has shown that effective connectivity, a measure of collective canalization, leads to improved dynamical regime prediction for homogeneous automata networks. We expand this by (i) studying random Boolean networks (RBNs) with heterogeneous in-degree distributions, (ii) considering additional experimentally validated automata network models of biomolecular processes, and (iii) considering new measures of heterogeneity in automata network logic. We found that effective connectivity improves dynamical regime prediction in the models considered; in RBNs, combining effective connectivity with bias entropy further improves the prediction. Our work yields a new understanding of criticality in biomolecular networks that accounts for collective canalization, redundancy, and heterogeneity in the connectivity and logic of their automata models. The strong link we demonstrate between criticality and regulatory redundancy provides a means to modulate the dynamical regime of biochemical networks.
RESUMEN
We present a technique applicable in any dynamical framework to identify control-robust subsets of an interacting system. These robust subsystems, which we call stable modules, are characterized by constraints on the variables that make up the subsystem. They are robust in the sense that if the defining constraints are satisfied at a given time, they remain satisfied for all later times, regardless of what happens in the rest of the system, and can only be broken if the constrained variables are externally manipulated. We identify stable modules as graph structures in an expanded network, which represents causal links between variable constraints. A stable module represents a system "decision point", or trap subspace. Using the expanded network, small stable modules can be composed sequentially to form larger stable modules that describe dynamics on the system level. Collections of large, mutually exclusive stable modules describe the system's repertoire of long-term behaviors. We implement this technique in a broad class of dynamical systems and illustrate its practical utility via examples and algorithmic analysis of two published biological network models. In the segment polarity gene network of Drosophila melanogaster, we obtain a state-space visualization that reproduces by novel means the four possible cell fates and predicts the outcome of cell transplant experiments. In the T-cell signaling network, we identify six signaling elements that determine the high-signal response and show that control of an element connected to them cannot disrupt this response.
Asunto(s)
Modelos Biológicos , Algoritmos , Animales , Tipificación del Cuerpo/genética , Biología Computacional , Drosophila melanogaster/embriología , Drosophila melanogaster/genética , Redes Reguladoras de Genes , Humanos , Receptores de Antígenos de Linfocitos T/inmunología , Transducción de Señal/inmunología , Biología de Sistemas , Linfocitos T/inmunologíaRESUMEN
We consider a dynamic framework frequently used to model gene regulatory and signal transduction networks: monotonic ODEs that are composed of Hill functions. We derive conditions under which activity or inactivity in one system variable induces and sustains activity or inactivity in another. Cycles of such influences correspond to positive feedback loops that are self-sustaining and control-robust, in the sense that these feedback loops "trap" the system in a region of state space from which it cannot exit, even if the other system variables are externally controlled. To demonstrate the utility of this result, we consider prototypical examples of bistability and hysteresis in gene regulatory networks, and analyze a T-cell signal transduction ODE model from the literature.
Asunto(s)
Retroalimentación Fisiológica , Redes Reguladoras de Genes/fisiología , Modelos Biológicos , Animales , Humanos , Transducción de Señal , Linfocitos T/química , Linfocitos T/fisiologíaRESUMEN
Minimum spanning trees and forests are powerful sparsification techniques that remove cycles from weighted graphs to minimize total edge weight while preserving node reachability, with applications in computer science, network science, and graph theory. Despite their utility and ubiquity, they have several limitations, including that they are only defined for undirected networks, they significantly alter dynamics on networks, and they do not generally preserve important network features such as shortest distances, shortest path distribution, and community structure. In contrast, distance backbones, which are subgraphs formed by all edges that obey a generalized triangle inequality, are well defined in directed and undirected graphs and preserve those and other important network features. The backbone of a graph is defined with respect to a specified path-length operator that aggregates weights along a path to define its length, thereby associating a cost to indirect connections. The backbone is the union of all shortest paths between each pair of nodes according to the specified operator. One such operator, the max function, computes the length of a path as the largest weight of the edges that compose it (a weakest link criterion). It is the only operator that yields an algebraic structure for computing shortest paths that is consistent with De Morgan's laws. Applying this operator yields the ultrametric backbone of a graph in that (semi-triangular) edges whose weights are larger than the length of an indirect path connecting the same nodes (i.e. those that break the generalized triangle inequality based on max as a path-length operator) are removed. We show that the ultrametric backbone is the union of minimum spanning forests in undirected graphs and provides a new generalization of minimum spanning trees to directed graphs that, unlike minimum equivalent graphs and minimum spanning arborescences, preserves all max - min shortest paths and De Morgan's law consistency.
RESUMEN
Cellular automata (CA) are discrete dynamical systems with a prominent place in the history and study of artificial life. Here, we focus on the density classification task (DCT) in which a 1-dimensional lattice of Boolean (on/off) automata must perform a form of rudimentary quorum sensing. Typically, the ring lattice consists of 149 cells (though we consider other sizes as well) that update their state according to their own state and its six nearest neighbors in the previous time step. The goal is obtaining Boolean CA rules whose dynamics converges to the majority state of the entire lattice for a given initial configuration of the lattice. This is a nontrivial task because cells have access only to local information, and thus need to integrate and coordinate information across the lattice to converge to the correct collective state. Because initial conditions are random, they have very similar proportions of on and off states, which makes the problem very difficult. This problem has hitherto been studied with the assumption that input to each cell is perfectly stable. Since biological systems that solve similar problems (e.g. bacterial quorum sensing) must operate in noisy environments, here we study the impact of noise on DCT accuracy for the 13 highest-accuracy CA rules from the literature. We use cubewalkers, a recently released GPU-accelerated Boolean simulator to conduct large-scale random experiments. We uncover a trade-off between maximum accuracy without noise and robustness to noise among these high-performance CAs. Moreover, there is no significant difference between rules that were human-designed or evolved computationally.
RESUMEN
Complex living systems are thought to exist at the "edge of chaos" separating the ordered dynamics of robust function from the disordered dynamics of rapid environmental adaptation. Here, a deeper inspection of 72 experimentally supported discrete dynamical models of cell processes reveals previously unobserved order on long time scales, suggesting greater rigidity in these systems than was previously conjectured. We find that propagation of internal perturbations is transient in most cases, and that even when large perturbation cascades persist, their phenotypic effects are often minimal. Moreover, we find evidence that stochasticity and desynchronization can lead to increased recovery from regulatory perturbation cascades. Our analysis relies on new measures that quantify the tendency of perturbations to spread through a discrete dynamical system. Computing these measures was not feasible using current methodology; thus, we developed a multipurpose CUDA-based simulation tool, which we have made available as the open-source Python library cubewalkers. Based on novel measures and simulations, our results suggest that-contrary to current theory-cell processes are ordered and far from the edge of chaos.
RESUMEN
We present new applications of parity inversion and time reversal to the emergence of complex behavior from simple dynamical rules in stochastic discrete models. Our parity-based encoding of causal relationships and time-reversal construction efficiently reveal discrete analogs of stable and unstable manifolds. We demonstrate their predictive power by studying decision-making in systems biology and statistical physics models. These applications underpin a novel attractor identification algorithm implemented for Boolean networks under stochastic dynamics. Its speed enables resolving a long-standing open question of how attractor count in critical random Boolean networks scales with network size and whether the scaling matches biological observations. Via 80-fold improvement in probed network size (N = 16,384), we find the unexpectedly low scaling exponent of 0.12 ± 0.05, approximately one-tenth the analytical upper bound. We demonstrate a general principle: A system's relationship to its time reversal and state-space inversion constrains its repertoire of emergent behaviors.