RESUMEN
ATP-dependent proteases are vital to maintain cellular protein homeostasis. Here, we study the mechanisms of force generation and intersubunit coordination in the ClpXP protease from E. coli to understand how these machines couple ATP hydrolysis to mechanical protein unfolding. Single-molecule analyses reveal that phosphate release is the force-generating step in the ATP-hydrolysis cycle and that ClpXP translocates substrate polypeptides in bursts resulting from highly coordinated conformational changes in two to four ATPase subunits. ClpXP must use its maximum successive firing capacity of four subunits to unfold stable substrates like GFP. The average dwell duration between individual bursts of translocation is constant, regardless of the number of translocating subunits, implying that ClpXP operates with constant "rpm" but uses different "gears."
Asunto(s)
Endopeptidasa Clp/química , Endopeptidasa Clp/metabolismo , Proteínas de Escherichia coli/química , Proteínas de Escherichia coli/metabolismo , Escherichia coli/enzimología , Adenosina Trifosfatasas/metabolismo , Adenosina Trifosfato/análogos & derivados , Adenosina Trifosfato/metabolismo , Escherichia coli/metabolismo , Pinzas Ópticas , Fosfatos/metabolismo , Conformación Proteica , Subunidades de Proteína/química , Subunidades de Proteína/metabolismo , Desplegamiento ProteicoRESUMEN
Superresolution tools, such as PALM and STORM, provide nanoscale localization accuracy by relying on rare photophysical events, limiting these methods to static samples. By contrast, here, we extend superresolution to dynamics without relying on photodynamics by simultaneously determining emitter numbers and their tracks (localization and linking) with the same localization accuracy per frame as widefield superresolution on immobilized emitters under similar imaging conditions (≈50 nm). We demonstrate our Bayesian nonparametric track (BNP-Track) framework on both in cellulo and synthetic data. BNP-Track develops a joint (posterior) distribution that learns and quantifies uncertainty over emitter numbers and their associated tracks propagated from shot noise, camera artifacts, pixelation, background and out-of-focus motion. In doing so, we integrate spatiotemporal information into our distribution, which is otherwise compromised by modularly determining emitter numbers and localizing and linking emitter positions across frames. For this reason, BNP-Track remains accurate in crowding regimens beyond those accessible to other single-particle tracking tools.
Asunto(s)
Teorema de Bayes , Procesamiento de Imagen Asistido por Computador , Procesamiento de Imagen Asistido por Computador/métodos , Algoritmos , Humanos , Microscopía Fluorescente/métodosRESUMEN
Noisy time-series data-from various experiments, including Förster resonance energy transfer, patch clamp, and force spectroscopy, among others-are commonly analyzed with either hidden Markov models or step-finding algorithms, both of which detect discrete transitions. Hidden Markov models, including their extensions to infinite state spaces, inherently assume exponential-or technically geometric-holding time distributions, biasing step locations toward steps with geometric holding times, especially in sparse and/or noisy data. In contrast, existing step-finding algorithms, while free of this restraint, often rely on ad hoc metrics to penalize steps recovered in time traces (by using various information criteria) and otherwise rely on approximate greedy algorithms to identify putative global optima. Here, instead, we devise a robust and general probabilistic (Bayesian) step-finding tool that neither relies on ad hoc metrics to penalize step numbers nor assumes geometric holding times in each state. As the number of steps themselves in a time-series are a priori unknown, we treat these within a Bayesian nonparametric (BNP) paradigm. We find that the method developed, BNP Step (BNP-Step), accurately determines the number and location of transitions between discrete states without any assumed kinetic model and learns the emission distribution characteristic of each state. In doing so, we verify that BNP-Step can analyze sparser data sets containing higher noise and more closely spaced states than otherwise resolved by current state-of-the-art methods. What is more, BNP-Step rigorously propagates measurement uncertainty into uncertainty over state transition locations, numbers, and emission levels as characterized by the posterior. We demonstrate the performance of BNP-Step on both synthetic data as well as data drawn from force spectroscopy experiments.
Asunto(s)
Algoritmos , Teorema de Bayes , Cadenas de Markov , Probabilidad , Factores de TiempoRESUMEN
Accessing information on an underlying network driving a biological process often involves interrupting the process and collecting snapshot data. When snapshot data are stochastic, the data's structure necessitates a probabilistic description to infer underlying reaction networks. As an example, we may imagine wanting to learn gene state networks from the type of data collected in single molecule RNA fluorescence in situ hybridization (RNA-FISH). In the networks we consider, nodes represent network states, and edges represent biochemical reaction rates linking states. Simultaneously estimating the number of nodes and constituent parameters from snapshot data remains a challenging task in part on account of data uncertainty and timescale separations between kinetic parameters mediating the network. While parametric Bayesian methods learn parameters given a network structure (with known node numbers) with rigorously propagated measurement uncertainty, learning the number of nodes and parameters with potentially large timescale separations remain open questions. Here, we propose a Bayesian nonparametric framework and describe a hybrid Bayesian Markov Chain Monte Carlo (MCMC) sampler directly addressing these challenges. In particular, in our hybrid method, Hamiltonian Monte Carlo (HMC) leverages local posterior geometries in inference to explore the parameter space; Adaptive Metropolis Hastings (AMH) learns correlations between plausible parameter sets to efficiently propose probable models; and Parallel Tempering takes into account multiple models simultaneously with tempered information content to augment sampling efficiency. We apply our method to synthetic data mimicking single molecule RNA-FISH, a popular snapshot method in probing transcriptional networks to illustrate the identified challenges inherent to learning dynamical models from these snapshots and how our method addresses them.
Asunto(s)
Algoritmos , ARN , Teorema de Bayes , Hibridación Fluorescente in Situ , Cadenas de Markov , ARN/genética , Método de MontecarloRESUMEN
Exact methods for the exponentiation of matrices of dimension N can be computationally expensive in terms of execution time (N3) and memory requirements (N2), not to mention numerical precision issues. A matrix often exponentiated in the natural sciences is the rate matrix. Here, we explore five methods to exponentiate rate matrices, some of which apply more broadly to other matrix types. Three of the methods leverage a mathematical analogy between computing matrix elements of a matrix exponential process and computing transition probabilities of a dynamical process (technically a Markov jump process, MJP, typically simulated using Gillespie). In doing so, we identify a novel MJP-based method relying on restricting the number of "trajectory" jumps that incurs improved computational scaling. We then discuss this method's downstream implications on mixing properties of Monte Carlo posterior samplers. We also benchmark two other methods of matrix exponentiation valid for any matrix (beyond rate matrices and, more generally, positive definite matrices) related to solving differential equations: Runge-Kutta integrators and Krylov subspace methods. Under conditions where both the largest matrix element and the number of non-vanishing elements scale linearly with N-reasonable conditions for rate matrices often exponentiated-computational time scaling with the most competitive methods (Krylov and one of the MJP-based methods) reduces to N2 with total memory requirements of N.
RESUMEN
Telomeres terminate with a 50-300 bases long single-stranded G-rich overhang, which can be misrecognized as a DNA damage repair site. Shelterin plays critical roles in maintaining and protecting telomere ends by regulating access of various physiological agents to telomeric DNA, but the underlying mechanism is not well understood. Here, we measure how shelterin affects the accessibility of long telomeric overhangs by monitoring transient binding events of a short complementary peptide nucleic acid (PNA) probe using FRET-PAINT in vitro. We observed that the POT1 subunit of shelterin reduces the accessibility of the PNA probe by â¼2.5-fold, indicating that POT1 effectively binds to and protects otherwise exposed telomeric sequences. In comparison, a four-component shelterin stabilizes POT1 binding to the overhang by tethering POT1 to the double-stranded telomeric DNA and reduces the accessibility of telomeric overhangs by â¼5-fold. This enhanced protection suggests shelterin restructures the junction between single and double-stranded telomere, which is otherwise the most accessible part of the telomeric overhang.
Asunto(s)
Complejo Shelterina , Telómero , ADN/metabolismo , Complejo Shelterina/metabolismo , Telómero/genética , Telómero/metabolismo , Proteínas de Unión a Telómeros/metabolismoRESUMEN
Potential energy landscapes are useful models in describing events such as protein folding and binding. While single-molecule fluorescence resonance energy transfer (smFRET) experiments encode information on continuous potentials for the system probed, including rarely visited barriers between putative potential minima, this information is rarely decoded from the data. This is because existing analysis methods often model smFRET output assuming, from the onset, that the system probed evolves in a discretized state space to be analyzed within a hidden Markov model (HMM) paradigm. By contrast, here, we infer continuous potentials from smFRET data without discretely approximating the state space. We do so by operating within a Bayesian nonparametric paradigm by placing priors on the family of all possible potential curves. As our inference accounts for a number of required experimental features raising computational cost (such as incorporating discrete photon shot noise), the framework leverages a structured-kernel-interpolation Gaussian process prior to help curtail computational cost. We show that our structured-kernel-interpolation priors for potential energy reconstruction from smFRET analysis accurately infers the potential energy landscape from a smFRET binding experiment. We then illustrate advantages of structured-kernel-interpolation priors for potential energy reconstruction from smFRET over standard HMM approaches by providing information, such as barrier heights and friction coefficients, that is otherwise inaccessible to HMMs.
Asunto(s)
Transferencia Resonante de Energía de Fluorescencia , Imagen Individual de Molécula , Transferencia Resonante de Energía de Fluorescencia/métodos , Teorema de Bayes , Imagen Individual de Molécula/métodos , FotonesRESUMEN
Fluorescence lifetime imaging captures the spatial distribution of chemical species across cellular environments employing pulsed illumination confocal setups. However, quantitative interpretation of lifetime data continues to face critical challenges. For instance, fluorescent species with known in vitro excited-state lifetimes may split into multiple species with unique lifetimes when introduced into complex living environments. What is more, mixtures of species, which may be both endogenous and introduced into the sample, may exhibit 1) very similar lifetimes as well as 2) wide ranges of lifetimes including lifetimes shorter than the instrumental response function or whose duration may be long enough to be comparable to the interpulse window. By contrast, existing methods of analysis are optimized for well-separated and intermediate lifetimes. Here, we broaden the applicability of fluorescence lifetime analysis by simultaneously treating unknown mixtures of arbitrary lifetimes-outside the intermediate, Goldilocks, zone-for data drawn from a single confocal spot leveraging the tools of Bayesian nonparametrics (BNP). We benchmark our algorithm, termed BNP lifetime analysis, using a range of synthetic and experimental data. Moreover, we show that the BNP lifetime analysis method can distinguish and deduce lifetimes using photon counts as small as 500.
Asunto(s)
Colorantes , Fluorescencia , Teorema de Bayes , Microscopía Fluorescente/métodosRESUMEN
Bdellovibrio bacteriovorus is a predatory bacterium preying upon Gram-negative bacteria. As such, B. bacteriovorus has the potential to control antibiotic-resistant pathogens and biofilm populations. To survive and reproduce, B. bacteriovorus must locate and infect a host cell. However, in the temporary absence of prey, it is largely unknown how B. bacteriovorus modulate their motility patterns in response to physical or chemical environmental cues to optimize their energy expenditure. To investigate B. bacteriovorus' predation strategy, we track and quantify their motion by measuring speed distributions as a function of starvation time. While an initial unimodal speed distribution relaxing to one for pure diffusion at long times may be expected, instead we observe a bimodal speed distribution with one mode centered around that expected from diffusion and the other centered at higher speeds. What is more, for an increasing amount of time over which B. bacteriovorus is starved, we observe a progressive reweighting from the active swimming state to an apparent diffusive state in the speed distribution. Distributions of trajectory-averaged speeds for B. bacteriovorus are largely unimodal, indicating switching between a faster swim speed and an apparent diffusive state within individual observed trajectories rather than there being distinct active swimming and apparent diffusive populations. We also find that B. bacteriovorus' apparent diffusive state is not merely caused by the diffusion of inviable bacteria as subsequent spiking experiments show that bacteria can be resuscitated and bimodality restored. Indeed, starved B. bacteriovorus may modulate the frequency and duration of active swimming as a means of balancing energy consumption and procurement. Our results thus point to a reweighting of the swimming frequency on a trajectory basis rather than a population level basis.
Asunto(s)
Bdellovibrio bacteriovorus , Natación , Señales (Psicología) , Bdellovibrio bacteriovorus/fisiología , Bacterias , BiopelículasRESUMEN
For many classes of biomolecules, population-level heterogeneity is an essential aspect of biological functionâfrom antibodies produced by the immune system to post-translationally modified proteins that regulate cellular processes. However, heterogeneity is difficult to fully characterize for multiple reasons: (i) single-molecule approaches are needed to avoid information lost by ensemble-level averaging, (ii) sufficient statistics must be gathered on both a per-molecule and per-population level, and (iii) a suitable analysis framework is required to make sense of a potentially limited number of intrinsically noisy measurements. Here, we introduce an approach that overcomes these difficulties by combining three techniques: a DNA nanoswitch construct to repeatedly interrogate the same molecule, a benchtop centrifuge force microscope (CFM) to obtain thousands of statistics in a highly parallel manner, and a Bayesian nonparametric (BNP) inference method to resolve separate subpopulations with distinct kinetics. We apply this approach to characterize commercially available antibodies and find that polyclonal antibody from rabbit serum is well-modeled by a mixture of three subpopulations. Our results show how combining a spatially and temporally multiplexed nanoswitch-CFM assay with BNP analysis can help resolve complex biomolecular interactions in heterogeneous samples.
Asunto(s)
Anticuerpos , Nanotecnología , Animales , Humanos , Conejos , Teorema de Bayes , Microscopía de Fuerza Atómica/métodos , Cinética , Centrifugación/métodosRESUMEN
We study the hydrodynamic coupling of neighboring micro-beads placed in a multiple optical trap setup allowing us to precisely control the degree of coupling and directly measure time-dependent trajectories of entrained beads. We performed measurements on configurations with increasing complexity starting with a pair of entrained beads moving in one dimension, then in two dimensions, and finally a triplet of beads moving in two dimensions. The average experimental trajectories of a probe bead compare well with the theoretical computation, illustrating the role of viscous coupling and setting timescales for probe bead relaxation. The findings also provide direct experimental corroborations of hydrodynamic coupling at large, micrometer spatial scales and long, millisecond timescales, of relevance to, e.g., microfluidic device design and hydrodynamic-assisted colloidal assembly, improving the capability of optical tweezers, and understanding the coupling between micrometer-scale objects within a living cell.
RESUMEN
The time spent by a single RNA polymerase (RNAP) at specific locations along the DNA, termed "residence time," reports on the initiation, elongation, and termination stages of transcription. At the single-molecule level, this information can be obtained from dual ultrastable optical trapping experiments, revealing a transcriptional elongation of RNAP interspersed with residence times of variable duration. Successfully discriminating between long and short residence times was used by previous approaches to learn about RNAP's transcription elongation dynamics. Here, we propose an approach based on the Bayesian sticky hidden Markov model that treats all residence times for an Escherichia coli RNAP on an equal footing without a priori discriminating between long and short residence times. Furthermore, our method has two additional advantages: we provide full distributions around key point statistics and directly treat the sequence dependence of RNAP's elongation rate. By applying our approach to experimental data, we find assigned relative probabilities on long versus short residence times, force-dependent average residence time transcription elongation dynamics, â¼10% drop in the average backtracking durations in the presence of GreB, and â¼20% drop in the average residence time as a function of applied force in the presence of RNaseA.
Asunto(s)
Proteínas de Escherichia coli , Teorema de Bayes , ARN Polimerasas Dirigidas por ADN/genética , ARN Polimerasas Dirigidas por ADN/metabolismo , Proteínas de Escherichia coli/genética , Transcripción Genética , Factores de Elongación TranscripcionalRESUMEN
The hidden Markov model (HMM) is a framework for time series analysis widely applied to single-molecule experiments. Although initially developed for applications outside the natural sciences, the HMM has traditionally been used to interpret signals generated by physical systems, such as single molecules, evolving in a discrete state space observed at discrete time levels dictated by the data acquisition rate. Within the HMM framework, transitions between states are modeled as occurring at the end of each data acquisition period and are described using transition probabilities. Yet, whereas measurements are often performed at discrete time levels in the natural sciences, physical systems evolve in continuous time according to transition rates. It then follows that the modeling assumptions underlying the HMM are justified if the transition rates of a physical process from state to state are small as compared to the data acquisition rate. In other words, HMMs apply to slow kinetics. The problem is, because the transition rates are unknown in principle, it is unclear, a priori, whether the HMM applies to a particular system. For this reason, we must generalize HMMs for physical systems, such as single molecules, because these switch between discrete states in "continuous time". We do so by exploiting recent mathematical tools developed in the context of inferring Markov jump processes and propose the hidden Markov jump process. We explicitly show in what limit the hidden Markov jump process reduces to the HMM. Resolving the discrete time discrepancy of the HMM has clear implications: we no longer need to assume that processes, such as molecular events, must occur on timescales slower than data acquisition and can learn transition rates even if these are on the same timescale or otherwise exceed data acquisition rates.
Asunto(s)
Algoritmos , Cinética , Cadenas de Markov , ProbabilidadRESUMEN
Recent studies have shown that the diffusivity of enzymes increases in a substrate-dependent manner during catalysis. Although this observation has been reported and characterized for several different systems, the precise origin of this phenomenon is unknown. Calorimetric methods are often used to determine enthalpies from enzyme-catalysed reactions and can therefore provide important insight into their reaction mechanisms. The ensemble averages involved in traditional bulk calorimetry cannot probe the transient effects that the energy exchanged in a reaction may have on the catalyst. Here we obtain single-molecule fluorescence correlation spectroscopy data and analyse them within the framework of a stochastic theory to demonstrate a mechanistic link between the enhanced diffusion of a single enzyme molecule and the heat released in the reaction. We propose that the heat released during catalysis generates an asymmetric pressure wave that results in a differential stress at the protein-solvent interface that transiently displaces the centre-of-mass of the enzyme (chemoacoustic effect). This novel perspective on how enzymes respond to the energy released during catalysis suggests a possible effect of the heat of reaction on the structural integrity and internal degrees of freedom of the enzyme.
Asunto(s)
Biocatálisis , Difusión , Enzimas/metabolismo , Calor , Fosfatasa Alcalina/metabolismo , Animales , Calorimetría , Catalasa/metabolismo , Dominio Catalítico , Bovinos , Cinética , Saccharomyces cerevisiae/enzimología , Espectrometría de Fluorescencia , Termodinámica , Triosa-Fosfato Isomerasa/metabolismo , Ureasa/metabolismoRESUMEN
Recently, trapped-particle experiments have probed the instantaneous velocity of Brownian motion revealing that, at early times, hydrodynamic history forces dominate Stokes damping. In these experiments, nonuniform particle motion is well described by the Basset-Boussinesq-Oseen (BBO) equation, which captures the unsteady Basset history force at a low Reynolds number. Building off of these results, earlier we showed that, at low temperature, BBO particles could exploit fluid inertia in order to overcome potential barriers (generically modeled as a tilted washboard), while its Langevin counter-part could not. Here, we explore the behavior of neutrally buoyant BBO particles at finite temperature for moderate Stokes damping. Remarkably, we find that the transport of particles injected into a bumpy potential with sufficiently high barriers can be completely quenched at intermediate temperatures, whereas itinerancy may be possible above and below that temperature window. This effect is present for both Langevin and BBO dynamics, though these occur over drastically different temperature ranges. Furthermore, hydrodynamic memory mitigates these effects by sustaining initial particle momentum, even in the difficult intermediate temperature regime.
RESUMEN
Effective forces derived from experimental or in silico molecular dynamics time traces are critical in developing reduced and computationally efficient descriptions of otherwise complex dynamical problems. This helps motivate why it is important to develop methods to efficiently learn effective forces from time series data. A number of methods already exist to do this when data are plentiful but otherwise fail for sparse datasets or datasets where some regions of phase space are undersampled. In addition, any method developed to learn effective forces from time series data should be minimally a priori committal as to the shape of the effective force profile, exploit every data point without reducing data quality through any form of binning or pre-processing, and provide full credible intervals (error bars) about the prediction for the entirety of the effective force curve. Here, we propose a generalization of the Gaussian process, a key tool in Bayesian nonparametric inference and machine learning, which meets all of the above criteria in learning effective forces for the first time.
RESUMEN
Super-resolution microscopy provides direct insight into fundamental biological processes occurring at length scales smaller than light's diffraction limit. The analysis of data at such scales has brought statistical and machine learning methods into the mainstream. Here we provide a survey of data analysis methods starting from an overview of basic statistical techniques underlying the analysis of super-resolution and, more broadly, imaging data. We subsequently break down the analysis of super-resolution data into four problems: the localization problem, the counting problem, the linking problem, and what we've termed the interpretation problem.
RESUMEN
One way to achieve spatial resolution using fluorescence imaging-and track single molecules-is to use wide-field illumination and collect measurements over multiple sensors (camera pixels). Here we propose another way that uses confocal measurements and a single sensor. Traditionally, confocal microscopy has been used to achieve high temporal resolution at the expense of spatial resolution. This is because it utilizes very few, and commonly just one, sensors to collect data. Yet confocal data encode spatial information. Here we show that non-uniformities in the shape of the confocal excitation volume can be exploited to achieve spatial resolution. To achieve this, we formulate a specialized hidden Markov model and adapt a forward filtering-backward sampling Markov chain Monte Carlo scheme to efficiently handle molecular motion within a symmetric confocal volume characteristically used in fluorescence correlation spectroscopy. Our method can be used for single confocal volume applications or incorporated into larger computational schemes for specialized, multi-confocal volume, optical setups.
RESUMEN
We obtain a numerical solution of the equation for the synchronous unsteady motion of two spherical vesicles in incompressible viscous fluid in the presence of both Stokes drag and hydrodynamics memory. We find that for a given amount of work performed, the final distance traveled by each vesicle is increased by the presence of the other vesicle moving in the same direction. The result suggests that the unsteady transport of the vesicles by molecular motors in vivo may be facilitated due to an effective hydrodynamic interaction between the neighboring vesicles.
RESUMEN
Extracellular DNA is engulfed by innate immune cells and digested by endosomal DNaseâ II to generate an immune response. Quantitative information on endosomal stage-specific cargo processing is a critical parameter to predict and model the innate immune response. Biochemical assays quantify endosomal processing but lack organelle-specific information, while fluorescence microscopy has provided the latter without the former. Herein, we report a single molecule counting method based on fluorescence imaging that quantitatively maps endosomal processing of cargo DNA in innate immune cells with organelle-specific resolution. Our studies reveal that endosomal DNA degradation occurs mainly in lysosomes and is negligible in late endosomes. This method can be used to study cargo processing in diverse endocytic pathways and measure stage-specific activity of processing factors in endosomes.