Your browser doesn't support javascript.
loading
The Distance Between: An Algorithmic Approach to Comparing Stochastic Models to Time-Series Data.
Sherlock, Brock D; Boon, Marko A A; Vlasiou, Maria; Coster, Adelle C F.
Afiliação
  • Sherlock BD; School of Mathematics and Statistics, University of New South Wales, Sydney, NSW, 2052, Australia.
  • Boon MAA; Department of Mathematics and Computer Science, Eindhoven University of Technology, P.O. Box 513, 5600 MB, Eindhoven, The Netherlands.
  • Vlasiou M; Department of Mathematics and Computer Science, Eindhoven University of Technology, P.O. Box 513, 5600 MB, Eindhoven, The Netherlands.
  • Coster ACF; Faculty of Electrical Engineering, Mathematics and Computer Science, University of Twente, P.O. Box 217, 7500 AE, Enschede, The Netherlands.
Bull Math Biol ; 86(9): 111, 2024 Jul 26.
Article em En | MEDLINE | ID: mdl-39060776
ABSTRACT
While mean-field models of cellular operations have identified dominant processes at the macroscopic scale, stochastic models may provide further insight into mechanisms at the molecular scale. In order to identify plausible stochastic models, quantitative comparisons between the models and the experimental data are required. The data for these systems have small sample sizes and time-evolving distributions. The aim of this study is to identify appropriate distance metrics for the quantitative comparison of stochastic model outputs and time-evolving stochastic measurements of a system. We identify distance metrics with features suitable for driving parameter inference, model comparison, and model validation, constrained by data from multiple experimental protocols. In this study, stochastic model outputs are compared to synthetic data across three scales that of the data at the points the system is sampled during the time course of each type of experiment; a combined distance across the time course of each experiment; and a combined distance across all the experiments. Two broad categories of comparators at each point were considered, based on the empirical cumulative distribution function (ECDF) of the data and of the model outputs discrete based measures such as the Kolmogorov-Smirnov distance, and integrated measures such as the Wasserstein-1 distance between the ECDFs. It was found that the discrete based measures were highly sensitive to parameter changes near the synthetic data parameters, but were largely insensitive otherwise, whereas the integrated distances had smoother transitions as the parameters approached the true values. The integrated measures were also found to be robust to noise added to the synthetic data, replicating experimental error. The characteristics of the identified distances provides the basis for the design of an algorithm suitable for fitting stochastic models to real world stochastic data.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Processos Estocásticos / Conceitos Matemáticos / Modelos Biológicos Idioma: En Revista: Bull Math Biol Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Algoritmos / Processos Estocásticos / Conceitos Matemáticos / Modelos Biológicos Idioma: En Revista: Bull Math Biol Ano de publicação: 2024 Tipo de documento: Article