Your browser doesn't support javascript.
loading
Mostrar: 20 | 50 | 100
Resultados 1 - 20 de 230
Filtrar
Más filtros

Banco de datos
Tipo del documento
Intervalo de año de publicación
1.
Biostatistics ; 25(3): 919-932, 2024 Jul 01.
Artículo en Inglés | MEDLINE | ID: mdl-38332624

RESUMEN

Mediation analysis with contemporaneously observed multiple mediators is a significant area of causal inference. Recent approaches for multiple mediators are often based on parametric models and thus may suffer from model misspecification. Also, much of the existing literature either only allow estimation of the joint mediation effect or estimate the joint mediation effect just as the sum of individual mediator effects, ignoring the interaction among the mediators. In this article, we propose a novel Bayesian nonparametric method that overcomes the two aforementioned drawbacks. We model the joint distribution of the observed data (outcome, mediators, treatment, and confounders) flexibly using an enriched Dirichlet process mixture with three levels. We use standardization (g-computation) to compute all possible mediation effects, including pairwise and all other possible interaction among the mediators. We thoroughly explore our method via simulations and apply our method to a mental health data from Wisconsin Longitudinal Study, where we estimate how the effect of births from unintended pregnancies on later life mental depression (CES-D) among the mothers is mediated through lack of self-acceptance and autonomy, employment instability, lack of social participation, and increased family stress. Our method identified significant individual mediators, along with some significant pairwise effects.


Asunto(s)
Teorema de Bayes , Humanos , Análisis de Mediación , Femenino , Estudios Longitudinales , Modelos Estadísticos , Salud Mental
2.
Biostatistics ; 2023 Sep 16.
Artículo en Inglés | MEDLINE | ID: mdl-37811675

RESUMEN

We propose a nonparametric compound Poisson model for underreported count data that introduces a latent clustering structure for the reporting probabilities. The latter are estimated with the model's parameters based on experts' opinion and exploiting a proxy for the reporting process. The proposed model is used to estimate the prevalence of chronic kidney disease in Apulia, Italy, based on a unique statistical database covering information on m = 258 municipalities obtained by integrating multisource register information. Accurate prevalence estimates are needed for monitoring, surveillance, and management purposes; yet, counts are deemed to be considerably underreported, especially in some areas of Apulia, one of the most deprived and heterogeneous regions in Italy. Our results agree with previous findings and highlight interesting geographical patterns of the disease. We compare our model to existing approaches in the literature using simulated as well as real data on early neonatal mortality risk in Brazil, described in previous research: the proposed approach proves to be accurate and particularly suitable when partial information about data quality is available.

3.
Biostatistics ; 24(4): 922-944, 2023 10 18.
Artículo en Inglés | MEDLINE | ID: mdl-35657087

RESUMEN

Regional aggregates of health outcomes over delineated administrative units (e.g., states, counties, and zip codes), or areal units, are widely used by epidemiologists to map mortality or incidence rates and capture geographic variation. To capture health disparities over regions, we seek "difference boundaries" that separate neighboring regions with significantly different spatial effects. Matters are more challenging with multiple outcomes over each unit, where we capture dependence among diseases as well as across the areal units. Here, we address multivariate difference boundary detection for correlated diseases. We formulate the problem in terms of Bayesian pairwise multiple comparisons and seek the posterior probabilities of neighboring spatial effects being different. To achieve this, we endow the spatial random effects with a discrete probability law using a class of multivariate areally referenced Dirichlet process models that accommodate spatial and interdisease dependence. We evaluate our method through simulation studies and detect difference boundaries for multiple cancers using data from the Surveillance, Epidemiology, and End Results Program of the National Cancer Institute.


Asunto(s)
Teorema de Bayes , Humanos , Simulación por Computador , Probabilidad , Incidencia
4.
Biometrics ; 80(2)2024 Mar 27.
Artículo en Inglés | MEDLINE | ID: mdl-38640436

RESUMEN

Several epidemiological studies have provided evidence that long-term exposure to fine particulate matter (pm2.5) increases mortality rate. Furthermore, some population characteristics (e.g., age, race, and socioeconomic status) might play a crucial role in understanding vulnerability to air pollution. To inform policy, it is necessary to identify groups of the population that are more or less vulnerable to air pollution. In causal inference literature, the group average treatment effect (GATE) is a distinctive facet of the conditional average treatment effect. This widely employed metric serves to characterize the heterogeneity of a treatment effect based on some population characteristics. In this paper, we introduce a novel Confounder-Dependent Bayesian Mixture Model (CDBMM) to characterize causal effect heterogeneity. More specifically, our method leverages the flexibility of the dependent Dirichlet process to model the distribution of the potential outcomes conditionally to the covariates and the treatment levels, thus enabling us to: (i) identify heterogeneous and mutually exclusive population groups defined by similar GATEs in a data-driven way, and (ii) estimate and characterize the causal effects within each of the identified groups. Through simulations, we demonstrate the effectiveness of our method in uncovering key insights about treatment effects heterogeneity. We apply our method to claims data from Medicare enrollees in Texas. We found six mutually exclusive groups where the causal effects of pm2.5 on mortality rate are heterogeneous.


Asunto(s)
Contaminantes Atmosféricos , Contaminación del Aire , Estados Unidos/epidemiología , Contaminantes Atmosféricos/efectos adversos , Contaminantes Atmosféricos/análisis , Teorema de Bayes , Medicare , Contaminación del Aire/efectos adversos , Contaminación del Aire/análisis , Material Particulado/efectos adversos , Material Particulado/análisis , Exposición a Riesgos Ambientales/efectos adversos
5.
Stat Med ; 43(6): 1135-1152, 2024 Mar 15.
Artículo en Inglés | MEDLINE | ID: mdl-38197220

RESUMEN

The prevalence of chronic non-communicable diseases such as obesity has noticeably increased in the last decade. The study of these diseases in early life is of paramount importance in determining their course in adult life and in supporting clinical interventions. Recently, attention has been drawn to approaches that study the alteration of metabolic pathways in obese children. In this work, we propose a novel joint modeling approach for the analysis of growth biomarkers and metabolite associations, to unveil metabolic pathways related to childhood obesity. Within a Bayesian framework, we flexibly model the temporal evolution of growth trajectories and metabolic associations through the specification of a joint nonparametric random effect distribution, with the main goal of clustering subjects, thus identifying risk sub-groups. Growth profiles as well as patterns of metabolic associations determine the clustering structure. Inclusion of risk factors is straightforward through the specification of a regression term. We demonstrate the proposed approach on data from the Growing Up in Singapore Towards healthy Outcomes cohort study, based in Singapore. Posterior inference is obtained via a tailored MCMC algorithm, involving a nonparametric prior with mixed support. Our analysis has identified potential key pathways in obese children that allow for the exploration of possible molecular mechanisms associated with childhood obesity.


Asunto(s)
Obesidad Infantil , Adulto , Humanos , Niño , Obesidad Infantil/epidemiología , Estudios de Cohortes , Teorema de Bayes , Factores de Riesgo , Biomarcadores
6.
Stat Med ; 43(18): 3432-3446, 2024 Aug 15.
Artículo en Inglés | MEDLINE | ID: mdl-38853284

RESUMEN

Dysphagia, a common result of other medical conditions, is caused by malfunctions in swallowing physiology resulting in difficulty eating and drinking. The Modified Barium Swallow Study (MBSS), the most commonly used diagnostic tool for evaluating dysphagia, can be assessed using the Modified Barium Swallow Impairment Profile (MBSImP™). The MBSImP assessment tool consists of a hierarchical grouped data structure with multiple domains, a set of components within each domain which characterize specific swallowing physiologies, and a set of tasks scored on a discrete scale within each component. We lack sophisticated approaches to extract patterns of physiologic swallowing impairment from the MBSImP task scores within a component while still recognizing the nested structure of components within a domain. We propose a Bayesian hierarchical profile regression model, which uses a Bayesian profile regression model in conjunction with a hierarchical Dirichlet process mixture model to (1) cluster subjects into impairment profile patterns while respecting the hierarchical grouped data structure of the MBSImP, and (2) simultaneously determine associations between latent profile cluster membership for all components and the outcome of dysphagia severity. We apply our approach to a cohort of patients referred for an MBSS and assessed using the MBSImP. Our research results can be used to inform appropriate intervention strategies, and provide tools for clinicians to make better multidimensional management and treatment decisions for patients with dysphagia.


Asunto(s)
Teorema de Bayes , Trastornos de Deglución , Humanos , Análisis de Regresión , Femenino , Modelos Estadísticos , Masculino , Análisis por Conglomerados
7.
BMC Med Inform Decis Mak ; 24(1): 12, 2024 01 08.
Artículo en Inglés | MEDLINE | ID: mdl-38191403

RESUMEN

BACKGROUND: The handling of missing data is a challenge for inference and regression modelling. A particular challenge is dealing with missing predictor information, particularly when trying to build and make predictions from models for use in clinical practice. METHODS: We utilise a flexible Bayesian approach for handling missing predictor information in regression models. This provides practitioners with full posterior predictive distributions for both the missing predictor information (conditional on the observed predictors) and the outcome-of-interest. We apply this approach to a previously proposed counterfactual treatment selection model for type 2 diabetes second-line therapies. Our approach combines a regression model and a Dirichlet process mixture model (DPMM), where the former defines the treatment selection model, and the latter provides a flexible way to model the joint distribution of the predictors. RESULTS: We show that DPMMs can model complex relationships between predictor variables and can provide powerful means of fitting models to incomplete data (under missing-completely-at-random and missing-at-random assumptions). This framework ensures that the posterior distribution for the parameters and the conditional average treatment effect estimates automatically reflect the additional uncertainties associated with missing data due to the hierarchical model structure. We also demonstrate that in the presence of multiple missing predictors, the DPMM model can be used to explore which variable(s), if collected, could provide the most additional information about the likely outcome. CONCLUSIONS: When developing clinical prediction models, DPMMs offer a flexible way to model complex covariate structures and handle missing predictor information. DPMM-based counterfactual prediction models can also provide additional information to support clinical decision-making, including allowing predictions with appropriate uncertainty to be made for individuals with incomplete predictor data.


Asunto(s)
Diabetes Mellitus Tipo 2 , Humanos , Teorema de Bayes , Diabetes Mellitus Tipo 2/tratamiento farmacológico , Toma de Decisiones Clínicas , Incertidumbre
8.
Entropy (Basel) ; 26(4)2024 Apr 14.
Artículo en Inglés | MEDLINE | ID: mdl-38667889

RESUMEN

We consider a constructive definition of the multivariate Pareto that factorizes the random vector into a radial component and an independent angular component. The former follows a univariate Pareto distribution, and the latter is defined on the surface of the positive orthant of the infinity norm unit hypercube. We propose a method for inferring the distribution of the angular component by identifying its support as the limit of the positive orthant of the unit p-norm spheres and introduce a projected gamma family of distributions defined through the normalization of a vector of independent random gammas to the space. This serves to construct a flexible family of distributions obtained as a Dirichlet process mixture of projected gammas. For model assessment, we discuss scoring methods appropriate to distributions on the unit hypercube. In particular, working with the energy score criterion, we develop a kernel metric that produces a proper scoring rule and presents a simulation study to compare different modeling choices using the proposed metric. Using our approach, we describe the dependence structure of extreme values in the integrated vapor transport (IVT), data describing the flow of atmospheric moisture along the coast of California. We find clear but heterogeneous geographical dependence.

9.
Biostatistics ; 24(1): 209-225, 2022 12 12.
Artículo en Inglés | MEDLINE | ID: mdl-34296256

RESUMEN

Across several medical fields, developing an approach for disease classification is an important challenge. The usual procedure is to fit a model for the longitudinal response in the healthy population, a different model for the longitudinal response in the diseased population, and then apply Bayes' theorem to obtain disease probabilities given the responses. Unfortunately, when substantial heterogeneity exists within each population, this type of Bayes classification may perform poorly. In this article, we develop a new approach by fitting a Bayesian nonparametric model for the joint outcome of disease status and longitudinal response, and then we perform classification through the clustering induced by the Dirichlet process. This approach is highly flexible and allows for multiple subpopulations of healthy, diseased, and possibly mixed membership. In addition, we introduce an Markov chain Monte Carlo sampling scheme that facilitates the assessment of the inference and prediction capabilities of our model. Finally, we demonstrate the method by predicting pregnancy outcomes using longitudinal profiles on the human chorionic gonadotropin beta subunit hormone levels in a sample of Chilean women being treated with assisted reproductive therapy.


Asunto(s)
Teorema de Bayes , Femenino , Humanos , Cadenas de Markov , Método de Montecarlo , Análisis por Conglomerados , Probabilidad
10.
Biostatistics ; 23(2): 467-484, 2022 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-32948880

RESUMEN

Heritability analysis plays a central role in quantitative genetics to describe genetic contribution to human complex traits and prioritize downstream analyses under large-scale phenotypes. Existing works largely focus on modeling single phenotype and currently available multivariate phenotypic methods often suffer from scaling and interpretation. In this article, motivated by understanding how genetic underpinning impacts human brain variation, we develop an integrative Bayesian heritability analysis to jointly estimate heritabilities for high-dimensional neuroimaging traits. To induce sparsity and incorporate brain anatomical configuration, we impose hierarchical selection among both regional and local measurements based on brain structural network and voxel dependence. We also use a nonparametric Dirichlet process mixture model to realize grouping among single nucleotide polymorphism-associated phenotypic variations, providing biological plausibility. Through extensive simulations, we show the proposed method outperforms existing ones in heritability estimation and heritable traits selection under various scenarios. We finally apply the method to two large-scale imaging genetics datasets: the Alzheimer's Disease Neuroimaging Initiative and United Kingdom Biobank and show biologically meaningful results.


Asunto(s)
Enfermedad de Alzheimer , Neuroimagen , Enfermedad de Alzheimer/diagnóstico por imagen , Enfermedad de Alzheimer/genética , Teorema de Bayes , Humanos , Neuroimagen/métodos , Fenotipo , Polimorfismo de Nucleótido Simple
11.
Biostatistics ; 23(2): 449-466, 2022 04 13.
Artículo en Inglés | MEDLINE | ID: mdl-32968805

RESUMEN

The study of racial/ethnic inequalities in health is important to reduce the uneven burden of disease. In the case of colorectal cancer (CRC), disparities in survival among non-Hispanic Whites and Blacks are well documented, and mechanisms leading to these disparities need to be studied formally. It has also been established that body mass index (BMI) is a risk factor for developing CRC, and recent literature shows BMI at diagnosis of CRC is associated with survival. Since BMI varies by racial/ethnic group, a question that arises is whether differences in BMI are partially responsible for observed racial/ethnic disparities in survival for CRC patients. This article presents new methodology to quantify the impact of the hypothetical intervention that matches the BMI distribution in the Black population to a potentially complex distributional form observed in the White population on racial/ethnic disparities in survival. Our density mediation approach can be utilized to estimate natural direct and indirect effects in the general causal mediation setting under stronger assumptions. We perform a simulation study that shows our proposed Bayesian density regression approach performs as well as or better than current methodology allowing for a shift in the mean of the distribution only, and that standard practice of categorizing BMI leads to large biases when BMI is a mediator variable. When applied to motivating data from the Cancer Care Outcomes Research and Surveillance (CanCORS) Consortium, our approach suggests the proposed intervention is potentially beneficial for elderly and low-income Black patients, yet harmful for young or high-income Black populations.


Asunto(s)
Neoplasias Colorrectales , Anciano , Teorema de Bayes , Índice de Masa Corporal , Neoplasias Colorrectales/diagnóstico , Humanos , Factores Socioeconómicos , Estados Unidos
12.
Theor Popul Biol ; 151: 28-43, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-37030660

RESUMEN

This work presents a population genetic model of evolution, which includes haploid selection, mutation, recombination, and drift. The mutation-selection equilibrium can be expressed exactly in closed form for arbitrary fitness functions without resorting to diffusion approximations. Tractability is achieved by generating new offspring using n-parent rather than 2-parent recombination. While this enforces linkage equilibrium among offspring, it allows analysis of the whole population under linkage disequilibrium. We derive a general and exact relationship between fitness fluctuations and response to selection. Our assumptions allow analytical calculation of the stationary distribution of the model for a variety of non-trivial fitness functions. These results allow us to speak to genetic architecture, i.e., what stationary distributions result from different fitness functions. This paper presents methods for exactly deriving stationary states for finite and infinite populations. This method can be applied to many fitness functions, and we give exact calculations for four of these. These results allow us to investigate metastability, tradeoffs between fitness functions, and even consider error-correcting codes.


Asunto(s)
Modelos Genéticos , Recombinación Genética , Mutación , Desequilibrio de Ligamiento , Selección Genética
13.
Biometrics ; 79(2): 1370-1382, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-35191539

RESUMEN

Recent advancements in miniaturized fluorescence microscopy have made it possible to investigate neuronal responses to external stimuli in awake behaving animals through the analysis of intracellular calcium signals. An ongoing challenge is deconvolving the temporal signals to extract the spike trains from the noisy calcium signals' time series. In this article, we propose a nested Bayesian finite mixture specification that allows the estimation of spiking activity and, simultaneously, reconstructing the distributions of the calcium transient spikes' amplitudes under different experimental conditions. The proposed model leverages two nested layers of random discrete mixture priors to borrow information between experiments and discover similarities in the distributional patterns of neuronal responses to different stimuli. Furthermore, the spikes' intensity values are also clustered within and between experimental conditions to determine the existence of common (recurring) response amplitudes. Simulation studies and the analysis of a dataset from the Allen Brain Observatory show the effectiveness of the method in clustering and detecting neuronal activities.


Asunto(s)
Encéfalo , Calcio , Animales , Teorema de Bayes , Simulación por Computador , Análisis por Conglomerados
14.
Biometrics ; 79(2): 642-654, 2023 06.
Artículo en Inglés | MEDLINE | ID: mdl-35165892

RESUMEN

An important experimental design problem in early-stage drug discovery is how to prioritize available compounds for testing when very little is known about the target protein. Informer-based ranking (IBR) methods address the prioritization problem when the compounds have provided bioactivity data on other potentially relevant targets. An IBR method selects an informer set of compounds, and then prioritizes the remaining compounds on the basis of new bioactivity experiments performed with the informer set on the target. We formalize the problem as a two-stage decision problem and introduce the Bayes Optimal Informer SEt (BOISE) method for its solution. BOISE leverages a flexible model of the initial bioactivity data, a relevant loss function, and effective computational schemes to resolve the two-step design problem. We evaluate BOISE and compare it to other IBR strategies in two retrospective studies, one on protein-kinase inhibition and the other on anticancer drug sensitivity. In both empirical settings BOISE exhibits better predictive performance than available methods. It also behaves well with missing data, where methods that use matrix completion show worse predictive performance.


Asunto(s)
Descubrimiento de Drogas , Proteínas , Teorema de Bayes , Estudios Retrospectivos , Descubrimiento de Drogas/métodos
15.
Stat Med ; 42(1): 15-32, 2023 01 15.
Artículo en Inglés | MEDLINE | ID: mdl-36317356

RESUMEN

There is a growing interest in current medical research to develop personalized treatments using a molecular-based approach. The broad goal is to implement a more precise and targeted decision-making process, relative to traditional treatments based primarily on clinical diagnoses. Specifically, we consider patients affected by Acute Myeloid Leukemia (AML), an hematological cancer characterized by uncontrolled proliferation of hematopoietic stem cells in the bone marrow. Because AML responds poorly to chemotherapeutic treatments, the development of targeted therapies is essential to improve patients' prospects. In particular, the dataset we analyze contains the levels of proteins involved in cell cycle regulation and linked to the progression of the disease. We evaluate treatment effects within a causal framework represented by a Directed Acyclic Graph (DAG) model, whose vertices are the protein levels in the network. A major obstacle in implementing the above program is represented by individual heterogeneity. We address this issue through a Dirichlet Process (DP) mixture of Gaussian DAG-models where both the graphical structure as well as the allied model parameters are regarded as uncertain. Our procedure determines a clustering structure of the units reflecting the underlying heterogeneity, and produces subject-specific estimates of causal effects based on Bayesian Model Averaging (BMA). With reference to the AML dataset, we identify different effects of protein regulation among individuals; moreover, our method clusters patients into groups that exhibit only mild similarities with traditional categories based on morphological features.


Asunto(s)
Leucemia Mieloide Aguda , Humanos , Teorema de Bayes , Causalidad , Leucemia Mieloide Aguda/etiología , Leucemia Mieloide Aguda/genética , Distribución Normal
16.
Stat Med ; 42(25): 4664-4680, 2023 Nov 10.
Artículo en Inglés | MEDLINE | ID: mdl-37647942

RESUMEN

Functional brain connectivity analysis is an increasingly important technique in neuroscience, psychiatry, and autism research. Functional connectivity can be measured by considering co-activation of brain regions in resting-state functional magnetic resonance imaging (rs-fMRI). We propose a novel Bayesian model to detect differential connections in cross-correlated functional connectivity between region of interest (ROI) pairs. The proposed sparse clustered neighborhood model induces a lower-dimensional sparsity and clustering based on a nonparametric Bayesian approach to model sparse differentially connected ROI pairs. Second, it induces a structured dependence model for modeling potential dependence among ROI pairs. We demonstrate Bayesian inference and performance of the proposed model in simulation studies and compare with a standard model. We utilize the proposed model to contrast functional connectivities between participants with autism spectrum disorder and neurotypical participants using cross-correlated rs-fMRI data from four sites of the Autism Brain Image Data Exchange.

17.
Stat Med ; 42(15): 2661-2691, 2023 07 10.
Artículo en Inglés | MEDLINE | ID: mdl-37037602

RESUMEN

Existing methods for estimating the mean outcome under a given sequential treatment rule often rely on intention-to-treat analyses, which estimate the effect of following a certain treatment rule regardless of compliance behavior of patients. There are two major concerns with intention-to-treat analyses: (1) the estimated effects are often biased toward the null effect; (2) the results are not generalizable and reproducible due to the potentially differential compliance behavior. These are particularly problematic in settings with a high level of non-compliance, such as substance use disorder studies. Our work is motivated by the Adaptive Treatment for Alcohol and Cocaine Dependence study (ENGAGE), which is a multi-stage trial that aimed to construct optimal treatment strategies to engage patients in therapy. Due to the relatively low level of compliance in this trial, intention-to-treat analyses essentially estimate the effect of being randomized to a certain treatment, instead of the actual effect of the treatment. We obviate this challenge by defining the target parameter as the mean outcome under a dynamic treatment regime conditional on a potential compliance stratum. We propose a flexible non-parametric Bayesian approach based on principal stratification, which consists of a Gaussian copula model for the joint distribution of the potential compliances, and a Dirichlet process mixture model for the treatment sequence specific outcomes. We conduct extensive simulation studies which highlight the utility of our approach in the context of multi-stage randomized trials. We show robustness of our estimator to non-linear and non-Gaussian settings as well.


Asunto(s)
Toma de Decisiones , Cooperación del Paciente , Humanos , Teorema de Bayes , Simulación por Computador , Resultado del Tratamiento
18.
Stat Med ; 42(12): 1931-1945, 2023 05 30.
Artículo en Inglés | MEDLINE | ID: mdl-36914221

RESUMEN

The analysis of large-scale datasets, especially in biomedical contexts, frequently involves a principled screening of multiple hypotheses. The celebrated two-group model jointly models the distribution of the test statistics with mixtures of two competing densities, the null and the alternative distributions. We investigate the use of weighted densities and, in particular, non-local densities as working alternative distributions, to enforce separation from the null and thus refine the screening procedure. We show how these weighted alternatives improve various operating characteristics, such as the Bayesian false discovery rate, of the resulting tests for a fixed mixture proportion with respect to a local, unweighted likelihood approach. Parametric and nonparametric model specifications are proposed, along with efficient samplers for posterior inference. By means of a simulation study, we exhibit how our model compares with both well-established and state-of-the-art alternatives in terms of various operating characteristics. Finally, to illustrate the versatility of our method, we conduct three differential expression analyses with publicly-available datasets from genomic studies of heterogeneous nature.


Asunto(s)
Genómica , Humanos , Funciones de Verosimilitud , Teorema de Bayes , Simulación por Computador
19.
Stat Med ; 42(30): 5555-5576, 2023 12 30.
Artículo en Inglés | MEDLINE | ID: mdl-37812818

RESUMEN

Understanding the spatio-temporal patterns of the coronavirus disease 2019 (COVID-19) is essential to construct public health interventions. Spatially referenced data can provide richer opportunities to understand the mechanism of the disease spread compared to the more often encountered aggregated count data. We propose a spatio-temporal Dirichlet process mixture model to analyze confirmed cases of COVID-19 in an urban environment. Our method can detect unobserved cluster centers of the epidemics, and estimate the space-time range of the clusters that are useful to construct a warning system. Furthermore, our model can measure the impact of different types of landmarks in the city, which provides an intuitive explanation of disease spreading sources from different time points. To efficiently capture the temporal dynamics of the disease patterns, we employ a sequential approach that uses the posterior distribution of the parameters for the previous time step as the prior information for the current time step. This approach enables us to incorporate time dependence into our model in a computationally efficient manner without complicating the model structure. We also develop a model assessment by comparing the data with theoretical densities, and outline the goodness-of-fit of our fitted model.


Asunto(s)
COVID-19 , Humanos , Teorema de Bayes , COVID-19/epidemiología , Modelos Estadísticos , Método de Montecarlo , Salud Pública , Análisis Espacio-Temporal
20.
Environmetrics ; 34(1): e2763, 2023 Feb.
Artículo en Inglés | MEDLINE | ID: mdl-37035022

RESUMEN

The relationship between particle exposure and health risks has been well established in recent years. Particulate matter (PM) is made up of different components coming from several sources, which might have different level of toxicity. Hence, identifying these sources is an important task in order to implement effective policies to improve air quality and population health. The problem of identifying sources of particulate pollution has already been studied in the literature. However, current methods require an a priori specification of the number of sources and do not include information on covariates in the source allocations. Here, we propose a novel Bayesian nonparametric approach to overcome these limitations. In particular, we model source contribution using a Dirichlet process as a prior for source profiles, which allows us to estimate the number of components that contribute to particle concentration rather than fixing this number beforehand. To better characterize them we also include meteorological variables (wind speed and direction) as covariates within the allocation process via a flexible Gaussian kernel. We apply the model to apportion particle number size distribution measured near London Gatwick Airport (UK) in 2019. When analyzing this data, we are able to identify the most common PM sources, as well as new sources that have not been identified with the commonly used methods.

SELECCIÓN DE REFERENCIAS
DETALLE DE LA BÚSQUEDA