RESUMO
Multi-dimensional functional data arises in numerous modern scientific experimental and observational studies. In this article, we focus on longitudinal functional data, a structured form of multidimensional functional data. Operating within a longitudinal functional framework we aim to capture low dimensional interpretable features. We propose a computationally efficient nonparametric Bayesian method to simultaneously smooth observed data, estimate conditional functional means and functional covariance surfaces. Statistical inference is based on Monte Carlo samples from the posterior measure through adaptive blocked Gibbs sampling. Several operative characteristics associated with the proposed modeling framework are assessed comparatively in a simulated environment. We illustrate the application of our work in two case studies. The first case study involves age-specific fertility collected over time for various countries. The second case study is an implicit learning experiment in children with autism spectrum disorder.
Assuntos
Transtorno do Espectro Autista , Teorema de Bayes , Criança , Humanos , Método de Monte CarloRESUMO
Predictions of COVID-19 case growth and mortality are critical to the decisions of political leaders, businesses, and individuals grappling with the pandemic. This predictive task is challenging due to the novelty of the virus, limited data, and dynamic political and societal responses. We embed a Bayesian time series model and a random forest algorithm within an epidemiological compartmental model for empirically grounded COVID-19 predictions. The Bayesian case model fits a location-specific curve to the velocity (first derivative) of the log transformed cumulative case count, borrowing strength across geographic locations and incorporating prior information to obtain a posterior distribution for case trajectories. The compartmental model uses this distribution and predicts deaths using a random forest algorithm trained on COVID-19 data and population-level characteristics, yielding daily projections and interval estimates for cases and deaths in U.S. states. We evaluated the model by training it on progressively longer periods of the pandemic and computing its predictive accuracy over 21-day forecasts. The substantial variation in predicted trajectories and associated uncertainty between states is illustrated by comparing three unique locations: New York, Colorado, and West Virginia. The sophistication and accuracy of this COVID-19 model offer reliable predictions and uncertainty estimates for the current trajectory of the pandemic in the U.S. and provide a platform for future predictions as shifting political and societal responses alter its course.
Assuntos
COVID-19/epidemiologia , COVID-19/mortalidade , Previsões/métodos , Modelos Estatísticos , Pandemias/estatística & dados numéricos , SARS-CoV-2 , Algoritmos , Teorema de Bayes , COVID-19/transmissão , Biologia Computacional , Humanos , Aprendizado de Máquina , Estados Unidos/epidemiologiaRESUMO
Functional brain imaging through electroencephalography (EEG) relies upon the analysis and interpretation of high-dimensional, spatially organized time series. We propose to represent time-localized frequency domain characterizations of EEG data as region-referenced functional data. This representation is coupled with a hierarchical regression modeling approach to multivariate functional observations. Within this familiar setting we discuss how several prior models relate to structural assumptions about multivariate covariance operators. An overarching modeling framework, based on infinite factorial decompositions, is finally proposed to balance flexibility and efficiency in estimation. The motivating application stems from a study of implicit auditory learning, in which typically developing (TD) children, and children with autism spectrum disorder (ASD) were exposed to a continuous speech stream. Using the proposed model, we examine differential band power dynamics as brain function is interrogated throughout the duration of a computer-controlled experiment. Our work offers a novel look at previous findings in psychiatry and provides further insights into the understanding of ASD. Our approach to inference is fully Bayesian and implemented in a highly optimized Rcpp package.
RESUMO
In emerging epidemics, early estimates of key epidemiological characteristics of the disease are critical for guiding public policy. In particular, identifying high-risk population subgroups aids policymakers and health officials in combating the epidemic. This has been challenging during the coronavirus disease 2019 (COVID-19) pandemic because governmental agencies typically release aggregate COVID-19 data as summary statistics of patient demographics. These data may identify disparities in COVID-19 outcomes between broad population subgroups, but do not provide comparisons between more granular population subgroups defined by combinations of multiple demographics. We introduce a method that helps to overcome the limitations of aggregated summary statistics and yields estimates of COVID-19 infection and case fatality rates - key quantities for guiding public policy related to the control and prevention of COVID-19 - for population subgroups across combinations of demographic characteristics. Our approach uses pseudo-likelihood based logistic regression to combine aggregate COVID-19 case and fatality data with population-level demographic survey data to estimate infection and case fatality rates for population subgroups across combinations of demographic characteristics. We illustrate our method on California COVID-19 data to estimate test-based infection and case fatality rates for population subgroups defined by gender, age, and race/ethnicity. Our analysis indicates that in California, males have higher test-based infection rates and test-based case fatality rates across age and race/ethnicity groups, with the gender gap widening with increasing age. Although elderly infected with COVID-19 are at an elevated risk of mortality, the test-based infection rates do not increase monotonically with age. The workforce population, especially, has a higher test-based infection rate than children, adolescents, and other elderly people in their 60-80. LatinX and African Americans have higher test-based infection rates than other race/ethnicity groups. The subgroups with the highest 5 test-based case fatality rates are all-male groups with race as African American, Asian, Multi-race, LatinX, and White, followed by African American females, indicating that African Americans are an especially vulnerable California subpopulation.
Assuntos
COVID-19/epidemiologia , Modelos Logísticos , Adolescente , Adulto , Fatores Etários , Idoso , Idoso de 80 Anos ou mais , COVID-19/mortalidade , California/epidemiologia , California/etnologia , Criança , Etnicidade , Feminino , Inquéritos Epidemiológicos , Humanos , Funções Verossimilhança , Masculino , Pessoa de Meia-Idade , Método de Monte Carlo , Pandemias , Grupos Raciais , Fatores de Risco , SARS-CoV-2/fisiologia , Fatores SexuaisRESUMO
Chronic myelogenous leukemia (CML) is a cancer of the white blood cells that results from increased and uncontrolled growth of myeloid cells in the bone marrow and the accumulation of these cells in the blood. The most common form of treatment for CML is imatinib, a tyrosine kinase inhibitor. Although imatinib is an effective treatment for CML and most patients treated with imatinib do attain some form of remission, imatinib does not completely eradicate all leukemia cells, and if treatment is stopped, all patients eventually relapse (Cortes, 2005). In Kim (2008), the authors developed a mathematical model for the dynamics of CML under imatinib treatment that incorporates the anti-leukemia immune response, and in Paquin (2011), the authors used this mathematical model to study strategic treatment interruptions as a potential therapeutic strategy for CML patients. Although the authors presented the results of several numerical simulations in Paquin (2011), the studies in that work did not include the possibility of imatinib-resistant mutations or an initial population of imatinib-resistant leukemia cells. As resistance is a significant consideration in any drug treatment, it is important to study the efficacy of the strategic treatment interruption plan in the presence of imatinib resistance. In this work, we modify the delay differential equations model of Kim (2008), Paquin (2011) to include the possibility of imatinib resistance, and we analyze strategic treatment interruptions as a potential therapeutic tool in the case of patients with imatinib-resistance leukemia cells.