RESUMEN
Comorbidity is widespread in the ageing population, implying multiple and complex medical needs for individuals and a public health burden. Determining risk factors and predicting comorbidity development can help identify at-risk subjects and design prevention strategies. Using socio-demographic and clinical data from approximately 11,000 subjects monitored over 11 years in the English Longitudinal Study of Ageing, we develop a dynamic Bayesian network (DBN) to model the onset and interaction of three cardio-metabolic comorbidities, namely type 2 diabetes (T2D), hypertension, and heart problems. The DBN allows us to identify risk factors for developing each morbidity, simulate ageing progression over time, and stratify the population based on the risk of outcome occurrence. By applying hierarchical agglomerative clustering to the simulated, dynamic risk of experiencing morbidities, we identified patients with similar risk patterns and the variables contributing to their discrimination. The network reveals a direct joint effect of biomarkers and lifestyle on outcomes over time, such as the impact of fasting glucose, HbA1c, and BMI on T2D development. Mediated cross-relationships between comorbidities also emerge, showcasing the interconnected nature of these health issues. The model presents good calibration and discrimination ability, particularly in predicting the onset of T2D (iAUC-ROC = 0.828, iAUC-PR = 0.294) and survival (iAUC-ROC = 0.827, iAUC-PR = 0.311). Stratification analysis unveils two distinct clusters for all comorbidities, effectively discriminated by variables like HbA1c for T2D and age at baseline for heart problems. The developed DBN constitutes an effective, highly-explainable predictive risk tool for simulating and stratifying the dynamic risk of developing cardio-metabolic comorbidities. Its use could help identify the effects of risk factors and develop health policies that prevent the occurrence of comorbidities.
Asunto(s)
Envejecimiento , Teorema de Bayes , Comorbilidad , Diabetes Mellitus Tipo 2 , Modelos Estadísticos , Humanos , Diabetes Mellitus Tipo 2/epidemiología , Femenino , Masculino , Anciano , Persona de Mediana Edad , Estudios Longitudinales , Factores de Riesgo , Hipertensión/epidemiología , Adulto , Anciano de 80 o más Años , Cardiopatías/epidemiologíaRESUMEN
BACKGROUND AND OBJECTIVE: In type 1 diabetes (T1D), a quantitative evaluation of the impact on hypoglycemia of suboptimal therapeutic decision (e.g. incorrect estimation of the ingested carbohydrates, inaccurate insulin timing, etc) is unavailable. Clinical trials to measure sensitivity to patient actions would be expensive, exposed to confounding factors and risky for the participants. In this work, a T1D patient decision simulator (T1D-PDS), realistically reproducing blood glucose dynamics in a large virtual population, is used to perform extensive in-silico trials and the so-derived data employed to implement a sensitivity analysis that ranks different behavioral factors based on their impact on a clinically meaningful parameter, the time below range (TBR). METHODS: Eleven behavioral factors impacting on hypoglycemia are considered. The T1D-PDS was used to perform multiple 2-week simulations involving 100 adults, by testing about 3500 different perturbations for nominal behavior. A local linear approximation of the function linking the TBR and the factors were computed to derive sensitivity indices (SIs), quantifying the impact of each factor on TBR variations. RESULTS: The obtained ranking quantifies importance of factors w.r.t. the others. Factors apparently related to hypoglycemia were correctly placed on the top of the ranking, including systematic (SI=2.05%) and random (SI=1.35%) carb-counting error, hypotreatment dose (SI=-1.21%), insulin bolus time w.r.t. mealtime (SI=1.09%). CONCLUSIONS: The obtained SIs allowed to rank behavioral factors based on their impact on TBR. The behavioral factors identified as most influential can be prioritized in patient training.
Asunto(s)
Diabetes Mellitus Tipo 1 , Hipoglucemia , Adulto , Humanos , Diabetes Mellitus Tipo 1/tratamiento farmacológico , Hipoglucemiantes , Automonitorización de la Glucosa Sanguínea , Hipoglucemia/tratamiento farmacológico , Insulina , GlucemiaRESUMEN
BACKGROUND: The ageing global population presents significant public health challenges, especially in relation to the subjective wellbeing of the elderly. In this study, our aim was to investigate the potential for developing a model to forecast the two-year variation of the perceived wellbeing of individuals aged over 50. We also aimed to identify the variables that predict changes in subjective wellbeing, as measured by the CASP-12 scale, over a two-year period. METHODS: Data from the European SHARE project were used, specifically the demographic, health, social and financial variables of 9422 subjects. The subjective wellbeing was measured through the CASP-12 scale. The study outcome was defined as binary, i.e., worsening/not worsening of the variation of CASP-12 in 2 years. Logistic regression, logistic regression with LASSO regularisation, and random forest were considered candidate models. Performance was assessed in terms of accuracy in correctly predicting the outcome, Area Under the Curve (AUC), and F1 score. RESULTS: The best-performing model was the random forest, achieving an accuracy of 65%, AUC = 0.659, and F1 = 0.710. All models proved to be able to generalise both across subjects and over time. The most predictive variables were the CASP-12 score at baseline, the presence of depression and financial difficulties. CONCLUSIONS: While we identify the random forest model as the more suitable, given the similarity of performance, the models based on logistic regression or on logistic regression with LASSO regularisation are also possible options.
Asunto(s)
Envejecimiento , Aprendizaje Automático , Humanos , Anciano , Persona de Mediana Edad , Predicción , Modelos Logísticos , Bosques AleatoriosRESUMEN
BACKGROUND: Amyotrophic Lateral Sclerosis (ALS) is a fatal neurodegenerative disorder characterised by the progressive loss of motor neurons in the brain and spinal cord. The fact that ALS's disease course is highly heterogeneous, and its determinants not fully known, combined with ALS's relatively low prevalence, renders the successful application of artificial intelligence (AI) techniques particularly arduous. OBJECTIVE: This systematic review aims at identifying areas of agreement and unanswered questions regarding two notable applications of AI in ALS, namely the automatic, data-driven stratification of patients according to their phenotype, and the prediction of ALS progression. Differently from previous works, this review is focused on the methodological landscape of AI in ALS. METHODS: We conducted a systematic search of the Scopus and PubMed databases, looking for studies on data-driven stratification methods based on unsupervised techniques resulting in (A) automatic group discovery or (B) a transformation of the feature space allowing patient subgroups to be identified; and for studies on internally or externally validated methods for the prediction of ALS progression. We described the selected studies according to the following characteristics, when applicable: variables used, methodology, splitting criteria and number of groups, prediction outcomes, validation schemes, and metrics. RESULTS: Of the starting 1604 unique reports (2837 combined hits between Scopus and PubMed), 239 were selected for thorough screening, leading to the inclusion of 15 studies on patient stratification, 28 on prediction of ALS progression, and 6 on both stratification and prediction. In terms of variables used, most stratification and prediction studies included demographics and features derived from the ALSFRS or ALSFRS-R scores, which were also the main prediction targets. The most represented stratification methods were K-means, and hierarchical and expectation-maximisation clustering; while random forests, logistic regression, the Cox proportional hazard model, and various flavours of deep learning were the most widely used prediction methods. Predictive model validation was, albeit unexpectedly, quite rarely performed in absolute terms (leading to the exclusion of 78 eligible studies), with the overwhelming majority of included studies resorting to internal validation only. CONCLUSION: This systematic review highlighted a general agreement in terms of input variable selection for both stratification and prediction of ALS progression, and in terms of prediction targets. A striking lack of validated models emerged, as well as a general difficulty in reproducing many published studies, mainly due to the absence of the corresponding parameter lists. While deep learning seems promising for prediction applications, its superiority with respect to traditional methods has not been established; there is, instead, ample room for its application in the subfield of patient stratification. Finally, an open question remains on the role of new environmental and behavioural variables collected via novel, real-time sensors.
Asunto(s)
Esclerosis Amiotrófica Lateral , Humanos , Esclerosis Amiotrófica Lateral/diagnóstico , Inteligencia Artificial , Encéfalo , Análisis por Conglomerados , Bases de Datos FactualesRESUMEN
BACKGROUND: In the management of type 1 diabetes (T1D), systematic and random errors in carb-counting can have an adverse effect on glycemic control. In this study, we performed an in silico trial aiming at quantifying the impact of different levels of carb-counting error on glycemic control. METHODS: The T1D patient decision simulator was used to simulate 7-day glycemic profiles of 100 adults using open-loop therapy. The simulation was repeated for different values of systematic and random carb-counting errors, generated with Gaussian distribution varying the error mean from -10% to +10% and standard deviation (SD) from 0% to 50%. The effect of the error was evaluated by computing the difference of time inside (∆TIR), above (∆TAR) and below (∆TBR) the target glycemic range (70-180mg/dl) compared to the reference case, that is, absence of error. Finally, 3 linear regression models were developed to mathematically describe how error mean and SD variations result in ∆TIR, ∆TAR, and ∆TBR changes. RESULTS: Random errors globally deteriorate the glycemic control; systematic underestimations lead to, on average, up to 5.2% more TAR than the reference case, while systematic overestimation results in up to 0.8% more TBR. The different time in range metrics were linearly related with error mean and SD (R2>0.95), with slopes of ßMEAN=0.21, ßSD=-0.07 for ∆TIR, ßMEAN=-0.25, ßSD=+0.06 for ∆TAR, and ßMEAN=0.05, ßSD=+0.01 for ∆TBR. CONCLUSIONS: The quantification of carb-counting error impact performed in this work may be useful understanding causes of glycemic variability and the impact of possible therapy adjustments or behavior changes in different glucose metrics.
Asunto(s)
Diabetes Mellitus Tipo 1 , Adulto , Humanos , Diabetes Mellitus Tipo 1/terapia , Control Glucémico , Glucemia , Automonitorización de la Glucosa SanguíneaRESUMEN
Background: The error in estimating meal carbohydrates (CHO) amount is a critical mistake committed by type 1 diabetes (T1D) subjects. The aim of this study is both to investigate which factors, related to meals and subjects, affect the CHO counting error most and to develop a mathematical model of CHO counting error embeddable in T1D patient decision simulators to conduct in silico clinical trials. Methods: A published dataset of 50 T1D adults is used, which includes a patient's CHO count of 692 meals, dietitian's estimates of meal composition (used as reference), and several potential explanatory factors. The CHO counting error is modeled by multiple linear regression, with stepwise variable selection starting from 10 candidate predictors, that is, education level, insulin treatment duration, age, body weight, meal type, CHO, lipid, energy, protein, and fiber content. Inclusion of quadratic and interaction terms is also evaluated. Results: Larger errors correspond to larger meals, and most of the large meals are underestimated. The linear model selects CHO (P < 0.00001), meal type (P < 0.00001), and body weight (P = 0.047), whereas its extended version embeds a quadratic term of CHO (P < 0.00001) and interaction terms of meal type with CHO (P = 0.0001) and fiber amount (P = 0.001). The extended model explains 34.9% of the CHO counting error variance. Comparison with the CHO counting error description previously used in the T1D patient decision simulator shows that the proposed models return more credible realizations. Conclusions: The most important predictors of CHO counting errors are CHO and meal type. The mathematical models proposed improve the description of patients' behavior in the T1D patient decision simulator.