RESUMEN
Sedentary behavior is considered as a major public health challenge, linked with many chronic diseases and premature mortality. In this paper, we propose a steps counting -based machine learning approach for the prediction of sedentary behavior. Our work focuses on analyzing historical data from multiple users of wearable physical activity trackers and exploring the performance of four machine learning algorithms, i.e., Logistic Regression, Random Forest, XGBoost, Convolutional Neural Networks, as well as a Majority Vote Ensemble of the algorithms. To train and test our models we employed a crowd sourced dataset containing a month's data of 33 users. For further evaluation, we employed a dataset containing 6 months of data of an additional user. The results revealed that while all models succeed in predicting next-day sedentary behavior, the ensemble model outperforms all baselines, as it manages to predict sedentary behavior and reduce false positives more effectively. On the multi-subjects test dataset, our ensemble model achieved an accuracy of 82.12% with a sensitivity of 74.53% and a specificity of 85.71%. On the additional unseen dataset, we achieved 76.88% in accuracy, 63.27% in sensitivity and 81.75% in specificity. These outcomes provide the ground towards the development of real-life artificially intelligent systems for sedentary behavior prediction.
Asunto(s)
Aprendizaje Automático , Conducta Sedentaria , Algoritmos , Humanos , Modelos Logísticos , Redes Neurales de la ComputaciónRESUMEN
Depressive disorder (DD) is a mental illness affecting more than 300 million people worldwide, whereas social stigma and subtle, variant symptoms impede diagnosis. Psychomotor retardation is a common component of DD with a negative impact on motor function, usually reflected on patients' routine activities, including, nowadays, their interaction with mobile devices. Therefore, such interactions constitute an enticing source of information towards unsupervised screening for DD symptoms in daily life. In this vein, this paper proposes a machine learning-based method for discriminating between subjects with depressive tendency and healthy controls, as denoted by self-reported Patient Health Questionnaire-9 (PHQ-9) compound scores, based on typing patterns captured in-the-wild. The latter consisted of keystroke timing sequences and typing metadata, passively collected during natural typing on touchscreen smartphones by 11/14 subjects with/without depressive tendency. Statistical features were extracted and tested in univariate and multivariate classification pipelines to reach a decision on subjects' status. The best-performing pipeline achieved an AUC = 0.89 (0.72-1.00; 95% Confidence Interval) and 0.82/0.86 sensitivity/specificity, with the outputted probabilities significantly correlating (>0.60) with the respective PHQ-9 scores. This work adds to the findings of previous research associating typing patterns with psycho-motor impairment and contributes to the development of an unobtrusive, high-frequency monitoring of depressive tendency in everyday living.