ABSTRACT
BACKGROUND: Given that major depressive disorder (MDD) is both biologically and clinically heterogeneous, a diagnostic system integrating neurobiological markers and clinical characteristics would allow for better diagnostic accuracy and, consequently, treatment efficacy. OBJECTIVE: Our study aimed to evaluate the discriminative and predictive ability of unimodal, bimodal, and multimodal approaches across a total of seven machine learning (ML) models for MDD: three unimodal models (clinical, demographic, and functional near-infrared spectroscopy [fNIRS]), the bimodal combinations of any two of these, and a multimodal model combining all three. METHODS: We recruited 65 adults with MDD and 68 matched healthy controls, who provided sociodemographic and clinical information and completed the HAM-D questionnaire. They also underwent fNIRS measurement while performing a verbal fluency task. Using a nested cross-validation procedure, the classification performance of each ML model was evaluated based on the area under the receiver operating characteristic curve (AUC), balanced accuracy, sensitivity, and specificity. RESULTS: Compared with the unimodal and bimodal models, the multimodal ML model distinguished depressed patients from healthy controls with the highest balanced accuracy, 87.98 ± 8.84% (AUC = 0.92; 95% CI: 0.84-0.99). CONCLUSIONS: Our multimodal ML model demonstrated the highest diagnostic accuracy for MDD. This reinforces the biological and clinical heterogeneity of MDD and highlights the potential of this model to improve MDD diagnosis rates. Furthermore, this model is cost-effective and clinically applicable enough to be established as a robust diagnostic system for MDD based on patients' biosignatures.
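For readers unfamiliar with the four performance measures named in the Methods, the following is a minimal sketch of how they are conventionally defined for a binary classifier. It is not the study's code: the function names, the 0.5 decision threshold, and the toy labels and scores are assumptions made purely for illustration, and the nested cross-validation loop in which the study computed these metrics is not reproduced here.

```python
def classification_metrics(y_true, y_pred):
    """Sensitivity, specificity, and balanced accuracy for binary labels.

    Labels are assumed to be coded 1 = MDD and 0 = healthy control.
    """
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t == 1 and p == 1)  # true positives
    tn = sum(1 for t, p in pairs if t == 0 and p == 0)  # true negatives
    fp = sum(1 for t, p in pairs if t == 0 and p == 1)  # false positives
    fn = sum(1 for t, p in pairs if t == 1 and p == 0)  # false negatives

    sensitivity = tp / (tp + fn)  # proportion of patients correctly detected
    specificity = tn / (tn + fp)  # proportion of controls correctly rejected
    return {
        "sensitivity": sensitivity,
        "specificity": specificity,
        # Balanced accuracy is the mean of sensitivity and specificity.
        "balanced_accuracy": (sensitivity + specificity) / 2,
    }


def roc_auc(y_true, scores):
    """AUC as the probability that a randomly chosen positive case
    receives a higher score than a randomly chosen negative case
    (ties count as one half)."""
    pos = [s for t, s in zip(y_true, scores) if t == 1]
    neg = [s for t, s in zip(y_true, scores) if t == 0]
    wins = sum(
        1.0 if p > n else 0.5 if p == n else 0.0
        for p in pos
        for n in neg
    )
    return wins / (len(pos) * len(neg))


# Made-up toy data, not taken from the study.
y_true = [1, 1, 1, 1, 0, 0, 0, 0]
scores = [0.9, 0.8, 0.4, 0.7, 0.3, 0.6, 0.2, 0.1]
y_pred = [1 if s >= 0.5 else 0 for s in scores]  # assumed 0.5 threshold

metrics = classification_metrics(y_true, y_pred)
auc = roc_auc(y_true, scores)
print(metrics, auc)
```

Balanced accuracy weights the two classes equally, so it remains interpretable when group sizes differ slightly, as with the 65 patients and 68 controls here. In a nested cross-validation setting, such metrics would typically be computed on each outer test fold and then summarised across folds.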