Predict DLBCL patients' recurrence within two years with Gaussian mixture model cluster oversampling and multi-kernel learning.
Comput Methods Programs Biomed
; 226: 107103, 2022 Nov.
Article
em En
| MEDLINE
| ID: mdl-36088813
BACKGROUND AND OBJECTIVE: Diffuse large B-cell lymphoma (DLBCL) is common in adults' non-Hodgkin's lymphoma. Relapse mainly occurs within two years after diagnosis and has a poor prognosis. Relapse after two years is less frequent and has a better prognosis. In this work, we constructed a relapse prediction model for diffuse large B-cell lymphoma patients within two years, expecting to provide a reference for Clinicians to implement individualized treatment. METHOD: We propose a secondary-level class imbalance method based on Gaussian mixture model (GMM) clustering resampling to balance the data. Then use a multi-kernel support vector machine(SVM) to inscribe heterogeneous clinical data. Finally, merging them to identify recurrence patients within two years. RESULTS: Among all the class imbalance methods in this work, Inverse Weighted -GMM +SMOTEENN has the best performance. Compared with NO-GMM (Directl use the SMOTEENN without the GMM clustering process), its Area Under the ROC Curve(AUC) increases by 8.75%, and ECE and brier scores decrease 2.07% and 3.09%, respectively. Among the four classification algorithms in this work, Multiple kernel learning (MKL) has the most minimized brier scores and expected calibration error(ECE), the largest AUC, accuracy, Recall, precision and F1, has the best discrimination and calibration. CONCLUSION: Our inverse weighted -GMM+SMOTEENN+MKL (GMM-SENN-MKL) method can handle data class imbalance and clinical heterogeneity data well and can be used to predict recurrence in DLBCL patients.
Palavras-chave
Texto completo:
1
Coleções:
01-internacional
Temas:
Geral
/
Tipos_de_cancer
/
Outros_tipos
Base de dados:
MEDLINE
Assunto principal:
Linfoma Difuso de Grandes Células B
/
Recidiva Local de Neoplasia
Tipo de estudo:
Prognostic_studies
/
Risk_factors_studies
Limite:
Adult
/
Humans
Idioma:
En
Revista:
Comput Methods Programs Biomed
Assunto da revista:
INFORMATICA MEDICA
Ano de publicação:
2022
Tipo de documento:
Article
País de afiliação:
China