Your browser doesn't support javascript.
loading
Predict DLBCL patients' recurrence within two years with Gaussian mixture model cluster oversampling and multi-kernel learning.
Xing, Meng; Zhang, Yanbo; Yu, Hongmei; Yang, Zhenhuan; Li, Xueling; Li, Qiong; Zhao, Yanlin; Zhao, Zhiqiang; Luo, Yanhong.
Afiliação
  • Xing M; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Zhang Y; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Yu H; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Yang Z; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Li X; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Li Q; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Zhao Y; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China.
  • Zhao Z; Department of Hematology, Shanxi Cancer Hospital, Taiyuan, China. Electronic address: zqzhao69@163.com.
  • Luo Y; Department of Health Statistics, School of Public Health, Shanxi Medical University, Taiyuan, China; Shanxi Provincial Key Laboratory of Major Diseases Risk Assessment, Taiyuan, China. Electronic address: sxmulyh@163.com.
Comput Methods Programs Biomed ; 226: 107103, 2022 Nov.
Article em En | MEDLINE | ID: mdl-36088813
BACKGROUND AND OBJECTIVE: Diffuse large B-cell lymphoma (DLBCL) is common in adults' non-Hodgkin's lymphoma. Relapse mainly occurs within two years after diagnosis and has a poor prognosis. Relapse after two years is less frequent and has a better prognosis. In this work, we constructed a relapse prediction model for diffuse large B-cell lymphoma patients within two years, expecting to provide a reference for Clinicians to implement individualized treatment. METHOD: We propose a secondary-level class imbalance method based on Gaussian mixture model (GMM) clustering resampling to balance the data. Then use a multi-kernel support vector machine(SVM) to inscribe heterogeneous clinical data. Finally, merging them to identify recurrence patients within two years. RESULTS: Among all the class imbalance methods in this work, Inverse Weighted -GMM +SMOTEENN has the best performance. Compared with NO-GMM (Directl use the SMOTEENN without the GMM clustering process), its Area Under the ROC Curve(AUC) increases by 8.75%, and ECE and brier scores decrease 2.07% and 3.09%, respectively. Among the four classification algorithms in this work, Multiple kernel learning (MKL) has the most minimized brier scores and expected calibration error(ECE), the largest AUC, accuracy, Recall, precision and F1, has the best discrimination and calibration. CONCLUSION: Our inverse weighted -GMM+SMOTEENN+MKL (GMM-SENN-MKL) method can handle data class imbalance and clinical heterogeneity data well and can be used to predict recurrence in DLBCL patients.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Temas: Geral / Tipos_de_cancer / Outros_tipos Base de dados: MEDLINE Assunto principal: Linfoma Difuso de Grandes Células B / Recidiva Local de Neoplasia Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Adult / Humans Idioma: En Revista: Comput Methods Programs Biomed Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China

Texto completo: 1 Coleções: 01-internacional Temas: Geral / Tipos_de_cancer / Outros_tipos Base de dados: MEDLINE Assunto principal: Linfoma Difuso de Grandes Células B / Recidiva Local de Neoplasia Tipo de estudo: Prognostic_studies / Risk_factors_studies Limite: Adult / Humans Idioma: En Revista: Comput Methods Programs Biomed Assunto da revista: INFORMATICA MEDICA Ano de publicação: 2022 Tipo de documento: Article País de afiliação: China