Your browser doesn't support javascript.
loading
Novel ensemble feature selection techniques applied to high-grade gastroenteropancreatic neuroendocrine neoplasms for the prediction of survival.
Jenul, Anna; Stokmo, Henning Langen; Schrunner, Stefan; Hjortland, Geir Olav; Revheim, Mona-Elisabeth; Tomic, Oliver.
Afiliação
  • Jenul A; Department of Data Science, Norwegian University of Life Sciences, Universitetstunet 3, 1433 Ås, Norway. Electronic address: anna.jenul@nmbu.no.
  • Stokmo HL; Department of Nuclear Medicine, Division of Radiology and Nuclear Medicine, Oslo University Hospital, Oslo, Norway; Institute of Clinical Medicine, University of Oslo, Oslo, Norway. Electronic address: h.l.stokmo@studmed.uio.no.
  • Schrunner S; Department of Data Science, Norwegian University of Life Sciences, Universitetstunet 3, 1433 Ås, Norway. Electronic address: stefan.schrunner@nmbu.no.
  • Hjortland GO; Department of Oncology, Oslo University Hospital, Oslo, Norway. Electronic address: goo@ous-hf.no.
  • Revheim ME; Department of Nuclear Medicine, Division of Radiology and Nuclear Medicine, Oslo University Hospital, Oslo, Norway; Institute of Clinical Medicine, University of Oslo, Oslo, Norway; The Intervention Centre, Division of Technology and Innovation, Oslo University Hospital, Oslo, Norway. Electronic add
  • Tomic O; Department of Data Science, Norwegian University of Life Sciences, Universitetstunet 3, 1433 Ås, Norway. Electronic address: oliver.tomic@nmbu.no.
Comput Methods Programs Biomed ; 244: 107934, 2024 Feb.
Article em En | MEDLINE | ID: mdl-38016391
ABSTRACT
BACKGROUND AND

OBJECTIVE:

Determining the most informative features for predicting the overall survival of patients diagnosed with high-grade gastroenteropancreatic neuroendocrine neoplasms is crucial to improve individual treatment plans for patients, as well as the biological understanding of the disease. The main objective of this study is to evaluate the use of modern ensemble feature selection techniques for this purpose with respect to (a) quantitative performance measures such as predictive performance, (b) clinical interpretability, and (c) the effect of integrating prior expert knowledge.

METHODS:

The Repeated Elastic Net Technique for Feature Selection (RENT) and the User-Guided Bayesian Framework for Feature Selection (UBayFS) are recently developed ensemble feature selectors investigated in this work. Both allow the user to identify informative features in datasets with low sample sizes and focus on model interpretability. While RENT is purely data-driven, UBayFS can integrate expert knowledge a priori in the feature selection process. In this work, we compare both feature selectors on a dataset comprising 63 patients and 110 features from multiple sources, including baseline patient characteristics, baseline blood values, tumor histology, imaging, and treatment information.

RESULTS:

Our experiments involve data-driven and expert-driven setups, as well as combinations of both. In a five-fold cross-validated experiment without expert knowledge, our results demonstrate that both feature selectors allow accurate predictions A reduction from 110 to approximately 20 features (around 82%) delivers near-optimal predictive performances with minor variations according to the choice of the feature selector, the predictive model, and the fold. Thereafter, we use findings from clinical literature as a source of expert knowledge. In addition, expert knowledge has a stabilizing effect on the feature set (an increase in stability of approximately 40%), while the impact on predictive performance is limited.

CONCLUSIONS:

The features WHO Performance Status, Albumin, Platelets, Ki-67, Tumor Morphology, Total MTV, Total TLG, and SUVmax are the most stable and predictive features in our study. Overall, this study demonstrated the practical value of feature selection in medical applications not only to improve quantitative performance but also to deliver potentially new insights to experts.
Assuntos
Palavras-chave

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article

Texto completo: 1 Base de dados: MEDLINE Assunto principal: Neoplasias Limite: Humans Idioma: En Ano de publicação: 2024 Tipo de documento: Article