Your browser doesn't support javascript.
loading
OSCAR: Optimal subset cardinality regression using the L0-pseudonorm with applications to prognostic modelling of prostate cancer.
Halkola, Anni S; Joki, Kaisa; Mirtti, Tuomas; Mäkelä, Marko M; Aittokallio, Tero; Laajala, Teemu D.
Affiliation
  • Halkola AS; Department of Mathematics and Statistics, University of Turku, Turku, Finland.
  • Joki K; Department of Mathematics and Statistics, University of Turku, Turku, Finland.
  • Mirtti T; Research Program in Systems Oncology, Faculty of Medicine, University of Helsinki, Helsinki, Finland.
  • Mäkelä MM; Department of Pathology, Diagnostic Center, Helsinki University Hospital, Helsinki, Finland.
  • Aittokallio T; Department of Biomedical Engineering, School of Medicine, Emory University, Atlanta, Georgia, United States of America.
  • Laajala TD; Department of Mathematics and Statistics, University of Turku, Turku, Finland.
PLoS Comput Biol ; 19(3): e1010333, 2023 03.
Article in En | MEDLINE | ID: mdl-36897911
ABSTRACT
In many real-world applications, such as those based on electronic health records, prognostic prediction of patient survival is based on heterogeneous sets of clinical laboratory measurements. To address the trade-off between the predictive accuracy of a prognostic model and the costs related to its clinical implementation, we propose an optimized L0-pseudonorm approach to learn sparse solutions in multivariable regression. The model sparsity is maintained by restricting the number of nonzero coefficients in the model with a cardinality constraint, which makes the optimization problem NP-hard. In addition, we generalize the cardinality constraint for grouped feature selection, which makes it possible to identify key sets of predictors that may be measured together in a kit in clinical practice. We demonstrate the operation of our cardinality constraint-based feature subset selection method, named OSCAR, in the context of prognostic prediction of prostate cancer patients, where it enables one to determine the key explanatory predictors at different levels of model sparsity. We further explore how the model sparsity affects the model accuracy and implementation cost. Lastly, we demonstrate generalization of the presented methodology to high-dimensional transcriptomics data.
Subject(s)

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Prostatic Neoplasms / Algorithms Type of study: Prognostic_studies Limits: Humans / Male Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Document type: Article Affiliation country: Finland Publication country: EEUU / ESTADOS UNIDOS / ESTADOS UNIDOS DA AMERICA / EUA / UNITED STATES / UNITED STATES OF AMERICA / US / USA

Full text: 1 Collection: 01-internacional Database: MEDLINE Main subject: Prostatic Neoplasms / Algorithms Type of study: Prognostic_studies Limits: Humans / Male Language: En Journal: PLoS Comput Biol Journal subject: BIOLOGIA / INFORMATICA MEDICA Year: 2023 Document type: Article Affiliation country: Finland Publication country: EEUU / ESTADOS UNIDOS / ESTADOS UNIDOS DA AMERICA / EUA / UNITED STATES / UNITED STATES OF AMERICA / US / USA