Your browser doesn't support javascript.
loading
Controlled variable selection in Weibull mixture cure models for high-dimensional data.
Fu, Han; Nicolet, Deedra; Mrózek, Krzysztof; Stone, Richard M; Eisfeld, Ann-Kathrin; Byrd, John C; Archer, Kellie J.
  • Fu H; Division of Biostatistics, College of Public Health, The Ohio State University, Columbus, Ohio, USA.
  • Nicolet D; Clara D. Bloomfield Center for Leukemia Outcomes Research, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA.
  • Mrózek K; Alliance Statistics and Data Management Center, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA.
  • Stone RM; Clara D. Bloomfield Center for Leukemia Outcomes Research, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA.
  • Eisfeld AK; Dana-Farber/Partners Cancer, Harvard University, Boston, Massachusetts, USA.
  • Byrd JC; Clara D. Bloomfield Center for Leukemia Outcomes Research, The Ohio State University Comprehensive Cancer Center, Columbus, Ohio, USA.
  • Archer KJ; Department of Internal Medicine, University of Cincinnati, Cincinnati, Ohio, USA.
Stat Med ; 41(22): 4340-4366, 2022 09 30.
Article en En | MEDLINE | ID: mdl-35792553
ABSTRACT
Medical breakthroughs in recent years have led to cures for many diseases. The mixture cure model (MCM) is a type of survival model that is often used when a cured fraction exists. Many have sought to identify genomic features associated with a time-to-event outcome which requires variable selection strategies for high-dimensional spaces. Unfortunately, currently few variable selection methods exist for MCMs especially when there are more predictors than samples. This study develops high-dimensional penalized Weibull MCMs, which allow for identification of prognostic factors associated with both cure status and/or survival. We demonstrated how such models may be estimated using two different iterative algorithms. The model-X knockoffs method was combined with these algorithms to control the false discovery rate (FDR) in variable selection. Through extensive simulation studies, our penalized MCMs have been shown to outperform alternative methods on multiple metrics and achieve high statistical power with FDR being controlled. In an acute myeloid leukemia (AML) application with gene expression data, our proposed approach identified 14 genes associated with potential cure and 12 genes with time-to-relapse, which may help inform treatment decisions for AML patients.
Asunto(s)
Palabras clave

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Proyectos de Investigación / Algoritmos Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Año: 2022 Tipo del documento: Article

Texto completo: 1 Banco de datos: MEDLINE Asunto principal: Proyectos de Investigación / Algoritmos Tipo de estudio: Prognostic_studies / Risk_factors_studies Límite: Humans Idioma: En Año: 2022 Tipo del documento: Article