Your browser doesn't support javascript.
loading
Scalable proximal methods for cause-specific hazard modeling with time-varying coefficients.
Wu, Wenbo; Taylor, Jeremy M G; Brouwer, Andrew F; Luo, Lingfeng; Kang, Jian; Jiang, Hui; He, Kevin.
Afiliação
  • Wu W; Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA.
  • Taylor JMG; Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA.
  • Brouwer AF; Department of Epidemiology, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA.
  • Luo L; Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA.
  • Kang J; Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA.
  • Jiang H; Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA.
  • He K; Department of Biostatistics, University of Michigan, 1420 Washington Heights, Ann Arbor, MI, 48109-2029, USA. kevinhe@umich.edu.
Lifetime Data Anal ; 28(2): 194-218, 2022 04.
Article em En | MEDLINE | ID: mdl-35092553
Survival modeling with time-varying coefficients has proven useful in analyzing time-to-event data with one or more distinct failure types. When studying the cause-specific etiology of breast and prostate cancers using the large-scale data from the Surveillance, Epidemiology, and End Results (SEER) Program, we encountered two major challenges that existing methods for estimating time-varying coefficients cannot tackle. First, these methods, dependent on expanding the original data in a repeated measurement format, result in formidable time and memory consumption as the sample size escalates to over one million. In this case, even a well-configured workstation cannot accommodate their implementations. Second, when the large-scale data under analysis include binary predictors with near-zero variance (e.g., only 0.6% of patients in our SEER prostate cancer data had tumors regional to the lymph nodes), existing methods suffer from numerical instability due to ill-conditioned second-order information. The estimation accuracy deteriorates further with multiple competing risks. To address these issues, we propose a proximal Newton algorithm with a shared-memory parallelization scheme and tests of significance and nonproportionality for the time-varying effects. A simulation study shows that our scalable approach reduces the time and memory costs by orders of magnitude and enjoys improved estimation accuracy compared with alternative approaches. Applications to the SEER cancer data demonstrate the real-world performance of the proximal Newton algorithm.
Assuntos
Palavras-chave

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Neoplasias da Próstata Tipo de estudo: Prognostic_studies Limite: Humans / Male Idioma: En Revista: Lifetime Data Anal Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Estados Unidos

Texto completo: 1 Coleções: 01-internacional Base de dados: MEDLINE Assunto principal: Neoplasias da Próstata Tipo de estudo: Prognostic_studies Limite: Humans / Male Idioma: En Revista: Lifetime Data Anal Ano de publicação: 2022 Tipo de documento: Article País de afiliação: Estados Unidos