Scalable proximal methods for cause-specific hazard modeling with time-varying coefficients
Wenbo Wu,
Jeremy M. G. Taylor,
Andrew F. Brouwer,
Lingfeng Luo,
Jian Kang,
Hui Jiang and
Kevin He ()
Additional contact information
Wenbo Wu: University of Michigan
Jeremy M. G. Taylor: University of Michigan
Andrew F. Brouwer: University of Michigan
Lingfeng Luo: University of Michigan
Jian Kang: University of Michigan
Hui Jiang: University of Michigan
Kevin He: University of Michigan
Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data, 2022, vol. 28, issue 2, No 2, 194-218
Abstract:
Abstract Survival modeling with time-varying coefficients has proven useful in analyzing time-to-event data with one or more distinct failure types. When studying the cause-specific etiology of breast and prostate cancers using the large-scale data from the Surveillance, Epidemiology, and End Results (SEER) Program, we encountered two major challenges that existing methods for estimating time-varying coefficients cannot tackle. First, these methods, dependent on expanding the original data in a repeated measurement format, result in formidable time and memory consumption as the sample size escalates to over one million. In this case, even a well-configured workstation cannot accommodate their implementations. Second, when the large-scale data under analysis include binary predictors with near-zero variance (e.g., only 0.6% of patients in our SEER prostate cancer data had tumors regional to the lymph nodes), existing methods suffer from numerical instability due to ill-conditioned second-order information. The estimation accuracy deteriorates further with multiple competing risks. To address these issues, we propose a proximal Newton algorithm with a shared-memory parallelization scheme and tests of significance and nonproportionality for the time-varying effects. A simulation study shows that our scalable approach reduces the time and memory costs by orders of magnitude and enjoys improved estimation accuracy compared with alternative approaches. Applications to the SEER cancer data demonstrate the real-world performance of the proximal Newton algorithm.
Keywords: Kronecker product; B-spline; Proximal algorithm; Parallel computing; Breast cancer; Prostate cancer (search for similar items in EconPapers)
Date: 2022
References: View references in EconPapers View complete reference list from CitEc
Citations:
Downloads: (external link)
http://link.springer.com/10.1007/s10985-021-09544-2 Abstract (text/html)
Access to the full text of the articles in this series is restricted.
Related works:
This item may be available elsewhere in EconPapers: Search for items with the same title.
Export reference: BibTeX
RIS (EndNote, ProCite, RefMan)
HTML/Text
Persistent link: https://EconPapers.repec.org/RePEc:spr:lifeda:v:28:y:2022:i:2:d:10.1007_s10985-021-09544-2
Ordering information: This journal article can be ordered from
http://www.springer.com/journal/10985
DOI: 10.1007/s10985-021-09544-2
Access Statistics for this article
Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data is currently edited by Mei-Ling Ting Lee
More articles in Lifetime Data Analysis: An International Journal Devoted to Statistical Methods and Applications for Time-to-Event Data from Springer
Bibliographic data for series maintained by Sonal Shukla () and Springer Nature Abstracting and Indexing ().