A Fast and Scalable Implementation Method for Competing Risks Data with the R Package fastcmprsk

Advancements in medical informatics tools and high-throughput biological experimentation make large-scale biomedical data routinely accessible to researchers. Competing risks data are typical in biomedical studies where individuals are at risk to more than one cause (type of event) which can preclude the others from happening. The (Fine and Gray 1999) proportional subdistribution hazards model is a popular and well-appreciated model for competing risks data and is currently implemented in a number of statistical software packages. However, current implementations are not computationally scalable for large-scale competing risks data. We have developed an R package, fastcmprsk, that uses a novel forward-backward scan algorithm to significantly reduce the computational complexity for parameter estimation by exploiting the structure of the subject-specific risk sets. Numerical studies compare the speed and scalability of our implementation to current methods for unpenalized and penalized Fine-Gray regression and show impressive gains in computational efficiency.

Eric S. Kawaguchi (University of Southern California) , Jenny I. Shen (The Lundquist Institute at Harbor-UCLA Medical Center) , Gang Li (University of California, Los Angeles) , Marc A. Suchard (University of California, Los Angeles)
2021-01-14

Supplementary materials

Supplementary materials are available in addition to this article. It can be downloaded at RJ-2021-010.zip

J. P. Fine and R. J. Gray. A proportional hazards model for the subdistribution of a competing risk. Journal of the American Statistical Association, 94(446): 496–509, 1999. DOI 10.1080/01621459.1999.10474144.

References

Reuse

Text and figures are licensed under Creative Commons Attribution CC BY 4.0. The figures that have been reused from other sources don't fall under this license and can be recognized by a note in their caption: "Figure from ...".

Citation

For attribution, please cite this work as

Kawaguchi, et al., "A Fast and Scalable Implementation Method for Competing Risks Data with the R Package fastcmprsk", The R Journal, 2021

BibTeX citation

@article{RJ-2021-010,
  author = {Kawaguchi, Eric S. and Shen, Jenny I. and Li, Gang and Suchard, Marc A.},
  title = {A Fast and Scalable Implementation Method for Competing Risks Data with the R Package fastcmprsk},
  journal = {The R Journal},
  year = {2021},
  note = {https://doi.org/10.32614/RJ-2021-010},
  doi = {10.32614/RJ-2021-010},
  volume = {12},
  issue = {2},
  issn = {2073-4859},
  pages = {43-60}
}