Learn R Programming

LTRCforests (version 0.5.5)

LTRCforests-package: Constructs forest methods for left-truncated and right-censored (LTRC) survival data

Description

Constructs a LTRC conditional inference forest (LTRCCIF) or a LTRC relative risk forest (LTRCRRF) for left-truncated and right-censored data, it also allows for (left-truncated) right-censored survival data with time-varying covariates. The main functions of this package are ltrccif and ltrcrrf.

Arguments

Details

Problem setup and existing methods

Continuous-time survival data with time-varying covariates are common in practice. Methods like the Cox proportional hazards model rely on restrictive assumptions such as proportional hazards and a log-linear relationship between the hazard function and covariates. Furthermore, because these methods are often parametric, nonlinear effects of variables must be modeled by transformations or expanding the design matrix to include specialized basis functions for more complex data structures in real world applications. The functions LTRCIT and LTRCART provide a conditional inference tree method and a relative risk tree method for left-truncated right-censored survival data, which also allows for right-censored survival data with time-varying covariates. Tree estimators are nonparametric and as such often exhibit low bias and high variance. Ensemble methods like bagging and random forest can reduce variance while preserving low bias. The most popular survival forest methods, including conditional inference forest (see cforest), relative risk forest, and random survival forest method (see rfsrc) can only be applied to right-censored survival data with time-invariant covariates.

LTRC forests

This package implements ltrccif and ltrcrrf. ltrccif extends the conditional inference forest (see cforest) to LTRC survival data. It uses LTRC conditional inference survival trees as base learners. ltrcrrf extends the relative risk forest (Ishwaran et al. 2004) to left-truncated right-censored survival data. It uses LTRC risk relative tree as base learners. The main functions ltrccif and ltrcrrf fit a corresponding LTRC forest for LTRC data, with parameter mtry tuned by tune.ltrccif or tune.ltrcrrf. This tuning procedure relies on the evaluation of the out-of-bag errors, which is performed by the function sbrier_ltrc. print prints summary output for ltrccif objects and ltrcrrf objects. predictProb constructs survival function estimates for ltrccif objects and ltrcrrf objects.

For (left-truncated) right-censored survival data with time-varying covariates, one can first reformat the data structure to one with LTRC observations, where the multiple records of a subject become a list of pseudo-subjects and are treated independently. This procedure is usually referred to as the Andersen-Gill method (Andersen and Gill, 1982). Then LTRC forest methods can be applied on this reformatted dataset.

Overall, the methods in this package can handle all combinations of left truncation, right censoring, time-invariant covariates, and time-varying covariates. If one is in the traditional case with right-censored data and time-invariant covariates, however, then it is recommended to use the functions cforest and rfsrc directly to construct conditional inference forests and random survival forests, respectively.

References

Andersen, P. and Gill, R. (1982). Cox<U+2019>s regression model for counting processes, a large sample study. Annals of Statistics, 10:1100-1120.

H. Ishwaran, E. H. Blackstone, C. Pothier, and M. S. Lauer. (2004). Relative risk forests for exercise heart rate recovery as a predictor of mortality. Journal of the American StatisticalAssociation, 99(1):591<U+2013>600.

Fu, W. and Simonoff, J.S. (2016). Survival trees for left-truncated and right-censored data, with application to time-varying covariate data. Biostatistics, 18(2):352<U+2013>369.

See Also

ltrccif, ltrcrrf, predictProb, print, tune.ltrccif, tune.ltrcrrf, sbrier_ltrc