This dataset is a transformation of the example dataset used by Crowther and Lambert in their multistate paper.
data(BrCa)
A data frame with 2982 observations on the following 17 variables:
pid
Person-id; numeric
year
Calendar year of diagnosis
age
Age at diagnosis
meno
Menopausal status; a factor with levels pre
post
size
Tumour size; a factor with levels <=20 mm
>20-50 mm
>50 mm
grade
Tumour grade; a factor with levels 2
3
nodes
Number of positive lymph nodes, a numeric vector
pr
Progesteron receptor level
pr.tr
Transformed progesteron level
er
Estrogen receptor level
hormon
Hormon therapy at diagnosis; a factor with levels no
yes
chemo
Chemotherapy treatment; a factor with levels no
yes
tor
Time of relapse, years since diagnosis
tom
Time of metastasis, years since diagnosis
tod
Time of death, years since diagnosis
tox
Time of exit from study, years since diagnosis
xst
Vital status at exit; a factor with levels Alive
Dead
The dataset has been modified to contain the times (since diagnosis) of the events of interest, to comply with the usual structure of data.
The data were used as example in the paper by Crowther and Lambert: Parametric multistate survival models: Flexible modelling allowing transition-specific distributions with application to estimating clinically useful measures of effect differences; Stat Med 36 (29), pp 4719-4742, 2017. (No, it is not the paper, just the title.)
A parallel analysis using the Lexis
machinery is available
as: http://bendixcarstensen.com/AdvCoh/papers/bcMS.pdf