Example dataset with partially observed covariates.
smdi_data
smdi_data
A data frame with 2,500 rows and 14 columns:
Treatment assignment variable (binary). Indicates initiation of the exposure of interest (1) versus a comparator regimen (0)
Age at baseline in years
Is gender female (0 = no, 1 = yes)
ECOG performance score at baseline (0 versus 1). Shows 30% missingness following an MCAR mechanism.
Smoking status at baseline (0 = non-smoker, 1 = smoker)
Physical activity at baseline (not active versus active)
EGFR mutation status (0 = wild-type, 1 = alteration). Shows 20% missingness following an MAR mechanism.
ALK transolcation mutation status (0 = wild-type, 1 = alteration)
PD-L1 cell staining biomarker in %. Shows 40% missingness following an MNAR(value) mechanism
Tumor histology (0 = nonsquamous, 1 = squamous)
Socio-economic status (multi-categorical: 1-low, 2-middle, 3-high)
COPD comorbidity at baseline
time to censoring event
event indicator at time t; 0 = censored, 1 = deceased