Simulated data from a case-control study containing demographic variables, marker values, and cancer outcome. The prevalence of cancer in the population from which these data were sampled is approximately 0.11.
dcaData_cc
A data frame with 500 rows and 6 variables:
Age: Age in years.
Female: Indicator for female gender.
Smokes: Indicator for smoking status.
Marker1: Simulated biomarker.
Marker2: Simulated biomarker.
Cancer: Indicator for cancer.
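
Because a case-control design fixes the ratio of cases to controls, the share of cases in the sample generally does not estimate the population prevalence. The stated value (about 0.11) is therefore needed whenever a prevalence-dependent quantity is computed. The following is a minimal sketch in Python, not part of this dataset's documentation. It computes a prevalence-adjusted net benefit at one risk threshold. The function name, the toy labels, and the toy risk scores are hypothetical and stand in for Cancer and a fitted risk; they are not values from dcaData_cc.

```python
def prevalence_adjusted_net_benefit(y_true, risk, threshold, prevalence):
    """Net benefit at one risk threshold, weighted by a population prevalence.

    Sensitivity and the false-positive rate are estimated separately within
    cases and controls, so they do not depend on the case-control ratio.
    """
    cases = [r for y, r in zip(y_true, risk) if y == 1]
    controls = [r for y, r in zip(y_true, risk) if y == 0]
    tpr = sum(r >= threshold for r in cases) / len(cases)
    fpr = sum(r >= threshold for r in controls) / len(controls)
    odds = threshold / (1 - threshold)  # weight given to a false positive
    return tpr * prevalence - fpr * (1 - prevalence) * odds


# Hypothetical toy sample: 4 cases and 4 controls (illustrative only).
y = [1, 1, 1, 1, 0, 0, 0, 0]
risk = [0.9, 0.7, 0.4, 0.15, 0.3, 0.08, 0.05, 0.02]

# Assumed population prevalence, taken from the description above.
nb_model = prevalence_adjusted_net_benefit(y, risk, threshold=0.1, prevalence=0.11)

# "Treat all" reference: everyone is classified as high risk.
nb_all = prevalence_adjusted_net_benefit(y, [1.0] * len(y), threshold=0.1, prevalence=0.11)

print(round(nb_model, 4), round(nb_all, 4))
```

Plugging in the sample case fraction (0.5 in this toy sample) instead of 0.11 would overstate the benefit of any strategy that labels many subjects as high risk.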