Usage
mkSEER(df,seerHome="~/data/SEER",outDir="mrgd",outFile="cancDef",
indices = list(c("sex","race"), c("histo3","seqnum"), "ICD9"),
writePops=TRUE,writeRData=TRUE,writeDB=TRUE)
Arguments
df
A data frame that was the output of pickFields()
. This determines which fields to transfer.
Using the output of getFields()
is a common mistake that must be avoided.
seerHome
The directory that contains the SEER population and incidence directories. This should be writable by the user.
outDir
seerHome subdirectory to write to. Default is mrgd for all registries merged together.
outFile
Base name of the SQLite database and cancer binary. Default = CancDef (Cancer Default).
indices
Passed to copy_to()
in dplyr. writePops
TRUE if you wish to write out the population data frame binaries. Doing so takes ~10 seconds, so
savings of FALSE are small.
writeRData
TRUE if you wish to write out the cancer data frame binary. Writing files takes most of the time.
writeDB
TRUE if you wish to write cancer, popga, popsa, and popsae data frames to SQLite database tables.