pickFields(sas,picks=c("casenum","reg","race","sex","agedx", "yrbrth","seqnum","modx","yrdx","histo3","radiatn", "ICD9","COD","surv"))
getFields()
using the SAS file found in
the incidence directory of seerHome
, the root of the SEER ASCII data installation.sas
. Downstream, mkSEER will not work without a minimum of
"reg","race","sex","agedx","histo3","radiatn","ICD9"
, but to estimate
survival and second cancer risks the default is the minimum.
sas
, shortened to just the rows of picks
, and expanded to include
spacer rows of fields of no interest pooled into single strings: the width of such a spacer row is equal to
the distance in bytes between the fields of interest above and below it. This data frame is then
used by laf_open_fwf()
of LaF in mkSEER()
to read the SEER files. Proper use of this function, and of the SEER data in general,
requires an understanding of the contents of seerdic.pdf in the incidence directory of seerHome
.
SEERaBomb
is faster than SEER*Stat
because it tailors/streamlines the database to your interests. The default picks are a reasonable place to start; if you
determine later that you need more fields, you can always rebuild the binaries. Grabbing all fields is
discouraged, but if you want this anyway, note that you still need pickFields to create a data type column, i.e. you cannot bypass pickFields by sending the output of getFields straight to mkSEER.SEERaBomb-package, getFields, pickFields, mkSEER
## Not run:
# library(SEERaBomb)
# (df=getFields())
# (df=pickFields(df))
#
# ## End(Not run)
Run the code above in your browser using DataLab