Learn R Programming

SEER2R (version 1.0)

read.SeerStat: read from SEER*STAT export files

Description

SEER*STAT presents results in matrix session and users can export results into data file (either in plain text format or gz format) and dictionary information into dic file. read.SeerStat reads data and dictionary information from SEER*STAT export files.

Usage

read.SeerStat(DICfileName, TXTfileName = NULL, UseVarLabelsInData = FALSE,ReadHeaderOnly=FALSE,...)

Arguments

DICfileName
: filename of the dic file. The default extention is 'dic'. If 'DICfileName' does not contain at the end a string '.dic' (letter case does not matter), then '.dic' will be added.
TXTfileName
: filename of the associated data file. If TXTfileName = NULL, then a string of DICfileName with extention substituted by 'txt' (for uncompressed data file) or 'gz' (for gzip compressed data file) will be used (whether 'txt' or 'gz' depends on informa
UseVarLabelsInData
: a logic value. If true, then variable labels read from the dic file will replace associated numeric values in the data.frame object, which stores data from the associated data file and is returned by this function read.SeerStat. If false, then dat
ReadHeaderOnly
: a logic value. If true, then only the list storing the information read from the dic file will be returned. Otherwise, A data frame object containing a representation of the data in the associated data file will be returned.
...
: Arguments to be passed to read.table for reading from the associated data file.

Value

    Details

    ``read.SeerStat'' reads data from a SEER*Stat data file into an object of data.frame in R and stores information from the associated dictionary file in an attribute variable (named ``DICInfo'') of the data.frame object. The variables of the SEER*Stat data file are stored in columns of the data.frame object. The column names of the data.frame object are based on the variable names in the associated SEER*Stat dic file, with special characters ``,:()<>={}!@#$to a single `_'. For example, the column name will be ``Example_Variable_1'' if the variable name in the SEER*Stat dic file is ``Example* (Variable 1)''.

    References

    Jun Luo and Binbing Yu, 'SEER2R: An interface between SEER cancer registry data and R'

    See Also

    write.SeerStat, SEER2R

    Examples

    Run this code
    #load testing data: there are three data objects, i.e. SEER2RTestData1,SEER2RTestData2,SEER2RTestData3 
    data("SEER2RTestdata");
    
    #create one SEER*STAT export dic and the associated text data file for testing purpose
    dicinfoused = write.SeerStat(SEER2RTestData2, DICfileName = "testrun1.dic",UseVarLabelsInTxtFile=FALSE);
    
    #usage of read.SeerStat
    mydata = read.SeerStat("testrun1.dic",UseVarLabelsInData=FALSE);
    #get informatin inside the dic file
    DICInfo = attr(mydata, "DICInfo");
    
    #change names of columns whose names contains strings "site" or "sex"; 
    #the order of strings does not matter
    testdatanewnames = attr(mydata,"assignColNames")(mydata,c("sex","site"));
    
    #extract columns whose names contains strings "site" or "sex";
    testdata = attr(mydata,"getSubDataByVarName")(mydata,c("site","sex"));
    
    #usage of write.SeerStat
    dicinfoused = write.SeerStat(mydata, DICfileName = "testrun2.dic", UseVarLabelsInTxtFile = FALSE);

    Run the code above in your browser using DataLab