Learn R Programming

Cyclops (version 3.4.0)

readCyclopsData: Read Cyclops data from file

Description

readCyclopsData reads a Cyclops-formatted text file.

Usage

readCyclopsData(fileName, modelType)

Value

A list that contains a Cyclops model data object pointer and an operation duration

Arguments

fileName

Name of text file to be read. If fileName does not contain an absolute path,

modelType

character string: Valid types are listed below.

Models

Currently supported model types are:

"ls"Least squares
"pr"Poisson regression
"lr"Logistic regression
"clr"Conditional logistic regression
"cpr"Conditional Poisson regression
"sccs"Self-controlled case series
"cox"Cox proportional hazards regression
"fgr"Fine-Gray proportional subdistribution hazards regression

Details

This function reads a Cyclops-formatted text file and returns a Cyclops data object. The first line of the file may start with '#', indicating that it contains header options. Valid header options are:

row_label(assume file contains a numeric column of unique row identifiers)
stratum_label(assume file contains a numeric column of stratum identifiers)
weight(assume file contains a column of row-specific model weights, currently unused)
offset(assume file contains a dense column of linear predictor offsets)
bbr_outcome(assume logistic outcomes are encoded -1/+1 following BBR)
log_offset(assume file contains a dense column of values x_i for which log(x_i) is the offset)
add_intercept(automatically include an intercept column of all 1s for each entry)
indicator_only(assume all covariates 0/1-valued and only covariate name is given)
sparse(force all BBR formatted covariates to be represented as sparse, instead of
sparse-indicator, columns .. really only for debugging)
dense(force all BBR formatted covariates to be represented as dense columns.. really
only for debugging)

Successive lines of the file are white-space delimited and follow the format:

	[Row ID] {Stratum ID} [Weight] <Outcome> {Censored} {Offset} <BBR covariates>

  • [optional]

  • <required>

  • {required or optional depending on model}

Bayesian binary regression (BBR) covariates are white-space delimited and generally in a sparse <name>:<value> format, where name must (currently) be numeric and value is non-zero. If option indicator_only is specified, then format is simply <name>. Row ID and Stratum ID must be numeric, and rows must be sorted such that equal Stratum ID are consecutive. Stratum ID is required for clr and sccs models. Censored is required for a cox model. Offset is (currently) required for a sccs model.

Examples

Run this code
if (FALSE) {
dataPtr = readCyclopsData(system.file("extdata/infert_ccd.txt", package="Cyclops"), "clr")
}

Run the code above in your browser using DataLab