stagedtrees (version 2.1.0)

generate_linear_dataset: Generate a random binary dataset for classification

Description

Randomly generate a simple classification problem.

Usage

generate_linear_dataset(
  n = 2,
  N = 10000,
  eps = 1.2,
  gamma = runif(1, min = -n, max = n),
  alpha = runif(n, min = -n, max = n)
)

Arguments

n

number of variables.

N

number of observations.

eps

noise level: half-width of the uniform noise term, which is drawn from runif(1, -eps, eps).

gamma

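numeric, the constant offset added to the linear score. Defaults to a single random draw from runif(1, min = -n, max = n).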

alpha

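numeric vector of length n, the coefficients that multiply the variables in the linear score. Defaults to n random draws from runif(n, min = -n, max = n).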

Value

A data.frame with n independent random variables and one class variable C computed as sign(sum(x * alpha) + runif(1, -eps, eps) + gamma).
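The formula above can be illustrated with a small stand-alone sketch. This is not the package source: the function name sketch_linear_dataset, the column names X1, ..., Xn, the choice of predictors taking values -1 or 1 with equal probability, and drawing a fresh noise value for each observation are all assumptions made for illustration only.

```r
# Hypothetical re-implementation for illustration; NOT the stagedtrees source.
sketch_linear_dataset <- function(n = 2, N = 10000, eps = 1.2,
                                  gamma = runif(1, min = -n, max = n),
                                  alpha = runif(n, min = -n, max = n)) {
  # Assumption: predictors are independent and take values -1 or 1.
  X <- matrix(sample(c(-1, 1), size = N * n, replace = TRUE),
              nrow = N, ncol = n)
  # Assumption: one uniform noise draw in (-eps, eps) per observation.
  noise <- runif(N, min = -eps, max = eps)
  # Linear score: sum(x * alpha) + noise + gamma, computed row by row.
  score <- as.vector(X %*% alpha) + noise + gamma
  DD <- as.data.frame(X)
  names(DD) <- paste0("X", seq_len(n))  # hypothetical column names
  # Class variable is the sign of the noisy linear score.
  DD$C <- factor(sign(score))
  DD
}

set.seed(1)
DD <- sketch_linear_dataset(n = 3, N = 200)
table(DD$C)
```

Under these assumptions, a larger eps relative to the magnitude of alpha makes the classes harder to separate, because the noise more often flips the sign of the score.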

Examples

library(stagedtrees)
# 1000 observations of 5 variables plus the class variable C
DD <- generate_linear_dataset(n = 5, N = 1000)