Learn R Programming

Rborist (version 0.3-7)

presample: Forest-wide Observation Sampling

Description

Observations sampled for each tree to be trained. In the case of the Random Forest algorithm, this is the bag.

Usage

# S3 method for default
presample(y,
                            nHoldout = 0,
                            samplingWeight = numeric(0),
                            nSamp = 0,
                            nRep = 500,
                            withRepl =  TRUE,
                            verbose = FALSE,
                            nTree = 0,
                            ...)

Value

an object of class Sampler consisting of:

  • yTrain the sampled vector.

  • nSamp the sample sizes drawn.

  • nRep the number of independent samples.

  • nTree synonymous with nRep. Deprecated.

  • samples a packed data structure encoding the observation index and corresponding sample count.

  • hash a hashed digest of the data items.

Arguments

y

A vector to be sampled, typically the response.

nHoldout

Number of observations to omit from sampling. Augmented by unobserved response values.

samplingWeight

Per-observation sampling weights. Default is uniform.

nSamp

Size of sample draw. Default draws y length.

nRep

Number of samples to draw. Replaces deprecated nTree.

withRepl

true iff sampling is with replacement.

verbose

true iff tracing execution.

nTree

Number of samples to draw. Deprecated.

...

not currently used.

References

Tille, Yves. Sampling algorithms. Springer New York, 2006.

Examples

Run this code
  if (FALSE) {
    y <- runif(1000)

    # Samples with replacement, 500 vectors of length 1000:
    ps <- presample(y)

    # Samples, as above, with 63 observations held out:
    ps <- presample(y, nHoldout = 63)

    # Samples without replacement, 250 vectors of length 500:
    ps2 <- presample(y, nTree=250, nSamp=500, withRepl = FALSE)


  }

Run the code above in your browser using DataLab