predict.randomForest: predict method for random forest objects

Description

Prediction of test data using random forest.

Usage

"predict"(object, newdata, type="response", norm.votes=TRUE, predict.all=FALSE, proximity=FALSE, nodes=FALSE, cutoff, ...)

Arguments

object

an object of class randomForest, as that created by the function randomForest.

newdata

a data frame or matrix containing new data. (Note: If not given, the out-of-bag prediction in object is returned.

type

one of response, prob. or votes, indicating the type of output: predicted values, matrix of class probabilities, or matrix of vote counts. class is allowed, but automatically converted to "response", for backward compatibility.

norm.votes

Should the vote counts be normalized (i.e., expressed as fractions)? Ignored if object$type is regression.

predict.all

Should the predictions of all trees be kept?

proximity

Should proximity measures be computed? An error is issued if object$type is regression.

nodes

Should the terminal node indicators (an n by ntree matrix) be return? If so, it is in the ``nodes'' attribute of the returned object.

cutoff

(Classification only) A vector of length equal to number of classes. The `winning' class for an observation is the one with the maximum ratio of proportion of votes to cutoff. Default is taken from the forest$cutoff component of object (i.e., the setting used when running randomForest).

...

not used currently.

Value

response: predicted classes (the classes with majority vote).
prob: matrix of class probabilities (one column for each class and one row for each input).
vote: matrix of vote counts (one column for each class and one row for each new input); either in raw counts or in fractions (if norm.votes=TRUE).

References

Breiman, L. (2001), Random Forests, Machine Learning 45(1), 5-32.

Examples

Run this code

data(iris)
set.seed(111)
ind <- sample(2, nrow(iris), replace = TRUE, prob=c(0.8, 0.2))
iris.rf <- randomForest(Species ~ ., data=iris[ind == 1,])
iris.pred <- predict(iris.rf, iris[ind == 2,])
table(observed = iris[ind==2, "Species"], predicted = iris.pred)
## Get prediction for all trees.
predict(iris.rf, iris[ind == 2,], predict.all=TRUE)
## Proximities.
predict(iris.rf, iris[ind == 2,], proximity=TRUE)
## Nodes matrix.
str(attr(predict(iris.rf, iris[ind == 2,], nodes=TRUE), "nodes"))