CORElearn (version 1.57.3)

rfProximity: A random forest based proximity function

Description

A random forest computes the similarity between instances from the classification of out-of-bag instances: whenever two out-of-bag cases end up in the same leaf of a tree, the proximity between them is incremented.
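The counting rule described above can be illustrated with a small base R sketch. This is not CORElearn's implementation; the `leaf` matrix is hypothetical toy data (rows are instances, columns are trees, `NA` marks an instance that was in-bag for that tree), and any normalization CORElearn may apply to the counts is deliberately left out.

```r
# Hypothetical leaf assignments (toy data, not produced by CORElearn):
# rows = instances, columns = trees, NA = instance was in-bag for that tree.
leaf <- matrix(c( 1,  1, NA,
                  1, NA,  2,
                  2,  1,  2,
                 NA,  2,  2),
               nrow = 4, byrow = TRUE)

n <- nrow(leaf)
prox <- matrix(0, n, n)

for (tr in seq_len(ncol(leaf))) {
  oob <- which(!is.na(leaf[, tr]))       # out-of-bag instances for this tree
  for (i in oob) {
    for (j in oob) {
      # increment when both out-of-bag cases fall into the same leaf
      if (leaf[i, tr] == leaf[j, tr]) prox[i, j] <- prox[i, j] + 1
    }
  }
}

print(prox)  # raw co-occurrence counts, symmetric by construction
```

In this toy forest, instances 1 and 2 share a leaf only in the first tree, so their count is 1, while instances 1 and 4 are never out-of-bag in the same leaf, so their count stays 0.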

Usage

rfProximity(model, outProximity=TRUE)

Value

The function returns an M by M matrix, where M is the number of training instances. The returned matrix serves as input to other functions (see rfOutliers and rfClustering).

Arguments

model

a CORElearn model of type random forest.

outProximity

if TRUE, the function returns a proximity matrix; otherwise it returns a distance matrix.

Author

John Adeyanju Alao (as a part of his BSc thesis) and Marko Robnik-Sikonja (thesis supervisor)

Details

A proximity is transformed into a distance with the expression distance = sqrt(1 - proximity).
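As a minimal illustration of this expression, the sketch below applies it to a small hand-made proximity matrix in base R. The values in `prox` are invented for the example and assume proximities scaled to the range 0 to 1 with a diagonal of 1.

```r
# Hypothetical proximity matrix (made-up values in [0, 1], diagonal = 1)
prox <- matrix(c(1.00, 0.75, 0.00,
                 0.75, 1.00, 0.36,
                 0.00, 0.36, 1.00),
               nrow = 3, byrow = TRUE)

# distance = sqrt(1 - proximity), applied element-wise
dst <- sqrt(1 - prox)
print(round(dst, 2))
```

A proximity of 1 maps to distance 0 and a proximity of 0 maps to distance 1, so more similar instances end up closer together.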

References

Leo Breiman: Random Forests. Machine Learning Journal, 45:5-32, 2001

See Also

CoreModel, rfOutliers, cmdscale, rfClustering.

Examples

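library(CORElearn)
library(lattice)

# build a random forest with 30 trees on the iris data
md <- CoreModel(Species ~ ., iris, model="rf", rfNoTrees=30, maxThreads=1)
# M by M proximity matrix (M = number of training instances)
pr <- rfProximity(md, outProximity=TRUE)
# visualization
levelplot(pr)

destroyModels(md) # clean up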
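Since cmdscale is listed under See Also, one possible use of the distance form of the output is a two-dimensional scaling plot. The following is only a sketch: it assumes the matrix returned with outProximity=FALSE can be passed through `as.dist()` to `cmdscale()`, and the plotting choices (colours, axis labels) are illustrative.

```r
library(CORElearn)

# random forest on iris, same settings as the example above
md <- CoreModel(Species ~ ., iris, model = "rf", rfNoTrees = 30, maxThreads = 1)

# request distances instead of proximities
dst <- rfProximity(md, outProximity = FALSE)

# classical multidimensional scaling into two dimensions;
# as.dist() uses only the lower triangle of the matrix
xy <- cmdscale(as.dist(dst), k = 2)

# one point per training instance, coloured by species
plot(xy, col = as.integer(iris$Species), pch = 19,
     xlab = "MDS 1", ylab = "MDS 2")

destroyModels(md) # clean up
```

Because the forest is built with random sampling, the exact layout of the points will differ between runs.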
