Learn R Programming

Contrast Trees and Distribution Boosting

Contrast trees are used to assess the accuracy of many types of machine learning estimates that are not amenable to standard validation techniques. These include properties of the conditional distribution $p_{y}(y,|,\mathbf{x})$ (means, quantiles, complete distribution) as functions of $\mathbf{x}$. Given a set of predictor variables $\mathbf{x}=(x_{1},x_{2},$$,x_{p})$ and two outcome variables $y$ and $z$ associated with each $\mathbf{x}$, a contrast tree attempts to partition the space of $\mathbf{x}$ values into local regions within which the respective distributions of $y,|,\mathbf{x}$ and $z,|,\mathbf{x}$, or selected properties of those distributions such as means or quantiles, are most different.

For more details, please see the tutorial.

Copy Link

Version

Install

install.packages('conTree')

Monthly Downloads

187

Version

0.3-1

License

Apache License 2.0

Maintainer

Balasubramanian Narasimhan

Last Published

November 22nd, 2023

Functions in conTree (0.3-1)

prune.seq

Show all possible pruned subtrees
getnodes

Get terminal node observation assignments
nodesum

Summarize contrast tree
conTree-package

Contrast and Boosted Trees
air_quality

Air Quality Data from UC Irvine Machine Learning Repository
onesample_parameters

Return the one sample parameters used in fortran discrepancy functions
contrast

Build contrast tree
lofcurve

Produce lack-of-fit curve for a contrast tree
predtrast

Predict y-values from boosted contrast model
save_rfun

Save the function f for calling from fortran
treesum

Print terminal node x-region boundaries
prune

Prune a contrast tree
age_data

Age and Demographics data
census

Census Data Example from UC Irvine Machine Learning Repository
ydist

Transform z-values t(z) such that the distribution of \(p(t(z) | x)\) approximates \(p(t(y | x)\) for type = 'dist' only
xval

Cross-validate boosted contrast tree boosted with (new) data