discretize takes a data frame of continuous variables as its first
argument and returns a secdond data frame of discrete variables, transformed
using of three methods: interval, quantile or hartemink.
dedup screens the data for pairs of highly correlated variables, and
discards one in each pair.