WH_hclust

A MatH object (a matrix of distributionH).

A logic value (default is FALSE), if TRUE histograms are recomputed in order to speed-up the algorithm.

simplify

An integer, if <code>simplify</code>=TRUE is the number of quantiles used for recodify the histograms.

A logic value (default is FALSE). If TRUE, histogram-valued data are standardized, variable by variable, 
using the Wassertein based standard deviation. Use if one wants to have variables with std equal to one.

standardize

A string default "WDIST" the L2 Wasserstein distance (other distances will be implemented)

distance

A string, default="complete", is the the agglomeration method to be used.
This should be (an unambiguous abbreviation of) one of "<code>ward.D</code>", "<code>ward.D2</code>",
 "<code>single</code>", "<code>complete</code>", "<code>average</code>" (= UPGMA), "<code>mcquitty</code>" 
 (= WPGMA), "<code>median</code>" (= WPGMC) or "<code>centroid</code>" (= UPGMC).

method

The function implements a Hierarchical clustering 
 for a set of histogram-valued data, based on the L2 Wassertein distance.
 Extends the <code>hclust</code> function of the stat package.

In the framework of Symbolic Data Analysis, a relatively new
approach to the statistical analysis of multi-valued data, we consider
histogram-valued data, i.e., data described by univariate histograms. The
methods and the basic statistics for histogram-valued data are mainly based
on the L2 Wasserstein metric between distributions, i.e., the Euclidean metric
between quantile functions. The package contains unsupervised classification
techniques, least square regression and tools for histogram-valued data and for
histogram time series. An introducing paper is Irpino A. Verde R. (2015) <doi:10.1007/s11634-014-0176-4>.

WH_hclust: Hierarchical clustering of histogram data

Description

Usage

Arguments

Value

References

See Also

Examples