Learn R Programming

robCompositions (version 2.0.0)

outCoDa: Outlier detection for compositional data

Description

Outlier detection for compositional data using standard and robust statistical methods.

Usage

outCoDa(x, quantile = 0.975, method = "robust", h = 1/2, coda = TRUE)
"print"(x, ...)
"plot"(x, y, ..., which = 1)

Arguments

x
compositional data
quantile
quantile, corresponding to a significance level, is used as a cut-off value for outlier identification: observations with larger (squared) robust Mahalanobis distance are considered as potential outliers.
method
either “robust” (default) or “standard”
h
the size of the subsets for the robust covariance estimation according the MCD-estimator for which the determinant is minimized (the default is (n+p+1)/2).
coda
if TRUE, data transformed to coordinate representation before outlier detection.
...
additional parameters for print and plot method passed through
y
unused second plot argument for the plot method
which
1 ... MD against index 2 ... distance-distance plot

Value

mahalDist
resulting Mahalanobis distance
limit
quantile of the Chi-squared distribution
outlierIndex
logical vector indicating outliers and non-outliers
method
method used

Details

The outlier detection procedure is based on (robust) Mahalanobis distances after a isometric logratio transformation of the data. Observations with squared Mahalanobis distance greater equal a certain quantile of the Chi-squared distribution are marked as outliers.

If method “robust” is chosen, the outlier detection is based on the homogeneous majority of the compositional data set. If method “standard” is used, standard measures of location and scatter are applied during the outlier detection procedure.

plot method: the Mahalanobis distance are plotted against the index. The dashed line indicates the (1 - alpha) quantile of the Chi-squared distribution. Observations with Mahalanobis distance greater than this quantile could be considered as compositional outliers.

References

Egozcue J.J., V. Pawlowsky-Glahn, G. Mateu-Figueras and C. Barcel'o-Vidal (2003) Isometric logratio transformations for compositional data analysis. Mathematical Geology, 35(3) 279-300. \

Filzmoser, P., and Hron, K. (2008) Outlier detection for compositional data using robust methods. Math. Geosciences, 40 233-248.\

Rousseeuw, P.J., Van Driessen, K. (1999) A fast algorithm for the minimum covariance determinant estimator. Technometrics, 41 212-223.

See Also

isomLR

Examples

Run this code

data(expenditures)
oD <- outCoDa(expenditures)
oD
## providing a function:
oD <- outCoDa(expenditures, coda = log)

Run the code above in your browser using DataLab