Learn R Programming

GPCSIV (version 0.1.0)

GPCSIV-package: Generalized Principal Component of Symbolic Interval Variables.

Description

This package implements an extension of principal component analysis (PCA) tailored to handle multiple data tables. These multiple data tables contain the same number of Interval variables and the same observations. This package can handle Big Data in the sense that the variation in massive data can be described by intervals [a, b] and multiple tables. If only one data table is specified, in this case this package performs a PCA of interval data.

Arguments

Details

Package:
GPCSIV
Type:
Package
Version:
1.0
Date:
2013-06-06
License:
GPL (>= 2)

Each dataset can be in csv, excel, access, txt,...,file. The only constraint is that for each variable, the maximum column must follow the minimum column. The Resdata class implemented returns two list of data frames (list of minimums and maximums). These lists of data frames are the inputs of the gpca function.

References

Billard, L. and E. Diday (2006). Symbolic Data Analysis: conceptual statistics and data Mining. Berlin: Wiley series in computational statistics.

Diday, E. and M. Noirhomme-Fraiture (2008). Symbolic Data Analysis and the SODAS Software. Chichester: Wiley Interscience.

Makosso-Kallyth, S (2013). Analysis of m sets of symbolic interval variables. Revue des Nouvelles Technologies de l'Information, vol. RNTI-E25. pp. 97-108.