Learn R Programming

dimensio

Overview

Simple Principal Components Analysis (PCA; see vignette("pca")) and (Multiple) Correspondence Analysis (CA) based on the Singular Value Decomposition (SVD). This package provides S4 classes and methods to compute, extract, summarize and visualize results of multivariate data analysis. It also includes methods for partial bootstrap validation.

There are many very good packages for multivariate data analysis (such as FactoMineR, ade4, vegan or ca, all extended by FactoExtra). dimensio is designed to be as simple as possible, providing all the necessary tools to explore the results of the analysis.


To cite dimensio in publications use:

Frerebeau N (2024). dimensio: Multivariate Data Analysis. Université Bordeaux Montaigne, Pessac, France. doi:10.5281/zenodo.4478530 https://doi.org/10.5281/zenodo.4478530, R package version 0.9.0, https://packages.tesselle.org/dimensio/.

This package is a part of the tesselle project https://www.tesselle.org.

Installation

You can install the released version of dimensio from CRAN with:

install.packages("dimensio")

And the development version from GitHub with:

# install.packages("remotes")
remotes::install_github("tesselle/dimensio")

Usage

## Load package
library(dimensio)

Compute

## Load data
data(iris)

## Compute PCA
X <- pca(iris, center = TRUE, scale = TRUE, sup_quali = "Species")

Extract

dimensio provides several methods to extract the results:

  • get_data() returns the original data.
  • get_contributions() returns the contributions to the definition of the principal dimensions.
  • get_coordinates() returns the principal or standard coordinates.
  • get_correlations() returns the correlations between variables and dimensions.
  • get_cos2() returns the cos2 values (i.e. the quality of the representation of the points on the factor map).
  • get_eigenvalues() returns the eigenvalues, the percentages of variance and the cumulative percentages of variance.

Visualize

The package allows to quickly visualize the results:

  • biplot() produces a biplot.
  • screeplot() produces a scree plot.
  • viz_rows()/viz_individuals() displays row/individual principal coordinates.
  • viz_columns()/viz_variables() displays columns/variable principal coordinates. viz_variables() depicts the variables by rays emanating from the origin (both their lengths and directions are important to the interpretation).
  • viz_contributions() displays (joint) contributions.
  • viz_cos2() displays (joint) cos2.

The viz_*() functions allow to highlight additional information by varying different graphical elements (color, transparency, shape and size of symbols…).

## Form biplot
biplot(X, type = "form")
## Highlight species
viz_individuals(
  x = X, 
  extra_quali = iris$Species, 
  color = c("#004488", "#DDAA33", "#BB5566")
)
## Add ellipses
viz_tolerance(
  x = X, 
  group = iris$Species, 
  level = 0.95,
  border = c("#004488", "#DDAA33", "#BB5566")
)

## Highlight petal length
viz_individuals(
  x = X,
  extra_quanti = iris$Petal.Length,
  color = color("iridescent")(255), 
  size = c(1, 2)
)

## Plot variables factor map
viz_variables(X)

## Scree plot
screeplot(X, eigenvalues = FALSE, cumulative = TRUE)

Contributing

Please note that the dimensio project is released with a Contributor Code of Conduct. By contributing to this project, you agree to abide by its terms.

References

Aitchison, John, and Michael Greenacre. 2002. “Biplots of Compositional Data.” Journal of the Royal Statistical Society: Series C (Applied Statistics) 51 (4): 375–92. https://doi.org/10.1111/1467-9876.00275.

Gower, J. C. 1966. “Some Distance Properties of Latent Root and Vector Methods Used in Multivariate Analysis.” Biometrika 53 (3-4): 325–38. https://doi.org/10.1093/biomet/53.3-4.325.

Greenacre, Michael J. 1984. Theory and Applications of Correspondence Analysis. London ; Orlando, Fla: Academic Press.

———. 2007. Correspondence Analysis in Practice. Seconde edition. Interdisciplinary Statistics Series. Boca Raton: Chapman & Hall/CRC.

———. 2010. Biplots in Practice. Bilbao: Fundación BBVA.

Lebart, Ludovic, Marie Piron, and Alain Morineau. 2006. Statistique exploratoire multidimensionnelle : Visualisations et inférences en fouilles de données.

Lockyear, Kris. 2013. “Applying Bootstrapped Correspondence Analysis to Archaeological Data.” Journal of Archaeological Science 40 (12): 4744–53. https://doi.org/10.1016/j.jas.2012.08.035.

Ringrose, T. J. 1992. “Bootstrapping and Correspondence Analysis in Archaeology.” Journal of Archaeological Science 19 (6): 615–29. https://doi.org/10.1016/0305-4403(92)90032-X.

Copy Link

Version

Install

install.packages('dimensio')

Monthly Downloads

351

Version

0.9.0

License

GPL (>= 3)

Issues

Pull Requests

Stars

Forks

Last Published

August 26th, 2024

Functions in dimensio (0.9.0)

cdt

Complete Disjunctive Table
mca

Multiple Correspondence Analysis
pca

Principal Components Analysis
summary

Object Summaries
get_data

Get Original Data
get_eigenvalues

Get Eigenvalues
reexports

Objects exported from other packages
svd2

Singular Value Decomposition of a Matrix
prepare_plot

Prepare Data for Plotting
dimensio-defunct

Defunct Functions in dimensio
dimensio-deprecated

Deprecated Functions in dimensio
get_contributions

Get Contributions
tidy

Tidy Coordinates
loadings

Extract Loadings
benthos

Benthos
label

Non-Overlapping Text Labels
viz_biplot

Build a Biplot
biplot

Biplot
get_coordinates

Get Coordinates
dimensio-package

dimensio: Multivariate Data Analysis
dimnames

Dimnames of an Object
viz_labels

Non-Overlapping Text Labels
viz_points

Build a Factor Map
predict

Predict New Coordinates
prepare_legend

Build a Legend
plot

Plot Coordinates
pcoa

Principal Coordinates Analysis
screeplot

Scree Plot
viz_variables

Visualize Variables Factor Map
viz_wrap

Plot Envelopes
subset

Extract Parts of an Object
wrap

Wrap Observations
viz_contributions

Visualize Contributions and cos2
viz_individuals

Visualize Individuals Factor Map
MultivariateResults

Multivariate Data Analysis Results
MultivariateSummary

Summary of Multivariate Data Analysis
BootstrapCA-class

Bootstrap CA Results
colours

Colours
boot

Partial Bootstrap Analysis
CA-class

CA Results
PCA-class

PCA Results
PCOA-class

PCoA Results
burt

Burt Table
countries

Countries
ca

Correspondence Analysis
MultivariateAnalysis

Output of Multivariate Data Analysis
MCA-class

MCA Results
MultivariateBootstrap

Output of Bootstrap Replications
BootstrapPCA-class

Bootstrap PCA Results