Learn R Programming

⚠️There's a newer version (5.5) of this package.Take me there.

Compositional (version 4.7)

Compositional Data Analysis

Description

Regression, classification, contour plots, hypothesis testing and fitting of distributions for compositional data are some of the functions included. The standard textbook for such data is John Aitchison's (1986) "The statistical analysis of compositional data". Relevant papers include: a) Tsagris M.T., Preston S. and Wood A.T.A. (2011). A data-based power transformation for compositional data. Fourth International International Workshop on Compositional Data Analysis. b) Tsagris M. (2014). The k-NN algorithm for compositional data: a revised approach with and without zero values present. Journal of Data Science, 12(3):519--534. c) Tsagris M. (2015). A novel, divergence based, regression for compositional data. Proceedings of the 28th Panhellenic Statistics Conference, 15-18 April 2015, Athens, Greece, 430--444. d) Tsagris M. (2015). Regression analysis with compositional data containing zero values. Chilean Journal of Statistics, 6(2):47--57. e) Tsagris M., Preston S. and Wood A.T.A. (2016). Improved supervised classification for compositional data using the alpha-transformation. Journal of Classification, 33(2):243--261. . f) Tsagris M., Preston S. and Wood A.T.A. (2017). Nonparametric hypothesis testing for equality of means on the simplex. Journal of Statistical Computation and Simulation, 87(2): 406--422. . g) Tsagris M. and Stewart C. (2018). A Dirichlet regression model for compositional data with zeros. Lobachevskii Journal of Mathematics,39(3): 398--412. . h) Alenazi A. (2019). Regression for compositional data with compositional data as predictor variables with or without zero values. Journal of Data Science, 17(1): 219--238. . i) Tsagris M. and Stewart C. (2020). A folded model for compositional data analysis. Australian and New Zealand Journal of Statistics, 62(2):249--277. . j) Tsagris M., Alenazi A. and Stewart C. (2021). The alpha-k-NN regression for compositional data. . We further include functions for percentages (or proportions).

Copy Link

Version

Install

install.packages('Compositional')

Monthly Downloads

1,440

Version

4.7

License

GPL (>= 2)

Maintainer

Michail Tsagris

Last Published

May 13th, 2021

Functions in Compositional (4.7)

Compositional-package

Compositional Data Analysis
Cross validation for the alpha-kernel regression for compositional response data

Cross validation for the \(\alpha\)-kernel regression for compositional response data
Cross validation for the alpha-k-NN regression for compositional response data

Cross validation for the \(\alpha\)-k-NN regression for compositional response data
The alpha-k-NN regression with compositional predictor variables

The \(\alpha\)-k-NN regression with compositional predictor variables
Tuning of the alpha generalised correlations between two compositional datasets

Tuning of the alpha generalised correlations between two compositional datasets
Estimation of the value of alpha in the folded model

Estimation of the value of \(\alpha\) in the folded model
The alpha-kernel regression for compositional response data

The \(\alpha\)-kernel regression for compositional response data
The alpha-k-NN regression for compositional response data

The \(\alpha\)-k-NN regression for compositional response data
Naive Bayes classifiers for compositional data using the alpha-transformation

Naive Bayes classifiers for compositional data using the \(\alpha\)-transformation
Alpha generalised correlations between two compositional datasets

Alpha generalised correlations between two compositional datasets
The alpha-transformation

The \(\alpha\)-transformation
Ridge regression with compositional data in the covariates side using the alpha-transformation

Ridge regression with compositional data in the covariates side using the \(\alpha\)-transformation
The additive log-ratio transformation and its inverse

The additive log-ratio transformation and its inverse
MLE of the folded model for a given value of alpha

MLE of the folded model for a given value of \(\alpha\)
Regression with compositional data using the alpha-transformation

Regression with compositional data using the \(\alpha\)-transformation
Multivariate or univariate regression with compositional data in the covariates side using the alpha-transformation

Multivariate or univariate regression with compositional data in the covariates side using the \(\alpha\)-transformation
Regularised and flexible discriminant analysis for compositional data using the alpha-transformation

Regularised and flexible discriminant analysis for compositional data using the \(\alpha\)-transformation
Fast estimation of the value of alpha

Fast estimation of the value of \(\alpha\)
Estimation of the value of alpha via the profile log-likelihood

Estimation of the value of \(\alpha\) via the alfa profile log-likelihood
All pairwise additive log-ratio transformations

All pairwise additive log-ratio transformations
The Box-Cox transformation applied to ratios of components

The Box-Cox transformation applied to ratios of components
Ridge regression with the alpha-transformation plot

Ridge regression plot
Contour plot of the kernel density estimate in S^2

Contour plot of the kernel density estimate in \(S^2\)
The k-NN algorithm for compositional data

The k-NN algorithm for compositional data
Cross validation for some compositional regression models

Cross validation for some compositional regression models
Tuning of the projection pursuit regression for compositional data

Tuning of the projection pursuit regression for compositional data
Tuning the number of PCs in the PCR with compositional data using the alpha-transformation

Tuning the number of PCs in the PCR with compositional data using the \(\alpha\)-transformation
Choose the number of principal components via reconstruction error

Choose the number of principal components via reconstruction error
Hypothesis testing for two or more compositional mean vectors

Hypothesis testing for two or more compositional mean vectors
Multivariate regression with compositional data

Multivariate regression with compositional data
Estimating location and scatter parameters for compositional data

Estimating location and scatter parameters for compositional data
Cross-validation for the naive Bayes classifiers for compositional data

Cross-validation for the naive Bayes classifiers for compositional data
The k-nearest neighbours using the alpha-distance

The k-nearest neighbours using the \(alpha\)-distance
Cross validation for the ridge regression with compositional data as predictor using the alpha-transformation

Cross validation for the ridge regression with compositional data as predictor using the \(\alpha\)-transformation
Mixture model selection via BIC

Mixture model selection via BIC
Cross-validation for the LASSO Kullback-Leibler divergence based regression

Cross-validation for the LASSO Kullback-Leibler divergence based regression
Contour plot of the t distribution in S^2

Contour plot of the t distribution in \(S^2\)
Divergence matrix of compositional data

Divergence matrix of compositional data
Cross-validation for the constrained linear least squares for compositional responses and predictors

Cross-validation for the constrained linear least squares for compositional responses and predictors
Cross-validation for the Dirichlet discriminant analysis

Cross-validation for the Dirichlet discriminant analysis
Contour plot of the Dirichlet distribution in S^2

Contour plot of the Dirichlet distribution in \(S^2\)
Fitting a Dirichlet distribution

Fitting a Dirichlet distribution
Density values of a Dirichlet distribution

Density values of a Dirichlet distribution
Empirical likelihood hypothesis testing for two mean vectors

Empirical likelihood hypothesis testing for two mean vectors
Fitting a Dirichlet distribution via Newton-Rapshon

Fitting a Dirichlet distribution via Newton-Rapshon
The alpha-distance

The \(\alpha\)-distance
Cross validation for the transformation-free linear regression for compositional responses and predictors

Cross validation for the transformation-free linear regression for compositional responses and predictors
Naive Bayes classifiers for compositional data

Naive Bayes classifiers for compositional data
Cross validation for the alpha-k-NN regression with compositional predictor variables

Cross validation for the \(\alpha\)-k-NN regression with compositional predictor variables
Cross-validation for the naive Bayes classifiers for compositional data using the alpha-transformation

Cross-validation for the naive Bayes classifiers for compositional data using the \(\alpha\)-transformation
Projection pursuit regression for compositional data

Projection pursuit regression for compositional data
Inverse of the alpha-transformation

Inverse of the \(\alpha\)-transformation
Log-likelihood ratio test for a Dirichlet mean vector

Log-likelihood ratio test for a Dirichlet mean vector
Dirichlet regression

Dirichlet regression
Density of the Flexible Dirichlet distribution

Density of the Flexible Dirichlet distribution
Principal component generalised linear models

Principal component generalised linear models
Principal component analysis

Principal component analysis
Log-contrast regression with compositional predictor variables

Log-contrast regression with compositional predictor variables
Helper Frechet mean for compositional data

Helper Frechet mean for compositional data
MLE for the multivarite t distribution

MLE for the multivarite t distribution
Contour plot of the normal distribution in S^2

Contour plot of the normal distribution in \(S^2\)
Dirichlet discriminant analysis

Dirichlet discriminant analysis
Contour plot of the Flexible Dirichlet distribution in S^2

Contour plot of the Flexible Dirichlet distribution in \(S^2\)
Principal component analysis using the alpha-transformation

Principal component analysis using the \(\alpha\)-transformation
Exponential empirical likelihood for a one sample mean vector hypothesis testing

Exponential empirical likelihood for a one sample mean vector hypothesis testing
Fitting a Flexible Dirichlet distribution

Fitting a Flexible Dirichlet distribution
Tuning the value of alpha in the alpha-regression

Tuning the value of \(\alpha\) in the \(\alpha\)-regression
Beta regression

Beta regression
Cross validation for the regularised and flexible discriminant analysis with compositional data using the alpha-transformation

Cross validation for the regularised and flexible discriminant analysis with compositional data using the \(\alpha\)-transformation
Bayesian network learning with compositional data

Bayesian network learning with compositional data
MLE of distributions defined in the (0, 1) interval

MLE of distributions defined in the (0, 1) interval
The ESOV-distance

The ESOV-distance
The Helmert sub-matrix

The Helmert sub-matrix
Contour plot of the Kent distribution in S^2

Contour plot of the Kent distribution in \(S^2\)
Tuning of the k-NN algorithm for compositional data

Tuning of the k-NN algorithm for compositional data
Constrained linear least squares for compositional responses and predictors

Constrained linear least squares for compositional responses and predictors
Exponential empirical likelihood hypothesis testing for two mean vectors

Exponential empirical likelihood hypothesis testing for two mean vectors
Contour plot of the alpha-folded model in S^2

Contour plot of the alpha-folded model in \(S^2\)
Hotelling's multivariate version of the 1 sample t-test for Euclidean data

Hotelling's multivariate version of the 1 sample t-test for Euclidean data
Unit-Weibull regression models for proportions

Unit-Weibull regression models for proportions
Non linear least squares regression for compositional data

Non linear least squares regression for compositional data
Simulation of compositional data from the Flexible Dirichlet distribution

Simulation of compositional data from the Flexible Dirichlet distribution
Total variability

Total variability
Simulation of compositional data from the folded normal distribution

Simulation of compositional data from the folded model normal distribution
Hotelling's multivariate version of the 2 sample t-test for Euclidean data

Hotelling's multivariate version of the 2 sample t-test for Euclidean data
Empirical likelihood for a one sample mean vector hypothesis testing

Empirical likelihood for a one sample mean vector hypothesis testing
James multivariate version of the t-test

James multivariate version of the t-test
Contour plot of the Gaussian mixture model in S^2

Contour plot of the Gaussian mixture model in \(S^2\)
Multivariate kernel density estimation

Multivariate kernel density estimation
Kullback-Leibler divergence and Bhattacharyya distance between two Dirichlet distributions

Kullback-Leibler divergence and Bhattacharyya distance between two Dirichlet distributions
The folded power transformation

The folded power transformation
Quasi binomial regression for proportions

Quasi binomial regression for proportions
Estimation of the probability left outside the simplex when using the alpha-transformation

Estimation of the probability left outside the simplex when using the alpha-transformation
Divergence based regression for compositional data with compositional data in the covariates side using the alpha-transformation

Divergence based regression for compositional data with compositional data in the covariates side using the \(\alpha\)-transformation
Distance based regression models for proportions

Distance based regression models for proportions
Regularised discriminant analysis for Euclidean data

Regularised discriminant analysis for Euclidean data
Multivariate t random values simulation on the simplex

Multivariate t random values simulation on the simplex
Transformation-free linear regression for compositional responses and predictors

Transformation-free linear regression for compositional responses and predictors
Ternary diagram of regression models

Ternary diagram of regression models
The Frechet mean for compositional data

The Frechet mean for compositional data
LASSO Kullback-Leibler divergence based regression

LASSO Kullback-Leibler divergence based regression
Plot of the LASSO coefficients

Plot of the LASSO coefficients
Tuning the principal components with GLMs

Tuning the principal components with GLMs
Multivariate normal random values simulation on the simplex

Multivariate normal random values simulation on the simplex
Gaussian mixture models for compositional data

Gaussian mixture models for compositional data
Zero adjusted Dirichlet regression

Zero adjusted Dirichlet regression
Multivariate analysis of variance (James test)

Multivariate analysis of variance (James test)
Multivariate skew normal random values simulation on the simplex

Multivariate skew normal random values simulation on the simplex
Non parametric zero replacement strategies

Non parametric zero replacement strategies
Proportionality correlation coefficient matrix

Proportionality correlation coefficient matrix
Cross validation for the ridge regression

Cross validation for the ridge regression
Simulation of compositional data from Gaussian mixture models

Simulation of compositional data from Gaussian mixture models
Power operation

Power operation
Multivariate analysis of variance

Multivariate analysis of variance
Perturbation operation

Perturbation operation
Generate random folds for cross-validation

Generate random folds for cross-validation
Tuning of the divergence based regression for compositional data with compositional data in the covariates side using the alpha-transformation

Tuning of the divergence based regression for compositional data with compositional data in the covariates side using the \(\alpha\)-transformation
Dirichlet random values simulation

Dirichlet random values simulation
Tuning the parameters of the regularised discriminant analysis

Tuning the parameters of the regularised discriminant analysis
Contour plot of the skew skew-normal distribution in S^2

Contour plot of the skew skew-normal distribution in \(S^2\)
Spatial median regression

Spatial median regression
Tuning of the bandwidth h of the kernel using the maximum likelihood cross validation

Tuning of the bandwidth h of the kernel using the maximum likelihood cross validation
Divergence based regression for compositional data

Divergence based regression for compositional data
Greenacre's power transformation

Greenacre's power transformation
Multivariate linear regression

Multivariate linear regression
Helper functions for the Kullback-Leibler regression

Helper functions for the Kullback-Leibler regression
Ridge regression

Ridge regression
Log-likelihood ratio test for a symmetric Dirichlet distribution

Log-likelihood ratio test for a symmetric Dirichlet distribution
Ternary diagram

Ternary diagram
Ridge regression plot

Ridge regression plot