Learn R Programming

mixOmics (version 6.3.2)

Omics Data Integration Project

Description

Multivariate methods are well suited to large omics data sets where the number of variables (e.g. genes, proteins, metabolites) is much larger than the number of samples (patients, cells, mice). They have the appealing properties of reducing the dimension of the data by using instrumental variables (components), which are defined as combinations of all variables. Those components are then used to produce useful graphical outputs that enable better understanding of the relationships and correlation structures between the different data sets that are integrated. mixOmics offers a wide range of multivariate methods for the exploration and integration of biological datasets with a particular focus on variable selection. The package proposes several sparse multivariate models we have developed to identify the key variables that are highly correlated, and/or explain the biological outcome of interest. The data that can be analysed with mixOmics may come from high throughput sequencing technologies, such as omics data (transcriptomics, metabolomics, proteomics, metagenomics etc) but also beyond the realm of omics (e.g. spectral imaging). The methods implemented in mixOmics can also handle missing values without having to delete entire rows with missing data. A non exhaustive list of methods include variants of generalised Canonical Correlation Analysis, sparse Partial Least Squares and sparse Discriminant Analysis. Recently we implemented integrative methods to combine multiple data sets: N-integration with variants of Generalised Canonical Correlation Analysis and P-integration with variants of multi-group Partial Least Squares.

Copy Link

Version

Install

install.packages('mixOmics')

Monthly Downloads

290

Version

6.3.2

License

GPL (>= 2)

Maintainer

Last Published

June 1st, 2018

Functions in mixOmics (6.3.2)

circosPlot

circosPlot for DIABLO
explained_variance

Calculation of explained variance
mint.block.spls

NP-integration for integration with variable selection
mint.block.splsda

NP-integration with Discriminant Analysis and variable selection
map

Classification given Probabilities
logratio.transfo

Log-ratio transformation
mat.rank

Matrix Rank
nipals

Non-linear Iterative Partial Least Squares (NIPALS) algorithm
mint.splsda

P-integration with Discriminant Analysis and variable selection
liver.toxicity

Liver Toxicity Data
mint.pls

P-integration
mint.pca

P-integration with Principal Component Analysis
linnerud

Linnerud Dataset
network

Relevance Network for (r)CCA and (s)PLS regression
mixOmics

PLS-derived methods: one function to rule them all!
ipca

Independent Principal Component Analysis
plotDiablo

Graphical output for the DIABLO framework
mint.plsda

P-integration with Projection to Latent Structures models (PLS) with Discriminant Analysis
plotIndiv

Plot of Individuals (Experimental Units)
mint.spls

P-integration with variable selection
plotLoadings

Plot of Loading vectors
plot.perf

Plot for model performance
pcatune

Tune the number of principal components in PCA
mint.block.pls

NP-integration
plot.tune

Plot for model performance
perf

Compute evaluation criteria for PLS, sPLS, PLS-DA, sPLS-DA, MINT and DIABLO
plot.rcc

Canonical Correlations Plot
mint.block.plsda

NP-integration with Discriminant Analysis
rcc

Regularized Canonical Correlation Analysis
selectVar

Output of selected variables
srbct

Small version of the small round blue cell tumors of childhood data
print

Print Methods for CCA, (s)PLS, PCA and Summary objects
study_split

divides a data matrix in a list of matrices defined by a factor
plotVar

Plot of Variables
vip

Variable Importance in the Projection (VIP)
stemcells

Human Stem Cells Data
plotArrow

Arrow sample plot
predict

Predict Method for (mint).(block).(s)pls(da) methods
vac18.simulated

Simulated data based on the vac18 study for multilevel analysis
multidrug

Multidrug Resistence Data
pls

Partial Least Squares (PLS) Regression
nearZeroVar

Identification of zero- or near-zero variance predictors
summary

Summary Methods for CCA and PLS objects
nutrimouse

Nutrimouse Dataset
plsda

Partial Least Squares Discriminant Analysis (PLS-DA).
tune

Generic function to choose the parameters in the different methods in mixOmics
withinVariation

Within matrix decomposition for repeated measurements (cross-over design)
tune.block.splsda

Tuning function for block.splsda method (N-integration with sparse Discriminant Analysis)
tune.splsda

Tuning functions for sPLS-DA method
tune.splslevel

Tuning functions for multilevel sPLS method
wrapper.rgcca

mixOmics wrapper for Regularised Generalised Canonical Correlation Analysis (rgcca)
tune.mint.splsda

Estimate the parameters of mint.splsda method
wrapper.sgcca

mixOmics wrapper for Sparse Generalised Canonical Correlation Analysis (sgcca)
tune.pca

Tune the number of principal components in PCA
sipca

Independent Principal Component Analysis
spca

Sparse Principal Components Analysis
unmap

Dummy matrix for an outcome factor
yeast

Yeast metabolomic study
pca

Principal Components Analysis
spls

Sparse Partial Least Squares (sPLS)
splsda

Sparse Partial Least Squares Discriminant Analysis (sPLS-DA)
tune.rcc

Estimate the parameters of regularization for Regularized CCA
tune.spls

Tuning functions for sPLS method
vac18

Vaccine study Data
breast.TCGA

Breast Cancer multi omics data from TCGA
block.splsda

N-integration and feature selection with Projection to Latent Structures models (PLS) with sparse Discriminant Analysis
background.predict

Calculate prediction areas
block.pls

N-integration with Projection to Latent Structures models (PLS)
breast.tumors

Human Breast Tumors Data
Koren.16S

16S microbiome atherosclerosis study
estim.regul

Estimate the parameters of regularization for Regularized CCA
block.spls

N-integration and feature selection with sparse Projection to Latent Structures models (sPLS)
image.estim.regul

Plot the cross-validation score.
image

Plot the cross-validation score.
cimDiablo

Clustered Image Maps (CIMs) ("heat maps") for DIABLO
cim

Clustered Image Maps (CIMs) ("heat maps")
color.jet

Color Palette for mixOmics
diverse.16S

16S microbiome data: most diverse bodysites from HMP
imgCor

Image Maps of Correlation Matrices between two Data Sets
get.confusion_matrix

Create confusion table and calculate the Balanced Error Rate
auroc

Area Under the Curve (AUC) and Receiver Operating Characteristic (ROC) curves for supervised classification
block.plsda

N-integration with Projection to Latent Structures models (PLS) with Discriminant Analysis