Function to integrate and combine multiple independent studies measured on the same variables or predictors (P-integration) using a multigroup Principal Component Analysis.
mint.pca(X,
ncomp = 2,
study,
scale = TRUE,
tol = 1e-06,
max.iter = 100
)
numeric matrix of predictors combining multiple independent studies on the same set of predictors. NA
s are allowed.
Number of components to include in the model (see Details). Default to 2
factor indicating the membership of each sample to each of the studies being combined
boleean. If scale = TRUE, each block is standardized
to zero means and unit variances. Default = TRUE
.
Convergence stopping value.
integer, the maximum number of iterations.
mint.pca
returns an object of class "mint.pca", "pca"
, a list
that contains the following components:
the centered and standardized original predictor matrix.
the number of components included in the model.
The study grouping factor
the eigenvalues of the covariance/correlation matrix, though the calculation is actually done with the singular values of the data matrix or by using NIPALS.
the centering and scaling used, or FALSE
.
the matrix of variable loadings (i.e., a matrix whose columns contain the eigenvectors).
same as 'rotation' to keep the mixOmics spirit
the value of the rotated data (the centred (and scaled if requested) data multiplied by the rotation/loadings matrix), also called the principal components.
same as 'x' to keep the mixOmics spirit
explained variance from the multivariate model, used for plotIndiv
list containing the names to be used for individuals and variables.
mint.pca
fits a vertical PCA model with ncomp
components in which several independent studies measured on the same variables are integrated. The study
factor indicates the membership of each sample in each study. We advise to only combine studies with more than 3 samples as the function performs internal scaling per study.
Missing values are handled by being disregarded during the cross product computations in the algorithm without having to delete rows with missing data. Alternatively, missing data can be imputed prior using the nipals
function.
Useful graphical outputs are available, e.g. plotIndiv
, plotLoadings
, plotVar
.
Rohart F, Eslami A, Matigian, N, Bougeard S, Le Cao K-A (2017). MINT: A multivariate integrative approach to identify a reproducible biomarker signature across multiple experiments and platforms. BMC Bioinformatics 18:128.
Eslami, A., Qannari, E. M., Kohler, A., and Bougeard, S. (2014). Algorithms for multi-group PLS. J. Chemometrics, 28(3), 192-201.
spls
, summary
,
plotIndiv
, plotVar
, predict
, perf
, mint.spls
, mint.plsda
, mint.splsda
and http://www.mixOmics.org/mixMINT for more details.
# NOT RUN {
data(stemcells)
res = mint.pca(X = stemcells$gene, ncomp = 3,
study = stemcells$study)
plotIndiv(res, group = stemcells$celltype, legend=TRUE)
# }
Run the code above in your browser using DataLab