It is suited for handling difficult problems posed by high-dimensional multivariate datasets ($p \gg n$ paradigm), such as in omics-type data, among which are that the variance is often a function of the mean, variable-specific estimators of variances are not reliable, and tests statistics have low powers due to a lack of degrees of freedom. Key features include:
MVR.news
Function to display the NEWS file of the
mvr
(*)
Function for Mean-Variance Regularization and Variance Stabilization.
End-user function for Mean-Variance Regularization (MVR) and Variance Stabilization
by similarity statistic under sample group homoscedasticity or heteroscedasticity assumption.
The function takes advantage of the R package
mvrt.test
(*)
Function for Computing Mean-Variance Regularized T-test Statistic and Its Significance.
End-user function for computing MVR t-test statistic and its significance (p-value)
under sample group homoscedasticity or heteroscedasticity assumption.
The function takes advantage of the R package
cluster.diagnostic
Function for Plotting Summary Cluster Diagnostic Plots.
Plot similarity statistic profiles and the optimal joint clustering configuration for the means and the variances by group.
Plot quantile profiles of means and standard deviations by group and for each clustering configuration, to check that the distributions
of first and second moments of the MVR-transformed data approch their respective null distributions under the optimal configuration found,
assuming independence and normality of all the variables.
target.diagnostic
Function for Plotting Summary Target Moments Diagnostic Plots.
Plot comparative distribution densities of means and standard deviations of the data before and after Mean-Variance Regularization
to check for location shifts between observed first and second moments and their expected target values under a target centered homoscedastic model.
Plot comparative QQ scatterplots to look at departures between observed distributions of first and second moments of the MVR-transformed data
and their theoretical distributions assuming independence and normality of all the variables.
stabilization.diagnostic
Function for Plotting Summary Variance Stabilization Diagnostic Plots.
Plot comparative variance-mean plots to check the variance stabilization across variables before and after Mean-Variance Regularization.
normalization.diagnostic
Function for Plotting Summary Normalization Diagnostic Plots.
Plot comparative Box-Whisker and Heatmap plots of variables across samples check the effectiveness of normalization before and after
Mean-Variance Regularization.
A Real
dataset coming from a quantitative proteomics experiment, consisting of $n=6$
samples split into a control ("M") and a treated group ("S") with $p=9052$
unique peptides or predictor variables. This is a balanced design with two sample groups ($G=2$),
under unequal sample group variance.
A Synthetic
dataset with $n=10$ observations (samples) and $p=200$ variables,
where $nvar=40$ of them are significantly different between the two sample groups.
This is a balanced design with two sample groups ($G=2$), under unequal sample group variance.
makeCluster
(R packagejustvsn
(R packageeBayes
(R packagesamr
(R packagematest
(R packageebam
(R packagebayesT
Hierarchical Bayesian Regularized t-test statisticBaldi et al., 2001