Function updates a lm.rrpp fit to add $MANOVA, which like
$ANOVA, provides statistics
or matrices typically associated with multivariate analysis
of variance (MANOVA).
MANOVA statistics or sums of squares and cross-products (SSCP) matrices
are calculated over the random permutations of a lm.rrpp
fit. SSCP matrices are
computed, as are the inverse of R times H (invR.H), where R is a SSCP
for the residuals or random effects and H is
the difference between SSCP matrices of full and reduced models
(see below). From invR.H,
MANOVA statistics are calculated, including Roy's maximum root
(eigenvalue), Pillai trace, Hotelling-Lawley trace,
and Wilks lambda (via summary.manova.lm.rrpp
).
The manova.update to add $MANOVA to lm.rrpp
fits
requires more computation time than the $ANOVA
statistics that are computed automatically in lm.rrpp
.
Generally, the same inferential conclusions will
be found with either approach, when observations outnumber response
variables. For high-dimensional data (more variables
than observations) data are projected into a Euclidean space of
appropriate dimensions (rank of residual covariance matrix).
One can vary the tolerance for eigenvalue decay or specify the number
of PCs, if a smaller set of PCs than the maximum is desired.
This is advised if there is strong correlation among variables
(the data space could be simplified to fewer dimensions), as spurious
results are possible. Because distributions of MANOVA stats
can be generated from the random permutations,
there is no need to approximate F-values, like with parametric MANOVA.
By restricting analysis to the real, positive eigenvalues calculated,
all statistics can be calculated (but Wilks lambda, as a product but
not a trace, might be less reliable as variable number approaches
the number of observations).
ANOVA vs. MANOVA
Two SSCP matrices are calculated for each linear model effect, for
every random permutation: R (Residuals or Random effects) and
H, the difference between SSCPs for "full" and "reduced" models.
(Full models contain and reduced models lack
the effect tested; SSCPs are hypothesized to be the same under
a null hypothesis, if there is no effect. The
difference, H, would have a trace of 0 if the null hypothesis
were true.) In RRPP, ANOVA and MANOVA correspond to
two different ways to calculate statistics from R and H matrices.
ANOVA statistics are those that find the trace of R and H SSCP
matrices before calculating subsequent statistics,
including sums of squares (SS), mean squares (MS), and F-values.
These statistics can be calculated with univariate data
and provide univariate-like statistics for multivariate data.
These statistics are dispersion measures only (covariances
among variables do not contribute) and are the same as "distance-based"
stats proposed by Goodall (1991) and Anderson (2001).
MANOVA stats require multivariate data and are implicitly
affected by variable covariances. For MANOVA, the inverse of R times
H (invR.H) is first calculated for each effect, then eigen-analysis
is performed on these matrix products. Multivariate
statistics are calculated from the positive, real eigenvalues.
In general, inferential
conclusions will be similar with either approach, but effect
sizes might differ.
Two important differences between manova.update and
summary.manova
(for lm
objects)
are that manova.update
does not attempt to normalize residual SSCP matrices (unneeded
for non-parametric statistical solutions) and (2) uses a generalized
inverse of the residual SSCP, if needed, when the number of
variables could render eigen-analysis problematic. This approach
is consistent
with covariance regularization methods that attempt to make covariance
matrices positive-definite for calculating model likelihoods or
multivariate statistics. If the number of observations far exceeds
the number of response variables, observed statistics from
manova.update and
summary.manova
will be quite similar. If the
number of response variables approaches or exceeds the number
of observations, manova.update
statistics will be much more reliable.
ANOVA tables are generated by anova.lm.rrpp
on
lm.rrpp fits and MANOVA tables are generated
by summary.manova.lm.rrpp
, after running
manova.update on lm.rrpp fits.
Currently, mixed model effects are only possible with
$ANOVA statistics, not $MANOVA.
More detail is found in the vignette, ANOVA versus MANOVA.