standardScreeningBinaryTrait and
standardScreeningNumericTrait. Given expression (or other) data from multiple independent
data sets, and the corresponding clinical traits or outcomes, the function calculates multiple screening
statistics in each data set, then calculates meta-analysis Z scores, p-values, and optionally q-values
(False Discovery Rates). Three different ways of calculating the meta-analysis Z scores are provided: the
Stouffer method, weighted Stouffer method, and using user-specified weights.metaAnalysis(multiExpr, multiTrait,
binary = NULL,
metaAnalysisWeights = NULL,
corFnc = cor, corOptions = list(use = "p"),
getQvalues = FALSE,
getAreaUnderROC = FALSE,
useRankPvalue = TRUE,
rankPvalueOptions = list(),
setNames = NULL,
kruskalTest = FALSE, var.equal = FALSE,
metaKruskal = kruskalTest, na.action = "na.exclude")checkSets). A vector of lists; in
each list there must be a component named data whose content
is a matrix or dataframe or array of didata
component of each component list can be either a vector or a data frame (matrix, array of dimension 2).TRUE) or continuous (FALSE)? If not given, the decision will
be made based on the content of multiTrait.multiExpr.rankPvalue function be used to obtain alternative
meta-analysis statistics?rankPvalue. These include
na.last (default "keep"), ties.method (default "average"),
calculateQvalue (defnames attribute of multiExpr. If
names(multiExpr) is NULLTRUE, the function will warn
the user that the returned test statistics will be different from the results of the standard
t.testTRUE) or Student t-test
(FALSE)?t.test.Z.Stouffer.equalWeightsp.Stouffer.equalWeights, only present if
getQvalues is TRUE.Z.DoFWeightsp.DoFWeights, only present if
getQvalues is TRUE.Z.DoFWeightsp.DoFWeights, only present if
getQvalues is TRUE.metaAnalysisWeights
are present.Z.userWeightsp.userWeights, only present if
getQvalues is TRUE.useRankPvalue is TRUE and contain the output
of the function rankPvalue with the same column weights as the above meta-analysis. Depending
on the input options calculateQvalue and pValueMethod in rankPvalueOptions, some
columns may be missing. The following columns are calculated using equal weights for each data set.RootDofWeights, DoFWeights, userWeights.standardScreeningBinaryTrait or
standardScreeningNumericTrait (depending on whether the input trait is binary or continuous).For binary traits, the following information is returned for each set:
qValues==TRUE)
q-value (local false discovery rate) based on the Student T-test p-value (Storey et al 2004).datExpr across
samples in the second group.datExpr
across samples in the
first group. Recall that SE(x)=sqrt(var(x)/n) where n is the number of non-missing values of x.datExpr
across samples in the second group.outx=TRUE (from Frank Harrel's
package Hmisc).kruskalTest is TRUE, the following columns further summarize results of
Kruskal-Wallis test:qValues==TRUE).pvalueStudent.Set1, pvalueStudent.Set2, ...
or from pvaluekruskal.Set1, pvaluekruskal.Set2, ..., depending on input metaKruskal.qValues==TRUE) q-values of the
correlations calculated from the p-valuessqrt(n), where n is the number of input data sets. We refer to this method as
Stouffer.equalWeights. In general, a better (i.e., more powerful) method of combining Z statistics is
to weigh them by the number of degrees of freedom (which approximately equals n). We refer to this
method as weightedStouffer. Finally, the user can also specify custom weights, for example if a data
set needs to be downweighted due to technical concerns; however, specifying own weights by hand should be
done carefully to avoid possible selection biases.Stouffer, S.A., Suchman, E.A., DeVinney, L.C., Star, S.A. & Williams, R.M. Jr. 1949. The American Soldier, Vol. 1: Adjustment during Army Life. Princeton University Press, Princeton.
A discussion of weighted Stouffer's method can be found in
Whitlock, M. C., Combining probability from independent tests: the weighted Z-method is superior to Fisher's approach, Journal of Evolutionary Biology 18:5 1368 (2005)
standardScreeningBinaryTrait, standardScreeningNumericTrait for screening
functions for individual data sets