deds.stat.linkC integrates different statistics of differential
expression (DE) to rank and select a set of DE genes.
deds.stat.linkC(X, L, B = 1000, tests = c("t", "fc", "sam"), tail =
c("abs", "lower", "higher"), extras = NULL, distance = c("weuclid",
"euclid"), adj = c("fdr", "adjp"), nsig = nrow(X), quick = TRUE)read.table.B should be 0 (zero) or any number not less than the total
number of permutations.test could be any of the
following:
"t": |
| one or two sample t-statistics; |
"f": |
| F-statistics; |
"fc": |
| fold changes among classes; |
"sam": |
| SAM-statistics; |
"modt": |
| moderated t-statistics; |
"modt": |
| moderated F-statistics; |
side="abs", two-tailed tests, the null hypothesis is
rejected for large absolute values of the test statistic.
If side="higher", one-tailed tests, the null hypothesis
is rejected for large values of the test statistic.
If side="lower", one-tailed tests, the null hypothesis is
rejected for small values of the test statistic.
deds.genExtra.distance="weuclid", weighted euclidean distance, the
weight for statistic $t$ is $1/MAD(t)$;
If distance="euclid", euclidean distance.
adj="fdr", False Discovery Rate is controled and $q$
values are returned.
If adj="adjp", ajusted $p$ values that controls family wise
type I error rate are returned.adj = "fdr", nsig specifies the number of top
differentially expressed genes whose $q$ values will be calculated; we recommend
setting nsig < m, as the computation of $q$ values will be extensive. $q$ values
for the rest of genes will be approximated to 1. If adj = "adjp", the
calculation of the adjusted $p$ values will be for the whole dataset.quick=TRUE,
permutation will be carried out once and stored in memory; If
quick=FALSE a fixed seeded sampling procedure will be
employed, which requires more computation time as the permutation
will be carried out twice, but will not use extra memory for storage.DEDS. See DEDS-class.
deds.stat.linkC summarizes multiple statistical measures for the
evidence of DE. The DEDS methodology treats each gene as
a point corresponding to a gene's vector of DE measures. An "extreme
origin" is defined as the maxima of all statistics and the
distance from all points to the extreme is computed and ranking of
a gene for DE is determined by the closeness of the gene to the
extreme. To determine a cutoff for declaration of DE, null referent
distributions are generated by permuting the data matrix. Statistical measures currently in the DEDS package include t statistics
(tests="t"), fold changes (tests="fc"), F
statistics (tests="f"), SAM (tests="sam"), moderated
t (tests="modt"), moderated F statistics
(tests="modf"), and B statistics (tests="B"). The
function deds.stat.linkC interfaces to C functions for the
tests and the computation of DEDS. For more flexibility, the user can
also use deds.stat which has the same functionality as
deds.stat.linkC but is written completely in R (therefore
slower) and the user can supply their own function for a statistic
not covered in the DEDS package.
DEDS can also summarize p values from different statistical models, see
deds.pval.
deds.pval, deds.stat.X <- matrix(rnorm(1000,0,0.5), nc=10)
L <- rep(0:1,c(5,5))
# genes 1-10 are differentially expressed
X[1:10,6:10]<-X[1:10,6:10]+1
# DEDS summarizing t, fc and sam
d <- deds.stat.linkC(X, L, B=200)
Run the code above in your browser using DataLab