Apply centrality-extented ORA on a list of pathways
cepa.ora.all(dif, pc, bk = NULL, cen = default.centralities,
cen.name = sapply(cen, function(x) ifelse(mode(x) == "name", deparse(x), x)),
iter = 1000)
A cepa.all
class object
differential gene list
a pathway.catalogue
class object
background gene list. If background gene list are not specified, use whole human genes
centrality measuments, it can ce a string, or a function
centrality measurement names. By default it is parsed from cen
argument
number of simulations
Zuguang Gu <z.gu@dkfz.de>
The traditional over-representation analysis (ORA) to find significant pathways uses a 2x2 contingency table to test the independency of genes belonging to a functional category and these genes being differentially expressed, usually by Fisher's exact test. The ORA only consider the number of genes and the function extend traditional ORA with network centralities.
The differential gene list and the background gene list should be indicated
with the same identifiers (e.g. gene symbol or refseq ID). All genes in
the differential gene list should exist in the background gene list. If users
use the PID.db
data, all genes should be formatted in gene symbol.
If the centrality measurement is set as a string, only pre-defined "equal.weight", "in.degree", "out.degree", "degree", "betweenness", "in.reach", "out.reach", "reach", "in.spread", "out.spread" and "spread" are allowed. More centrality measurements can be used by setting it as a function (such as closeness, cluster coefficient). In the function, we recommand users choose at least two centrality measurements. The default centralities are "equal.weight", "in.degree", "out.degree", "betweenness", "in.reach" and "out.reach".
However, in most circumstance, the function is called by cepa.all
.
if (FALSE) {
data(PID.db)
# ORA extension
data(gene.list)
# will spend about 20 min
res.ora = cepa.ora.all(dif = gene.list$dif, bk = gene.list$bk, pc = PID.db$NCI)
}
Run the code above in your browser using DataLab