load.data(chip.data.list, conds, consensus.site, input.data.list = NULL, data.type = "MCS", chr.vec = NULL, chr.exclusion = NULL, chr.len.vec = NULL, norm.factor.vec = NULL, frag.len = 200)
site.merge
.
Filtering of chromosomes can be achieved through specification of chr.vec
and/or chr.exclusion
. Only reads from chromosomes in chr.vec
but not in chr.exclusion
are utilized in the analysis.
User can include or exclude sex chromosomes in the computation, depending on whether protein-DNA bindings on sex chromosomes are of research interest.
Biological replicates of a ChIP sample should be kept separate so that dispersion can be properly estimated. On the other hand, replicates of a control/input sample should be merged because the purpose of the control samples is to estimate the background for testing and plotting. One exception would be when a control replicate is paired with a ChIP replicate, for example, they are coming from the same batch, a portion of which is used for IP and the other portion is used for control. In such case, the control replicate can be kept separate with the same name of the matching ChIP replicate.
data.type
MCS
Minimum ChIP-Seq format. data.frame with fields: chr (factor), pos (integer) and strand (factor, "+" and "-"). pos is 5' location. This is different from eland default which use 3' location for reverse strand.
AlignedRead
from Bioconductor ShortRead package (with support of commonly used formats, including Eland, MAQ, Bowtie, SOAP and BAM).
BED
with at least first 6 fields (chrom, start, end, name, score and strand), http://genome.ucsc.edu/FAQ/FAQformat.html#format1.
DBChIP
.
data("PHA4")
conds <- factor(c("emb","emb","L1", "L1"), levels=c("emb", "L1"))
bs.list <- read.binding.site.list(binding.site.list)
## compute consensus site
consensus.site <- site.merge(bs.list, in.distance=100, out.distance=250)
#load data
dat <- load.data(chip.data.list=chip.data.list, conds=conds, consensus.site=consensus.site, input.data.list=input.data.list, data.type="MCS")
names(dat)
Run the code above in your browser using DataLab