boxplotPara(object,
nSample=if(length(object)> 200) nSample<-200 else nSample <- length(object),
iqrMethod=TRUE, percent=0.05,
typDef="mean", plot=TRUE,
plotAllBoxes=TRUE,
cluster, verbose = getOption("verbose"))
character
vector with the names of CEL files
OR a (partitioned) list of character
vectors with CEL file names..affyParaInternalEnv$cl
will be used.TRUE
it writes out some messages. default: getOption("verbose")QualityPS
, values_boxP
and results_boxP
.qualityPS
as matrix, which contains only the Arrays that are considered as "bad" quality and in which levels are they classified. Possible values are 0 if the Array is not at this levels and 1 if it is classified as "bad" sample at this levelboxplotPara
is the parallelized function for box plots of probe intensities.
It is a function to check and control the Data quality of the samples using the boxplot methode.
For serial function an more details see boxplot. This function is optimized for huge numbers of microarray data.
For using this function a computer cluster using the SNOW package has to be started.
Starting the cluster with the command makeCluster
generates an cluster object in the affyPara environment (.affyParaInternalEnv) and
no cluster object in the global environment. The cluster object in the affyPara environment will be used as default cluster object,
therefore no more cluster object handling is required.
The makeXXXcluster
functions from the package SNOW can be used to create an cluster object in the global environment and
to use it for the preprocessing functions.
We need to calculate a default Sample as reference , which has been built from all Samples data.
Therefore the first is the calculation of the boxplot.stats, it will be made parallel at the cluster.
The calculated values are merged at the master as well as the following calculations, plots and histograms
There are two possibility to calculate the limits between the "good-bad" quality Samples:
1. From the differences between defaultSample values (only media, HL nad HU will be considered) and all the samples.
At limit will be considered as critical and it help to calculate the "bad" quality Samples.
(it is fixed as parameter and thus do it not so sure)
2. From the median and IQR obtained from a boxplot, which is calculated from all Samples values.
The outliers of these boxplot are the "bad" quality Samples. It should be as default parameter.library(affyPara)
if (require(affydata)) {
data(Dilution)
makeCluster(3)
##boxplot of Dulution data (affybatch)
box1 <- boxplotPara(Dilution)
## boxplots to a pdf file
pdf(file="boxplot.pdf", title="AffyBatch Boxplot")
box2 <- boxplotPara(Dilution)
dev.off()
stopCluster()
}
Run the code above in your browser using DataLab