numeric matrix, or any data object that can be processed by getEAWP containing log-expression values for a series of samples.
Rows correspond to probes and columns to samples.
batch
factor or vector indicating batches.
batch2
factor or vector indicating batches.
covariates
matrix or vector of covariates to be adjusted for.
design
optional design matrix relating to treatment conditions to be preserved
A numeric matrix of log-expression values with batch and covariate effects removed.
Details
This function is useful for removing batch effects, associated with hybridization time or other technical variables, prior to clustering or unsupervised analysis such as PCA, MDS or heatmaps.
It is not intended to use with linear modelling.
For linear modelling, it is better to include the batch factors in the linear model.
The design matrix is used to describe comparisons between the samples, for example treatment effects, which should not be removed.
The function (in effect) fits a linear model to the data, including both batches and regular treatments, then removes the component due to the batch effects.
The data object x can be of any class for which lmFit works.
If x contains weights, then these will be used in estimating the batch effects.