Learn R Programming

SCAN.UPC (version 2.14.0)

UPC_TwoColor: Universal exPression Codes (UPC) for two-channel microarrays

Description

This function is used to normalize two-channel expression microarrays (from Agilent) using the Universal exPression Codes (UPC) approach. In raw form, such microarray data come in the form of tab-separate data files.

Usage

UPC_TwoColor(inFilePattern, outFilePath = NA, modelType="nn", convThreshold=0.01, batchFilePath = NA, verbose = TRUE)

Arguments

inFilePattern
Absolute or relative path to the input file to be processed. To process multiple files, wildcard characters can be used (e.g., "*.txt"). Alternatively, a Gene Expression Omnibus identifier (e.g., GSE39655 or GSM1072833) can be specified.(This is the only required parameter.)
outFilePath
Absolute or relative path where the output file will be saved. (This parameter is optional.)
modelType
Various models can be used for the mixture model to differentiate between active and inactive probes. The default is the normal-normal model (``nn''), which uses the normal distribution. Other available options are log-normal (``ln''), negative-binomial (``nb''), and normal-normal Bayes (``nn_bayes'').
convThreshold
Convergence threshold that determines at what point the mixture-model parameters have stabilized. The default value should be suitable in most cases. However, if the model fails to converge, it may be useful to adjust this value. Optional.
batchFilePath
Absolute or relative path to a tab-separated text file that indicates batch (and optionally, covariate information) for each sample. Optional.
verbose
Whether to output more detailed status information as files are processed. Default is TRUE.

Value

A list is returned, containing two elements: a matrix containing UPC values and a vector of probe names that correspond to each row of the matrix. The matrix will contain two columns---one corresponding to each channel---for each sample. When the array design uses duplicate probe names (this is common for control probes), the vector of probe names will also contain duplicates.

References

Piccolo SR, Withers MR, Francis OE, Bild AH and Johnson WE. Multi-platform single-sample estimates of transcriptional activation. Proceedings of the National Academy of Sciences of the United States of America, 2013, 110:44 17778-17783.

Examples

Run this code
## Not run: 
# # Normalize a file from GEO and save output to a file
# result = UPC_TwoColor("GSM1072833", "output_file.txt")
# ## End(Not run)

Run the code above in your browser using DataLab