This function also can correct for the GC content and length of each genomic region. Users who wish to enable this correction must provide a separate annotation file. This tab-separated file should contain a row for each genomic region. The first column should contain a unique identifier that corresponds to identifiers from the read-count input file. The second column should indicate the length of the genomic region. And the third column should specify the number of G or C bases in the region. The ParseMetaFromGtfFile
function can be used to generate annotation files.
UPC_RNASeq(inFilePattern, annotationFilePath = NA, outFilePath = NA, modelType = "nn", convThreshold = 0.01, ignoreZeroes = FALSE, numDataHeaderRows=0, numAnnotationHeaderRows=0, batchFilePath=NA, verbose = TRUE)
ExpressionSet
object that contains a row for each probeset/gene/transcript and a column for each input file.
## Not run:
# result = UPC_RNASeq("ReadCounts.txt", "Annotation.txt")
# ## End(Not run)
Run the code above in your browser using DataLab