This function allows filtering of genes and cells to be used in the RaceID3 analysis.
It also can perform batch effect correction using an internal method or a recently published alternative mnnCorrect from the batchelor package.
filterdata(
object,
mintotal = 3000,
minexpr = 5,
minnumber = 5,
LBatch = NULL,
knn = 10,
CGenes = NULL,
FGenes = NULL,
ccor = 0.4,
bmode = "RaceID",
verbose = TRUE
)An SCseq class object with filtered and normalized expression data.
SCseq class object.
minimum total transcript number required. Cells with less than mintotal transcripts are filtered out. Default is 3000.
minimum required transcript count of a gene in at least minnumber cells. All other genes are filtered out. Default is 5.
See minexpr. Default is 5.
List of experimental batches used for batch effect correction. Each list element contains a vector with cell names
(i.e. column names of the input expression data) falling into this batch. Default is NULL, i.e. no batch correction.
Number of nearest neighbors used to infer corresponding cell types in different batches. Defult is 10.
List of gene names. All genes with correlated expression to any of the genes in CGenes are filtered out for cell type inference.
Default is NULL.
List of gene names to be filtered out for cell type inference. Default is NULL.
Correlation coefficient used as a trehshold for determining genes correlated to genes in CGenes.
Only genes correlating less than ccor to all genes in CGenes are retained for analysis. Default is 0.4.
Method used for batch effect correction. Any of "RaceID","mnnCorrect". If mnnCorrect from the
batchelor package is desired, this package needs to be installed from bioconductor. Default is "RaceID".
logical. If FALSE then status output messages are disabled. Default is TRUE.
sc <- SCseq(intestinalDataSmall)
sc <- filterdata(sc)
Run the code above in your browser using DataLab