Learn R Programming

BASiCS (version 1.1.0)

BASiCS_TestDE: Detection of genes with changes in expression

Description

Function to assess changes in expression between two groups of cells (mean and over-dispersion)

Usage

BASiCS_TestDE(Chain1, Chain2, EpsilonM = log2(1.5), EpsilonD = log2(1.5),
  ProbThresholdM = NULL, ProbThresholdD = NULL, OrderVariable = "Prob",
  GroupLabel1 = "Group1", GroupLabel2 = "Group2", Plot = TRUE,
  PlotOffset = TRUE, Offset = TRUE, EFDR_M = 0.1, EFDR_D = 0.1,
  GenesSelect = NULL, ...)

Arguments

Chain1

an object of class '>BASiCS_Chain containing parameter estimates for the first group of cells

Chain2

an object of class '>BASiCS_Chain containing parameter estimates for the second group of cells

EpsilonM

Minimum fold change tolerance threshold for detecting changes in overall expression (must be a positive real number). Default value: EpsilonM = log2(1.5) (i.e. 50% increase).

EpsilonD

Minimum fold change tolerance threshold for detecting changes in biological over-dispersion (must be a positive real number). Default value: EpsilonM = log2(1.5) (i.e. 50% increase).

ProbThresholdM

Optional parameter. Probdence threshold for detecting changes in overall expression (must be a positive value, between 0 and 1)

ProbThresholdD

Optional parameter. Probdence threshold for detecting changes in cell-to-cell biological over-dispersion (must be a positive value, between 0 and 1)

OrderVariable

Ordering variable for output. Possible values: 'GeneIndex', 'GeneName' and 'Prob'.

GroupLabel1

Label assigned to reference group. Default: GroupLabel1 = 'Group1'

GroupLabel2

Label assigned to reference group. Default: GroupLabel2 = 'Group2'

Plot

If Plot = TRUE, MA and volcano plots are generated.

PlotOffset

If Plot = TRUE, the offset effect is visualised.

Offset

Optional argument to remove a fix offset effect (if not previously removed from the MCMC chains). Default: Offset = TRUE.

EFDR_M

Target for expected false discovery rate related to the comparison of means. Default EFDR_M = 0.10.

EFDR_D

Target for expected false discovery rate related to the comparison of dispersions. Default EFDR_D = 0.10.

GenesSelect

Optional argument to provide a user-defined list of genes to be considered for the comparison. Default: GenesSelect = NULL. When used, this argument must be a vector of TRUE (include gene) / FALSE (exclude gene) indicator, with the same length as the number of intrinsic genes and following the same order as how genes are displayed in the table of counts. This argument is necessary in order to have a meaningful EFDR calibration when the user decides to exclude some genes from the comparison.

...

Graphical parameters (see par).

Value

BASiCS_TestDE returns a list of 4 elements:

TableMean

A data.frame containing the results of the differential mean test

GeneName

Gene name

MeanOverall

For each gene, the estimated mean expression parameter \(\mu_i\) is averaged across both groups of cells (weighted by sample size).

Mean1

Estimated mean expression parameter \(\mu_i\) for each biological gene in the first group of cells.

Mean2

Estimated mean expression parameter \(\mu_i\) for each biological gene in the second group of cells.

MeanFC

Fold change in mean expression parameters between the first and second groups of cells.

MeanLog2FC

Log2-transformed fold change in mean expression between the first and second groups of cells.

ProbDiffMean

Posterior probability for mean expression difference between the first and second groups of cells.

ResultDiffExp

Indicator if a gene has a higher mean expression in the first or second groups of cells.

TableDisp

A data.frame containing the results of the differential dispersion test (excludes genes for which the mean does not changes).

GeneName

Gene name

MeanOverall

For each gene, the estimated mean expression parameter \(\mu_i\) is averaged across both groups of cells (weighted by sample size).

DispOverall

For each gene, the estimated over-dispersion parameter \(\delta_i\) is averaged across both groups of cells (weighted by sample size).

Disp1

Estimated over-dispersion parameter \(\delta_i\) for each biological gene in the first group of cells.

Disp2

Estimated over-dispersion parameter \(\delta_i\) for each biological gene in the second group of cells.

DispFC

Fold change in over-dispersion parameters between the between the first and second groups of cells.

DispLog2FC

Log-transformed fold change in over-dispersion between the first and second groups of cells.

ProbDiffDisp

Posterior probability for over-dispersion difference between the first and second groups of cells.

ResultDiffDisp

Indicator if a gene has a higher over-dispersion in the first or second groups of cells.

DiffExpSummary

A list containing the following information for the differential mean expression test:

ProbThreshold

Posterior probability threshold.

EFDR

Expected false discovery rate for the given thresholds.

EFNR

Expected false negative rate for the given thresholds.

DiffOverDispSummary

A list containing the following information for the differential over-dispersion test:

ProbThreshold

Posterior probability threshold.

EFDR

Expected false discovery rate for the given thresholds.

EFNR

Expected false negative rate for the given thresholds.

Chain1_offset

an BASiCS_Chain object: Chain1 after offset removal.

Chain2_offset

an BASiCS_Chain object: Chain2 after offset removal (this is only provided for completeness; Chain2 is not affected by the offset).

OffsetChain

MCMC chain calculated for the offset effect.

Offset

Estimated offset (posterior median of OffsetChain). Default value set equal to 1 when offset correction is not performed.

References

Vallejos, Richardson and Marioni (2016). Genome Biology.

Examples

Run this code
# NOT RUN {
# Loading two 'BASiCS_Chain' objects (obtained using 'BASiCS_MCMC')
data(ChainSC)
data(ChainRNA)

Test <- BASiCS_TestDE(Chain1 = ChainSC, Chain2 = ChainRNA,
                      GroupLabel1 = 'SC', GroupLabel2 = 'P&S',
                      EpsilonM = log2(1.5), EpsilonD = log2(1.5), 
                      OffSet = TRUE)
                      
# Results for the differential mean test
head(Test$TableMean)

# Results for the differential over-dispersion test
# This only includes genes marked as 'NoDiff' in Test$TableMean 
head(Test$TableDisp)                    

# }

Run the code above in your browser using DataLab