DGEList-class: Digital Gene Expression data - class
Description
A list-based S4 class for storing read counts and associated information from digital gene expression or sequencing technologies.
List Components
For objects of this class, rows correspond to genomic features and columns to samples.
The genomic features are called genes, but in reality might correspond to transcripts, tags, exons etc.
Objects of this class contain the following essential list components:
counts
:- numeric matrix of read counts, one row for each gene and one column for each sample.
samples
:- data.frame with a row for each sample and columns
group
, lib.size
and norm.factors
containing the group labels, library sizes and normalization factors.
Other columns can be optionally added to give more detailed sample information.
Optional components include:
genes
:- data.frame giving annotation information for each gene. Same number of rows as
counts
. AveLogCPM
:- numeric vector giving average log2 counts per million for each gene.
common.dispersion
:- numeric scalar giving the overall dispersion estimate.
trended.dispersion
:- numeric vector giving trended dispersion estimates for each gene.
tagwise.dispersion
:- numeric vector giving tagwise dispersion estimates for each gene (note that `tag' and `gene' are synonymous here).
offset
:- numeric matrix of same size as
counts
giving offsets for use in log-linear models.
Methods
This class inherits directly from class list
, so DGEList
objects can be manipulated as if they were ordinary lists.
However they can also be treated as if they were matrices for the purposes of subsetting. The dimensions, row names and column names of a DGEList
object are defined by those of counts
, see dim.DGEList
or dimnames.DGEList
. DGEList
objects can be subsetted, see subsetting
. DGEList
objects also have a show
method so that printing produces a compact summary of their contents.