Learn R Programming

hyperSpec (version 0.98-20140523)

aggregate: aggregate hyperSpec objects

Description

aggregate Computes summary statistics for subsets of a hyperSpec object.

Usage

## S3 method for class 'hyperSpec':
aggregate(x,
    by = stop("by is needed"),
    FUN = stop("FUN is needed."), ..., out.rows = NULL,
    append.rows = NULL, by.isindex = FALSE,
    short = "aggregate", date = NULL, user = NULL)

Arguments

x
a hyperSpec object
by
grouping for the rows of x@data.

Either a list containing an index vector for each of the subgroups or a vector that can be split in such a list.

FUN
function to compute the summary statistics
out.rows
number of rows in the resulting hyperSpec object, for memory preallocation.
append.rows
If more rows are needed, how many should be appended?

Defaults to 100 or an estimate based on the percentage of groups that are still to be done, whatever is larger.

by.isindex
If a list is given in by: does the list already contain the row indices of the groups? If FALSE, the list in by is computed first (as in aggregate).
...
further arguments passed to FUN
short,date,user
aguments passed to logentry

Value

  • A hyperSpec object with an additional column @data$.aggregate tracing which group the rows belong to.

Details

aggregate applies FUN to each of the subgroups given by by. It combines the functionality of aggregate, tapply, and ave for hyperSpec objects.

aggregate avoids splitting x@data.

FUN does not need to return exactly one value. The number of returned values needs to be the same for all wavelengths (otherwise the result could not be a matrix), see the examples.

If the initially preallocated data.frame turns out to be too small, more rows are appended and a warning is issued.

See Also

tapply, aggregate, ave

Examples

Run this code
cluster.means <- aggregate (chondro, chondro$clusters, mean_pm_sd)
plot(cluster.means, stacked = ".aggregate", fill = ".aggregate",
     col = matlab.dark.palette (3))

## make some "spectra"
spc <- new ("hyperSpec", spc = sweep (matrix (rnorm (10*20), ncol = 20), 1, (1:10)*5, "+"))

## 3 groups
color <- c("red", "blue", "black")
by <- as.factor (c (1, 1, 1, 1, 1, 1, 5, 1, 2, 2))
by
plot (spc, "spc", col = color[by])

## Example 1: plot the mean of the groups
plot (aggregate (spc, by, mean), "spc", col = color, add = TRUE,
      lines.args = list(lwd = 3, lty = 2))

## Example 2: FUN may return more than one value (here: 3)
plot (aggregate (spc, by, mean_pm_sd), "spc",
      col = rep(color, each = 3), lines.args = list(lwd = 3, lty = 2))

## Example 3: aggregate even takes FUN that return different numbers of
##            values for different groups
plot (spc, "spc", col = color[by])

weird.function <- function (x){
   if (length (x) == 1)
      x + 1 : 10
   else if (length (x) == 2)
      NULL
   else
      x [1]
}

agg <- aggregate (spc, by, weird.function)
agg$.aggregate
plot (agg, "spc",  add = TRUE, col = color[agg$.aggregate],
      lines.args = list (lwd = 3, lty = 2))

Run the code above in your browser using DataLab