bwplot.by.var(xmat, log = FALSE, wend = 0.05, notch = FALSE,
xlab = "Measured Variables", ylab = "Reported Values",
main = "", label = NULL, plot.order = NULL, xpos = NA,
las = 1, cex = 1, adj = 0.5, colr = 8, pch = 3, ...)
log = TRUE
.wend = 0.02
plots the whisker ends at the 2nd and 98th percentiles. See Details below.notch = TRUE
.xlab = "Measured Variables"
.ylab = "Reported Values"
.main
, e.g., main = "Kola Project, 1995"
.label = c("Alt1", "Alt2", "Alt3")
.plot.order = c(2, 1, 3)
will plot the 2nd alphabetically ordered factor in the 1st position, the 1st in tNA
, which causes default equally spaced positions to be used, i.e. boxplot 1 plots at value 1 on the x-axis, boxplot 2 at value 2, elas = 1
, or are written down from the x-axis by setting las = 2
. See also, Details below.1
, however, if the labels are long it is sometimes necessary to use a smaller font, for example cex = 0.8
results in a font 80adj = 0.5
, to left justify them if the labels are written downwards set adj = 0
.colr = 8
. If no infill is required, set colr = 0
. See display.lty
for the range of available colours.pch =
3, alternate plotting symbols may be chosen from those displayed by display.marks
.xmat
. However, if there are factor variables, or only a subset of the variables are to be displayed, the data are entered via the cbind
construct, see Examples below.
In a box-and-whisker plot there are two special cases. When wend = 0
the whiskers extend to the observed minima and maxima that are not plotted with the plus symbol. When wend = 0.25
no whiskers or the data minima and maxima are plotted, only the medians and boxes representing the span of the middle 50% of the data are displayed.
Long variable names can lead to display problems, changing the las
parameter from its default of las = 1
which plots subset labels parallel to the axis to las = 2
, to plot perpendicular to the axis, can help. It may also help to use label
and split the character string into two lines, e.g., by changing the string "Specific Conductivity"
that was supplied to replace the variable name SC
to "Specific\nConductivity"
. If this, or setting las = 2
, causes a conflict with the x-axis title, if one is needed, the title can be moved down a line by using xlab = "\nPhysical soil properties"
. In both cases the \n
forces the following text to be placed on the next lower line.
If there are more than 7 labels (variables) and no alternate labels are provided las
is set to 2
, otherwise some variable names may fail to be displayed.
The notches in the boxplots indicate the 95% confidence intervals for the medians and can extend beyond the upper and lower limits of the boxes indicating the middle 50% of the data when subset population sizes are small. The confidence intervals are estimated using the binomial theorem. It can be argued that for small populations a normal approximation would be better. However, it was decided to remain with a non-parametric estimate to be consistent with the use of non-parametric statistics in this display.bwplot
, var2fact
, ltdl.fix.df
## Make test data kola.c available
data(kola.c)
attach(kola.c)
## Display a simple box-and-whisker plot for measured variables
bwplot.by.var(cbind(Co,Cu,Ni))
## Display a more appropriately labelled and scaled box-and-whisker plot
bwplot.by.var(cbind(Co,Cu,Ni), log = TRUE,
ylab = "Levels (mg/kg) in < 2 mm C-horizon soil")
## Detach test data kola.c
detach(kola.c)
## Make test data ms.data1 available
data(ms.data1)
## Display variables in a data frame extending the whiskers to the
## 2nd and 98th percentiles of the data
bwplot.by.var(ms.data1, log=TRUE, wend = 0.02)
Run the code above in your browser using DataLab