Learn R Programming

gamlss (version 5.4-12)

quantSheets: Quantile Sheets

Description

The quantile sheets function quantSheets() is based on the work of Sabine Schnabe and Paul Eiler (see references below). The estimation of the quantile curves is done simultaneously by also smoothing in the direction of y as well as x. This avoids (but do not eliminate completely) the problem of crossing quantiles.

Usage

quantSheets(y, x, x.lambda = 1, p.lambda = 1, data = NULL, 
            cent = 100 * pnorm((-4:4) * 2/3), 
            control = quantSheets.control(...), print = TRUE,  ...)

quantSheets.control(x.inter = 10, p.inter = 10, degree = 3, logit = FALSE, order = 2, kappa = 0, n.cyc = 100, c.crit = 1e-05, plot = TRUE, power = NULL, ...)

findPower(y, x, data = NULL, lim.trans = c(0, 1.5), prof = FALSE, k = 2, c.crit = 0.01, step = 0.1)

z.scoresQS(object, y, x, plot = FALSE, tol = NULL)

Value

Using the function quantSheets() a quantSheets object is returned having the following methods: print(), fitted(), predict() and resid().

Using findPower() a single values of the power parameter is returned.

Using z.scoresQS a vector of z-scores is returned.

Arguments

y

the y variable

x

the x variable

x.lambda

smoothing parameter in the direction of x

p.lambda

smoothing parameter in the direction of y (probabilities)

data

the data frame

cent

the centile values where the quantile sheets is evaluated

control

for the parameters controlling the algorithm

print

whether to print the sample percentages

x.inter

number of intervals in the x direction for the B-splines

p.inter

number of intervals in the probabilities (y-direction) for the B-splines

degree

the degree for the B-splines

logit

whether to use logit(p) instead of p (probabilities) for the y-axis

order

the order of the penalty

kappa

is a ridge parameter set to zero (for no ridge effect)

n.cyc

number of cycles of the algorithm

c.crit

convergence criterion of the algorithm

plot

whether to plot the resulting quantile sheets

power

The value of the power transformation in the x axis if needed

lim.trans

the limits for looking for the power transformation parameter using findPower()

prof

whether to use the profile GAIC or optim() to the parameter the power transformation

k

the GAIC penalty

step

the steps for the profile GAIC if the argument prof of findPower() is TRUE

object

a fitted quantSheets object

tol

how far out from the range of the y variable should go for estimating the distribution of y using the flexDist() function

...

for further arguments

Author

Mikis Stasinopoulos based on function provided by Paul Eiler and Sabine Schnabe

Details

The advantage of quantile sheets is that they estimates simultaneously all the quantiles. This almost eliminates the problem of crossing quantiles. The method is very fast and useful for exploratory tool. The function needs two smoothing parameters. Those two parameters have to specified by the user. They are not estimated automatically. They can be selected by visual inspection.

The disadvantages of quantile sheets comes from the fact that like all non-parametric techniques do not have a goodness of fit measure to change how good is the models and the residuals based diagnostics are not existence since it is difficult to define residuals in this set up.

In this implementation we do provide residuals by using the flexDist() function from package gamlss.dist. This is based on the idea that by knowing the quantiles of the distribution we can reconstruct non parametrically the distribution itself and this is what flexDist() is doing. As a word of caution, such a construct is based on several assumptions and depends on several smoothing parameters. Treat those residuals with caution. The same caution should apply to the function z.scoresQS().

References

Schnabel, S.K. (2011) Expectile smoothing: new perspectives on asymmetric least squares. An application to life expectancy, Utrecht University.

Schnabel, S. K and Eilers, P. H. C.(2013) Simultaneous estimation of quantile curves using quantile sheets, AStA Advances in Statistical Analysis, 97, 1, pp 77-87, Springer.

Schnabel, S. K and Eilers, P. H. (2013) A location-scale model for non-crossing expectile curves, Stat, 2, 1, pp 171-183.

Rigby, R. A., Stasinopoulos, D. M., Heller, G. Z., and De Bastiani, F. (2019) Distributions for modeling location, scale, and shape: Using GAMLSS in R, Chapman and Hall/CRC. An older version can be found in https://www.gamlss.com/.

Stasinopoulos D. M. Rigby R.A. (2007) Generalized additive models for location scale and shape (GAMLSS) in R. Journal of Statistical Software, Vol. 23, Issue 7, Dec 2007, https://www.jstatsoft.org/v23/i07/.

Stasinopoulos D. M., Rigby R.A., Heller G., Voudouris V., and De Bastiani F., (2017) Flexible Regression and Smoothing: Using GAMLSS in R, Chapman and Hall/CRC.

(see also https://www.gamlss.com/).

See Also

lms: for a parametric equivalent results.

Examples

Run this code
data(abdom)
m1 <- quantSheets(y,x, data=abdom)
head(fitted(m1))
p1 <- predict(m1, newdata=c(20,30,40))
matpoints(c(20,30,40), p1)
z.scoresQS(m1,y=c(150, 300),x=c(20, 30) )
# If we needed a power transformation not appropriate for this data
findPower(y,x, data=abdom)

Run the code above in your browser using DataLab