
glm
. See the
documentation for glm
for the details on how such model
fitting takes place.
family(object, ...)
binomial(link = "logit")
gaussian(link = "identity")
Gamma(link = "inverse")
inverse.gaussian(link = "1/mu^2")
poisson(link = "log")
quasi(link = "identity", variance = "constant")
quasibinomial(link = "logit")
quasipoisson(link = "log")
"link-glm"
(such as generated by
make.link
) provided it is not specified
via one of the standard names given next. The gaussian
family accepts the links (as names)
identity
, log
and inverse
;
the binomial
family the links logit
,
probit
, cauchit
, (corresponding to logistic,
normal and Cauchy CDFs respectively) log
and
cloglog
(complementary log-log);
the Gamma
family the links inverse
, identity
and log
;
the poisson
family the links log
, identity
,
and sqrt
and the inverse.gaussian
family the links
1/mu^2
, inverse
, identity
and log
.
The quasi
family accepts the links logit
, probit
,
cloglog
, identity
, inverse
,
log
, 1/mu^2
and sqrt
, and
the function power
can be used to create a
power link function.
quasi
, the variance
function is determined by the family. The quasi
family will
accept the literal character string (or unquoted as a name/expression)
specifications "constant"
, "mu(1-mu)"
, "mu"
,
"mu^2"
and "mu^3"
, a length-one character vector
taking one of those values, or a list containing components
varfun
, validmu
, dev.resids
, initialize
and name
.
family
accesses the family
objects which are stored within objects created by modelling
functions (e.g., glm
)."family"
(which has a concise print method).
This is a list with elements
(y, mu, wt)
.NA
for the quasi- families). See logLik
for the assumptions
made about the dispersion parameter.function(eta)
$d\mu/d\eta$.n
(needed for
AIC in the binomial family) and mustart
(see glm
.TRUE
if a mean
vector mu
is within the domain of variance
.TRUE
if a linear
predictor eta
is within the domain of linkinv
.simulate(object, nsim)
to be
called by the "lm"
method of simulate
. It will
normally return a matrix with nsim
columns and one row for
each fitted value, but it can also return a list of length
nsim
. Clearly this will be missing for ‘quasi-’ families.family
is a generic function with methods for classes
"glm"
and "lm"
(the latter returning gaussian()
).
For the binomial
and quasibinomial
families the response
can be specified in one of three ways:
0
and
1
, interpreted as the proportion of successful cases (with the
total number of cases given by the weights
).
The quasibinomial
and quasipoisson
families differ from
the binomial
and poisson
families only in that the
dispersion parameter is not fixed at one, so they can model
over-dispersion. For the binomial case see McCullagh and Nelder
(1989, pp.\ifelse{latex}{\out{~}}{ } 124--8). Although they show that there is (under some
restrictions) a model with
variance proportional to mean as in the quasi-binomial model, note
that glm
does not compute maximum-likelihood estimates in that
model. The behaviour of S is closer to the quasi- variants.
Dobson, A. J. (1983) An Introduction to Statistical Modelling. London: Chapman and Hall.
Cox, D. R. and Snell, E. J. (1981). Applied Statistics; Principles and Examples. London: Chapman and Hall.
Hastie, T. J. and Pregibon, D. (1992) Generalized linear models. Chapter 6 of Statistical Models in S eds J. M. Chambers and T. J. Hastie, Wadsworth & Brooks/Cole.
glm
, power
, make.link
. For binomial coefficients, choose
;
the binomial and negative binomial distributions,
Binomial
, and NegBinomial
.
require(utils) # for str
nf <- gaussian() # Normal family
nf
str(nf)
gf <- Gamma()
gf
str(gf)
gf$linkinv
gf$variance(-3:4) #- == (.)^2
## quasipoisson. compare with example(glm)
counts <- c(18,17,15,20,10,20,25,13,12)
outcome <- gl(3,1,9)
treatment <- gl(3,3)
d.AD <- data.frame(treatment, outcome, counts)
glm.qD93 <- glm(counts ~ outcome + treatment, family = quasipoisson())
glm.qD93
anova(glm.qD93, test = "F")
summary(glm.qD93)
## for Poisson results use
anova(glm.qD93, dispersion = 1, test = "Chisq")
summary(glm.qD93, dispersion = 1)
## Example of user-specified link, a logit model for p^days
## See Shaffer, T. 2004. Auk 121(2): 526-540.
logexp <- function(days = 1)
{
linkfun <- function(mu) qlogis(mu^(1/days))
linkinv <- function(eta) plogis(eta)^days
mu.eta <- function(eta) days * plogis(eta)^(days-1) * binomial()$mu_eta
valideta <- function(eta) TRUE
link <- paste0("logexp(", days, ")")
structure(list(linkfun = linkfun, linkinv = linkinv,
mu.eta = mu.eta, valideta = valideta, name = link),
class = "link-glm")
}
binomial(logexp(3))
## in practice this would be used with a vector of 'days', in
## which case use an offset of 0 in the corresponding formula
## to get the null deviance right.
## Binomial with identity link: often not a good idea.
## Not run: binomial(link = make.link("identity"))
## tests of quasi
x <- rnorm(100)
y <- rpois(100, exp(1+x))
glm(y ~ x, family = quasi(variance = "mu", link = "log"))
# which is the same as
glm(y ~ x, family = poisson)
glm(y ~ x, family = quasi(variance = "mu^2", link = "log"))
## Not run: glm(y ~ x, family = quasi(variance = "mu^3", link = "log")) # fails
y <- rbinom(100, 1, plogis(x))
# needs to set a starting value for the next fit
glm(y ~ x, family = quasi(variance = "mu(1-mu)", link = "logit"), start = c(0,1))
Run the code above in your browser using DataLab