Density, distribution function, quantile function and random generation for the generally--altered, --inflated and --truncated Poisson distribution. Both parametric and nonparametric variants are supported; these are based on finite mixtures of the parent with itself and the multinomial logit model (MLM) respectively. Altogether it can be abbreviated as GAAIIT--Pois(lambda.p)--Pois(lambda.a)--MLM--Pois(lambda.i)--MLM, and it is also known as the GAIT-Pois PNP combo.
dgaitpois(x, lambda.p, alter.mix = NULL, alter.mlm = NULL,
inflate.mix = NULL, inflate.mlm = NULL, truncate = NULL,
max.support = Inf, pobs.mix.a = 0, pobs.mlm.a = 0,
pstr.mix.i = 0, pstr.mlm.i = 0, lambda.a = lambda.p,
lambda.i = lambda.p, byrow.arg = FALSE, deflation = FALSE,
log.arg = FALSE)
pgaitpois(q, lambda.p, alter.mix = NULL, alter.mlm = NULL,
inflate.mix = NULL, inflate.mlm = NULL, truncate = NULL,
max.support = Inf, pobs.mix.a = 0, pobs.mlm.a = 0,
pstr.mix.i = 0, pstr.mlm.i = 0, lambda.a = lambda.p,
lambda.i = lambda.p, byrow.arg = FALSE)
qgaitpois(p, lambda.p, alter.mix = NULL, alter.mlm = NULL,
inflate.mix = NULL, inflate.mlm = NULL, truncate = NULL,
max.support = Inf, pobs.mix.a = 0, pobs.mlm.a = 0,
pstr.mix.i = 0, pstr.mlm.i = 0, lambda.a = lambda.p,
lambda.i = lambda.p, byrow.arg = FALSE)
rgaitpois(n, lambda.p, alter.mix = NULL, alter.mlm = NULL,
inflate.mix = NULL, inflate.mlm = NULL, truncate = NULL,
max.support = Inf, pobs.mix.a = 0, pobs.mlm.a = 0,
pstr.mix.i = 0, pstr.mlm.i = 0, lambda.a = lambda.p,
lambda.i = lambda.p, byrow.arg = FALSE)
Same meaning as in rpois
.
Same meaning as in rpois
,
i.e., for an ordinary Poisson distribution.
The first is for the main parent (inner) distribution.
The other two concern the parametric variant and
these outer distributions (usually spikes) may be
altered and/or inflated.
Short vectors are recycled.
numeric; these specify the set of truncated values.
The default value of NULL
means an empty set for the former.
The latter is the
maximum support value so that any value larger
has been truncated (necessary because
truncate = (max.support + 1):Inf
is not allowed),
hence is needed for truncating the upper tail of the distribution.
Note that max(truncate) < max.support
must be satisfied
otherwise an error message will be issued.
Vectors of nonnegative integers;
the altered, inflated and truncated values for the
parametric variant.
Each argument must have unique values only.
Assigning argument alter.mix
means that pobs.mix.a
will be used.
Assigning argument inflate.mix
means that pstr.mix.i
will be used.
If alter.mix
is of unit length
then the default probability mass function (PMF)
evaluated at alter.mix
will be pobs.mix.a
.
So having alter.mix = 0
corresponds to the
zero-inflated Poisson distribution (see Zipois
).
Similar to the above, but for the nonparametric (MLM) variant.
Assigning argument alter.mlm
means that pobs.mlm.a
will be used.
Assigning argument inflate.mlm
means that pstr.mlm.i
will be used.
Collectively, the above 6 arguments represent
5 disjoint sets of
special values and they are a proper subset of the support of the
distribution.
These arguments are coerced into a matrix of probabilities
using byrow.arg
to determine the order of the elements
(similar to matrix
).
The first argument is recycled if necessary to become
n x length(alter.mlm)
.
The second argument becomes
n x length(inflate.mlm)
.
Thus these arguments are not used unless
alter.mlm
and inflate.mlm
are assigned.
If deflation = TRUE
then pstr.mlm.i
may be
negative.
Vectors of probabilities that are recycled if necessary to
length \(n\).
The first argument is used when alter.mix
is not NULL
.
The second argument is used when inflate.mix
is not NULL
.
Logical. If TRUE
then pstr.mlm.i
is allowed to have
negative values,
however, not too negative so that the final PMF becomes negative.
Of course, if the values are negative then they cannot be
interpreted as probabilities.
In theory, one could also allow pstr.mix.i
to be negative,
but currently this is disallowed.
dgaitpois
gives the density,
pgaitpois
gives the distribution function,
qgaitpois
gives the quantile function, and
rgaitpois
generates random deviates.
The default values of the arguments correspond to ordinary
dpois
,
ppois
,
qpois
,
rpois
respectively.
These functions allow any combination of 3 operator types:
truncation, alteration and inflation.
The precedence is
truncation, then alteration and lastly inflation.
This order minimizes the potential interference among the operators.
Loosely, a set of probabilities is set to 0 by truncation
and the remaining probabilities are scaled up.
Then a different set of probabilities are set to some
values pobs.mix.a
and/or pobs.mlm.a
and the remaining probabilities are rescaled up.
Then another different set of probabilities is inflated by
an amount pstr.mlm.i
and/or proportional
to pstr.mix.i
so that individual elements in this set have two sources.
Then all the probabilities are
rescaled so that they sum to unity.
Both parametric and nonparametric variants are implemented.
They usually have arguments with suffix
.mix
and .mlm
respectively.
The MLM is a loose coupling that effectively separates
the parent distribution from the altered values.
Values inflated nonparametrically effectively have
their spikes shaved off.
The .mix
variant has associated with it
lambda.a
and lambda.i
because it is mixture of 3 Poisson distributions with
partitioned or nested support.
Any value of the support of the distribution that is altered, inflated or truncated is called a special value. A special value that is altered may mean that its probability increases or decreases relative to the parent distribution. An inflated special value means that its probability has increased, provided alteration elsewhere has not made it decrease in the first case.
Jargonwise,
the outer distributions concern those special values which
are altered or inflated, and
the inner distribution concerns the remaining
support points that correspond directly to
the parent distribution.
These functions do what
Zapois
,
Zipois
,
Pospois
collectively did plus much more.
In the notation of Yee and Ma (2020)
these functions allow for the special cases:
(i) GAIT--Pois(lambda.p
)--Pois(lambda.a
,
alter.mix
, pobs.mix.a
)--Pois(lambda.i
,
inflate.mix
, pstr.mix.i
);
(ii) GAIT--Pois(lambda.p
)--MLM(alter.mlm
,
pobs.mlm.a
)--MLM(inflate.mlm
, pstr.mlm.i
).
Model (i) is totally parametric while model (ii) is the most
nonparametric possible.
Yee, T. W. and Ma, C. (2020). Generally--altered, --inflated and --truncated regression, with application to heaped and seeped count data. In preparation.
multinomial
,
gaitpoisson.mlm
,
gaitpoisson.mix
,
Zapois
,
Zipois
,
Pospois
Poisson
;
Gaitlog
,
Gaitzeta
.
# NOT RUN {
ivec <- c(6, 14); avec <- c(8, 11); lambda <- 10; xgrid <- 0:25
tvec <- 15; max.support <- 20; pobs.a <- 0.05; pstr.i <- 0.25
(ddd <- dgaitpois(xgrid, lambda, lambda.a = lambda + 5,
truncate = tvec, max.support = max.support, pobs.mix.a = pobs.a,
pobs.mlm.a = pobs.a, alter.mlm = avec,
pstr.mix.i = pstr.i, inflate.mix = ivec))
# }
# NOT RUN {
plot(xgrid, ddd, type = "n", ylab = "Probability", xlab = "x",
main = "GAIT PNP Combo PMF---Poisson Parent")
mylwd <- 0.5
abline(v = avec, col = 'green', lwd = mylwd)
abline(v = ivec, col = 'red', lwd = mylwd)
abline(v = tvec, col = 'tan', lwd = mylwd)
abline(v = max.support, col = 'magenta', lwd = mylwd)
abline(h = c(pobs.a, pstr.i, 0:1), col = 'gray', lty = "dashed")
lines(xgrid, dpois(xgrid, lambda), col = 'gray', lty = "dashed") # f_{\pi}
lines(xgrid, ddd, type = "h", col = "blue", lwd = 3) # GAIT PNP combo PMF
points(xgrid[ddd == 0], ddd[ddd == 0], pch = 16)
# }
Run the code above in your browser using DataLab