This function provides one approach for testing the significance of the spatial variation of the local parameter estimates resulted in by fitting a Geographically Weighted Zero Inflated Poisson Regression (GWZIPR) model. The approach consists of a Monte Carlo simulation according to which: a) the data are spatially reallocated in a random way; b) GWZIPR models fit for the original and simulated spatial data sets; c) the variance of each variable for the original and simulated sets is then calculated; d) a pseudo p-value for each variable V
is calculated as p = (1+C)/(1+M)
where C
is the number of cases in which the simulated data sets generated variances of the local parameter estimates of the variable V
that were as extreme as the observed local parameter estimates variance of the variable in question and M
is the number of permutations. If p <= 0.05
it can be argued that the spatial variation of the local parameters estimates for a variable V
is statistically significant. For this approach, a minimum of 19 simulations is required.
gw.zi.mc.test(Nsim = 19, formula, family, dframe, bw, kernel, coords)
Returns a list of the simulated values, the observed the pseudo p-value of significance
a vector with the variances of the observed local parameter estimates for each variable in the model.
a matrix with the variance of the simulated local parameter estimates for each variable in the model
a matrix with the number of cases in which the simulated data set generated variances of the local parameter estimates of a variable V
that were as extreme as the observed local parameter estimates variance of the variable in question
a vector of pseudo p-values for all the parameters in the model (constant and variables).
a positive integer that defines the number of the simulation's iterations
the local model to be fitted using the same syntax used in the zeroinfl function of the R package pscl
. This is a sting (a symbolic description of the model) that is passed to the sub-models' zeroinfl
function. For more details look at the details of the zeroinfl
function.
a specification of the count model family to be used in the local model as in the zeroinfl
function. Currently the only option tested is "poisson".
a numeric data frame of at least two suitable variables (one dependent and one independent)
a positive number that may be an integer in the case of an "adaptive kernel" or a real in the case of a "fixed kernel". In the first case the integer denotes the number of nearest neighbours, whereas in the latter case the real number refers to the bandwidth (in meters if the coordinates provided are Cartesian). This argument can be also the result of a bandwidth selection algorithm such as those available in the function gw.zi.bw
the kernel to be used in the regression. Options are "adaptive" or "fixed". The weighting scheme used here is defined by the bi-square function (weight = (1-(ndist/H)^2)^2
for distances less than or equal to H
, 0
otherwise)
a numeric matrix or data frame of two columns giving the X,Y coordinates of the observations
Stamatis Kalogirou <stamatis.science@gmail.com>
Large datasets may take way too long to perform this test.
For 0.05 level of significance in social sciences, a minimum number of 19 simulations (Nsim >= 19) is required. We recommend at least 99 and at best 999 iterations.
Kalogirou, S. (2016) Destination Choice of Athenians: an application of geographically weighted versions of standard and zero inflated Poisson spatial interaction models, Geographical Analysis, 48(2),pp. 191-230. DOI: 10.1111/gean.12092 https://onlinelibrary.wiley.com/doi/abs/10.1111/gean.12092
gw.zi.bw
gw.glm
gwr