KFS: Kalman Filter and Smoother with Exact Diffuse Initialization for Exponential Family State Space Models

Description

Performs Kalman filtering and smoothing with exact diffuse initialization using univariate approach for exponential family state space models.

Usage

KFS(
  model,
  filtering,
  smoothing,
  simplify = TRUE,
  transform = c("ldl", "augment"),
  nsim = 0,
  theta,
  maxiter = 50,
  convtol = 1e-08,
  return_model = TRUE,
  expected = FALSE,
  H_tol = 1e+15,
  transform_tol
)

Value

What KFS returns depends on the arguments filtering,

smoothing and simplify, and whether the model is Gaussian or not:

model: Original state space model.
KFS_transform: How the non-diagonal H was handled.
logLik: Value of the log-likelihood function. Only returned for fully Gaussian models.
a: One-step-ahead predictions of states, $a_t = E(\alpha_t | y_{t-1}, \ldots, y_{1})$.
P: Non-diffuse parts of the error covariance matrix of predicted states, $P_t = Var(\alpha_t | y_{t-1}, \ldots, y_{1}) $.
Pinf: Diffuse part of the error covariance matrix of predicted states. Only returned for Gaussian models.
att: Filtered estimates of states, $a_tt = E(\alpha_t | y_{t}, \ldots, y_{1})$.
Ptt: Non-diffuse parts of the error covariance matrix of filtered states, $P_tt = Var(\alpha_t | y_{t}, \ldots, y_{1}) $.
t: One-step-ahead predictions of signals, $E(Z_t\alpha_t | y_{t-1}, \ldots, y_{1})$.
P_theta: Non-diffuse part of $Var(Z_t\alpha_t | y_{t-1}, \ldots, y_{1})$.
m: One-step-ahead predictions $f(\theta_t) | y_{t-1}, \ldots, y_{1})$, where $f$ is the inverse link function. In case of Poisson distribution these predictions are multiplied with exposure $u_t$.
P_mu: Non-diffuse part of $Var(f(\theta_t) | y_{t-1}, \ldots, y_{1})$. In case of Poisson distribution this is $Var(u_t f(\theta_t) | y_{t-1}, \ldots, y_{1})$. If nsim = 0, only diagonal elements (variances) are computed, using the Delta method.
alphahat: Smoothed estimates of states, $E(\alpha_t | y_1, \ldots, y_n)$.
V: Error covariance matrices of smoothed states, $Var(\alpha_t | y_1, \ldots, y_n)$.
thetahat: Smoothed estimates of signals, $E(Z_t\alpha_t | y_1, \ldots, y_n)$.
V_theta: Error covariance matrices of smoothed signals $Var(Z[t]\alpha_t | y_1, \ldots, y_n).$.
muhat: Smoothed estimates of $f(\theta_t) | y_1, \ldots, y_n)$, where $f$ is the inverse link function, or in Poisson case $u_t f(\theta_t) | y_1, \ldots, y_n)$, where $u$ is the exposure term.
V_mu: Error covariances $Cov(f(\theta_t)| y_1, \ldots, y_n)$ (or the covariances of $u_t f(\theta_t)$ given the data in case of Poisson distribution). If nsim = 0, only diagonal elements (variances) are computed, using the Delta method.
etahat: Smoothed disturbance terms $E(\eta_t | y_1, \ldots, y_n)$. Only for Gaussian models.
V_eta: Error covariances $Var(\eta_t | y_1, \ldots, y_n)$. Note that for computing auxiliary residuals you shoud use method rstandard.KFS.
epshat: Smoothed disturbance terms $E(\epsilon_{t,i} | y_1, \ldots, y_n)$. Note that due to the possible diagonalization these are on transformed scale. Only for Gaussian models.
V_eps: Diagonal elements of $Var(\epsilon_{t} | y_1, \ldots, y_n)$. Note that due to the diagonalization the off-diagonal elements are zero. Only for Gaussian models. Note that for computing auxiliary residuals you shoud use method rstandard.KFS.
iterations: The number of iterations used in linearization of non-Gaussian model.
v: Prediction errors $v_{t,i} = y_{t,i} - Z_{i,t}a_{t,i}, i = 1, \ldots,p$, where $$a_{t,i} = E(\alpha_t | y_{t,i-1}, \ldots, y_{t,1}, \ldots, y_{1,1})$$. Only returned for Gaussian models.
F: Prediction error variances $Var(v_{t,i})$. Only returned for Gaussian models.
Finf: Diffuse part of prediction error variances. Only returned for Gaussian models.
d: The last time index of diffuse phase, i.e. the non-diffuse phase began at time $d+1$. Only returned for Gaussian models.
j: The last observation index $i$ of $y_{i,t}$ of the diffuse phase. Only returned for Gaussian models.

In addition, if argument simplify = FALSE, list contains following components:

K: Covariances $Cov(\alpha_{t,i}, y_{t,i} | y_{t,i-1}, \ldots, y_{t,1}, y_{t-1}, \ldots , y_{1}), \quad i = 1, \ldots, p$.
Kinf: Diffuse part of $K_t$.
r: Weighted sums of innovations $v_{t+1}, \ldots, v_{n}$. Notice that in literature $t$ in $r_t$ goes from $0, \ldots, n$. Here $t = 1, \ldots, n + 1$. Same applies to all $r$ and $N$ variables.
r0, r1: Diffuse phase decomposition of $r_t$.
N: Covariances $Var(r_t)$.
N0, N1, N2: Diffuse phase decomposition of $N_t$.

Arguments

model: Object of class SSModel.
filtering: Types of filtering. Possible choices are "state", "signal", "mean", and "none". Default is "state" for Gaussian and "none" for non-Gaussian models. Multiple values are allowed. For Gaussian models, the signal is the mean. Note that filtering for non-Gaussian models with importance sampling can be very slow with large models.
smoothing: Types of smoothing. Possible choices are "state", "signal", "mean", "disturbance", and "none". Default is "state" and "mean". For non-Gaussian models, option "disturbance" is not supported, and for Gaussian models option "mean" is identical to "signal". Multiple values are allowed.
simplify: If FALSE and the model is completely Gaussian, KFS returns some generally not so interesting variables from filtering and smoothing. Default is TRUE.
transform: How to transform the model in case of non-diagonal covariance matrix H. Defaults to "ldl". See function transformSSM for details.
nsim: The number of independent samples used in importance sampling. Only used for non-Gaussian models. Default is 0, which computes the approximating Gaussian model by approxSSM and performs the usual Gaussian filtering/smoothing so that the smoothed state estimates equals to the conditional mode of $p(\alpha_t|y)$. In case of nsim = 0, the mean estimates and their variances are computed using the Delta method (ignoring the covariance terms).
theta: Initial values for conditional mode theta. Only used for non-Gaussian models.
maxiter: The maximum number of iterations used in Gaussian approximation. Default is 50. Only used for non-Gaussian models.
convtol: Tolerance parameter for convergence checks for Gaussian approximation. Only used for non-Gaussian models.
return_model: Logical, indicating whether the original input model should be returned as part of the output. Defaults to TRUE, but for large models can be set to FALSE in order to save memory. However, many of the methods operating on the output of KFS use this model so this will not work if return_model=FALSE.
expected: Logical value defining the approximation of H_t in case of Gamma and negative binomial distribution. Default is FALSE which matches the algorithm of Durbin & Koopman (1997), whereas TRUE uses the expected value of observations in the equations, leading to results which match with glm (where applicable). The latter case was the default behaviour of KFAS before version 1.3.8. Essentially this is the difference between observed and expected information in the GLM context. Only used for non-Gaussian model.
H_tol: Tolerance parameter for check max(H) > tol_H, which suggests that the approximation converged to degenerate case with near zero signal-to-noise ratio. Default is very generous 1e15. Only used for non-Gaussian model.
transform_tol: Tolerance parameter for LDL decomposition in case of a non-diagonal H and transform = "ldl". See transformSSM and ldl for details.

Details

Notice that in case of multivariate Gaussian observations, v, F, Finf, K and Kinf are usually not the same as those calculated in usual multivariate Kalman filter. As filtering is done one observation element at the time, the elements of the prediction error $v_t$ are uncorrelated, and F, Finf, K and Kinf contain only the diagonal elemens of the corresponding covariance matrices. The usual multivariate versions of F and v can be obtained from the output of KFS using the function mvInnovations.

In rare cases (typically with regression components with high multicollinearity or long cyclic patterns), the cumulative rounding errors in Kalman filtering and smoothing can cause the diffuse phase end too early, or the backward smoothing gives negative variances (in diffuse and nondiffuse cases). Since version 1.4.0, filtering and smoothing algorithms truncate these values to zero during the recursions, but this can still leads some numerical problems. In these cases, redefining the prior state variances more informative is often helpful. Changing the tolerance parameter tol of the model (see SSModel) to smaller (or larger), or scaling the model input can sometimes help as well. These numerical issues are well known in Kalman filtering/smoothing in general (there are other numerically more stable versions available, but these are in general slower).

Fon non-Gaussian models the components corresponding to diffuse filtering (Finf, Pinf, d, Kinf) are not returned even when filtering is used. Results based on approximating Gaussian model can be obtained by running KFS using the output of approxSSM.

In case of non-Gaussian models with nsim = 0, the smoothed estimates relate to the conditional mode of $p(\alpha|y)$. When using importance sampling (nsim>0), results correspond to the conditional mean of $p(\alpha|y)$.

References

Koopman, S.J. and Durbin J. (2000). Fast filtering and smoothing for non-stationary time series models, Journal of American Statistical Assosiation, 92, 1630-38.

Koopman, S.J. and Durbin J. (2001). Time Series Analysis by State Space Methods. Oxford: Oxford University Press.

Koopman, S.J. and Durbin J. (2003). Filtering and smoothing of state vector for diffuse state space models, Journal of Time Series Analysis, Vol. 24, No. 1.

Examples

Run this code

set.seed(1)
x <- cumsum(rnorm(100, 0, 0.1))
y <- rnorm(100, x, 0.1)
model <- SSModel(y ~ SSMtrend(1, Q = 0.01), H = 0.01)
out <- KFS(model)
ts.plot(ts(x), out$a, out$att, out$alpha, col = 1:4)

Run the code above in your browser using DataLab