estim_NLS
estimates the autoregressive and weight parameters of STVAR models
by the method of least squares (relative_dens
weight function is not supported).
estim_NLS(
data,
p,
M,
weight_function = c("relative_dens", "logistic", "mlogit", "exponential", "threshold",
"exogenous"),
weightfun_pars = NULL,
cond_dist = c("Gaussian", "Student", "ind_Student", "ind_skewed_t"),
parametrization = c("intercept", "mean"),
AR_constraints = NULL,
mean_constraints = NULL,
weight_constraints = NULL,
penalized = TRUE,
penalty_params = c(0.05, 0.2),
min_obs_coef = 3,
sparse_grid = FALSE,
use_parallel = TRUE,
ncores = 2
)
Returns the estimated parameters in a vector of the form
\((\phi_{1},...,\phi_{M},\varphi_1,...,\varphi_M,\alpha\), where
\(\phi_{m} = \) the \((d \times 1)\) intercept vector of the \(m\)th regime.
\(\varphi_m = (vec(A_{m,1}),...,vec(A_{m,p}))\) \((pd^2 \times 1)\).
\(\alpha\) is the vector of the weight parameters:
weight_function="relative_dens"
:\(\alpha = (\alpha_1,...,\alpha_{M-1})\) \((M - 1 \times 1)\), where \(\alpha_m\) \((1\times 1), m=1,...,M-1\) are the transition weight parameters.
weight_function="logistic"
:\(\alpha = (c,\gamma)\) \((2 \times 1)\), where \(c\in\mathbb{R}\) is the location parameter and \(\gamma >0\) is the scale parameter.
weight_function="mlogit"
:\(\alpha = (\gamma_1,...,\gamma_M)\) \(((M-1)k\times 1)\), where \(\gamma_m\) \((k\times 1)\), \(m=1,...,M-1\) contains the multinomial logit-regression coefficients of the \(m\)th regime. Specifically, for switching variables with indices in \(I\subset\lbrace 1,...,d\rbrace\), and with \(\tilde{p}\in\lbrace 1,...,p\rbrace\) lags included, \(\gamma_m\) contains the coefficients for the vector \(z_{t-1} = (1,\tilde{z}_{\min\lbrace I\rbrace},...,\tilde{z}_{\max\lbrace I\rbrace})\), where \(\tilde{z}_{i} =(y_{it-1},...,y_{it-\tilde{p}})\), \(i\in I\). So \(k=1+|I|\tilde{p}\) where \(|I|\) denotes the number of elements in \(I\).
weight_function="exponential"
:\(\alpha = (c,\gamma)\) \((2 \times 1)\), where \(c\in\mathbb{R}\) is the location parameter and \(\gamma >0\) is the scale parameter.
weight_function="threshold"
:\(\alpha = (r_1,...,r_{M-1})\) \((M-1 \times 1)\), where \(r_1,...,r_{M-1}\) are the thresholds.
weight_function="exogenous"
:Omit \(\alpha\) from the parameter vector.
For models with...
Replace \(\varphi_1,...,\varphi_M\) with \(\psi\) as described in the argument AR_constraints
.
If linear constraints are imposed, replace \(\alpha\) with \(\xi\) as described in the
argument weigh_constraints
. If weight functions parameters are imposed to be fixed values, simply drop \(\alpha\)
from the parameter vector.
a matrix or class 'ts'
object with d>1
columns. Each column is taken to represent
a univariate time series. Missing values are not supported.
a positive integer specifying the autoregressive order
a positive integer specifying the number of regimes
What type of transition weights \(\alpha_{m,t}\) should be used?
"relative_dens"
:\(\alpha_{m,t}= \frac{\alpha_mf_{m,dp}(y_{t-1},...,y_{t-p+1})}{\sum_{n=1}^M\alpha_nf_{n,dp}(y_{t-1},...,y_{t-p+1})}\), where \(\alpha_m\in (0,1)\) are weight parameters that satisfy \(\sum_{m=1}^M\alpha_m=1\) and \(f_{m,dp}(\cdot)\) is the \(dp\)-dimensional stationary density of the \(m\)th regime corresponding to \(p\) consecutive observations. Available for Gaussian conditional distribution only.
"logistic"
:\(M=2\), \(\alpha_{1,t}=1-\alpha_{2,t}\), and \(\alpha_{2,t}=[1+\exp\lbrace -\gamma(y_{it-j}-c) \rbrace]^{-1}\), where \(y_{it-j}\) is the lag \(j\) observation of the \(i\)th variable, \(c\) is a location parameter, and \(\gamma > 0\) is a scale parameter.
"mlogit"
:\(\alpha_{m,t}=\frac{\exp\lbrace \gamma_m'z_{t-1} \rbrace} {\sum_{n=1}^M\exp\lbrace \gamma_n'z_{t-1} \rbrace}\), where \(\gamma_m\) are coefficient vectors, \(\gamma_M=0\), and \(z_{t-1}\) \((k\times 1)\) is the vector containing a constant and the (lagged) switching variables.
"exponential"
:\(M=2\), \(\alpha_{1,t}=1-\alpha_{2,t}\), and \(\alpha_{2,t}=1-\exp\lbrace -\gamma(y_{it-j}-c) \rbrace\), where \(y_{it-j}\) is the lag \(j\) observation of the \(i\)th variable, \(c\) is a location parameter, and \(\gamma > 0\) is a scale parameter.
"threshold"
:\(\alpha_{m,t} = 1\) if \(r_{m-1}<y_{it-j}\leq r_{m}\) and \(0\) otherwise, where \(-\infty\equiv r_0<r_1<\cdots <r_{M-1}<r_M\equiv\infty\) are thresholds \(y_{it-j}\) is the lag \(j\) observation of the \(i\)th variable.
"exogenous"
:Exogenous nonrandom transition weights, specify the weight series in weightfun_pars
.
See the vignette for more details about the weight functions.
weight_function == "relative_dens"
:Not used.
weight_function %in% c("logistic", "exponential", "threshold")
:a numeric vector with the switching variable \(i\in\lbrace 1,...,d \rbrace\) in the first and the lag \(j\in\lbrace 1,...,p \rbrace\) in the second element.
weight_function == "mlogit"
:a list of two elements:
$vars
:a numeric vector containing the variables that should used as switching variables in the weight function in an increasing order, i.e., a vector with unique elements in \(\lbrace 1,...,d \rbrace\).
$lags
:an integer in \(\lbrace 1,...,p \rbrace\) specifying the number of lags to be used in the weight function.
weight_function == "exogenous"
:a size (nrow(data) - p
x M
) matrix containing the exogenous
transition weights as [t, m]
for time \(t\) and regime \(m\). Each row needs to sum to one and only weakly positive
values are allowed.
specifies the conditional distribution of the model as "Gaussian"
, "Student"
, "ind_Student"
,
or "ind_skewed_t"
, where "ind_Student"
the Student's \(t\) distribution with independent components, and
"ind_skewed_t"
is the skewed \(t\) distribution with independent components (see Hansen, 1994).
"intercept"
or "mean"
determining whether the model is parametrized with intercept
parameters \(\phi_{m}\) or regime means \(\mu_{m}\), m=1,...,M.
a size \((Mpd^2 \times q)\) constraint matrix \(C\) specifying linear constraints
to the autoregressive parameters. The constraints are of the form
\((\varphi_{1},...,\varphi_{M}) = C\psi\), where \(\varphi_{m} = (vec(A_{m,1}),...,vec(A_{m,p})) \ (pd^2 \times 1),\ m=1,...,M\),
contains the coefficient matrices and \(\psi\) \((q \times 1)\) contains the related parameters.
For example, to restrict the AR-parameters to be the identical across the regimes, set \(C =\)
[I:...:I
]' \((Mpd^2 \times pd^2)\) where I = diag(p*d^2)
.
Restrict the mean parameters of some regimes to be identical? Provide a list of numeric vectors
such that each numeric vector contains the regimes that should share the common mean parameters. For instance, if
M=3
, the argument list(1, 2:3)
restricts the mean parameters of the second and third regime to be
identical but the first regime has freely estimated (unconditional) mean. Ignore or set to NULL
if mean parameters
should not be restricted to be the same among any regimes. This constraint is available only for mean parametrized models;
that is, when parametrization="mean"
.
a list of two elements, \(R\) in the first element and \(r\) in the second element, specifying linear constraints on the transition weight parameters \(\alpha\). The constraints are of the form \(\alpha = R\xi + r\), where \(R\) is a known \((a\times l)\) constraint matrix of full column rank (\(a\) is the dimension of \(\alpha\)), \(r\) is a known \((a\times 1)\) constant, and \(\xi\) is an unknown \((l\times 1)\) parameter. Alternatively, set \(R=0\) to constrain the weight parameters to the constant \(r\) (in this case, \(\alpha\) is dropped from the constrained parameter vector).
Perform penalized LS estimation that minimizes penalized RSS in which estimates close to breaking or not satisfying the
usual stability condition are penalized? If TRUE
, the tuning parameter is set by the argument penalty_params[2]
,
and the penalization starts when the eigenvalues of the companion form AR matrix are larger than 1 - penalty_params[1]
.
a numeric vector with two positive elements specifying the penalization parameters: the first element determined how far from the boundary of the stability region the penalization starts (a number between zero and one, smaller number starts penalization closer to the boundary) and the second element is a tuning parameter for the penalization (a positive real number, a higher value penalizes non-stability more).
the smallest accepted number of observations (times variables) from each regime
relative to the number of parameters in the regime. For models with AR constraints, the number of
AR matrix parameters in each regimes is simplisticly assumed to be ncol(AR_constraints)/M
.
should the grid of weight function values in LS/NLS estimation be more sparse (speeding up the estimation)?
employ parallel computing? If FALSE
, does not print anything.
the number CPU cores to be used in parallel computing.
Used internally in the multiple phase estimation procedure proposed by Virolainen (2025).
The weight function relative_dens
is not supported. Mean constraints are not supported.
Only weight constraints that specify the weight parameters as fixed values are supported.
Only intercept parametrization is supported.
Hubrich K., Teräsvirta. T. 2013. Thresholds and Smooth Transitions in Vector Autoregressive Models. CREATES Research Paper 2013-18, Aarhus University.
Virolainen S. 2025. Identification by non-Gaussianity in structural threshold and smooth transition vector autoregressive models. Unpublished working paper, available as arXiv:2404.19707.