To measure the effects of smoothing, Getmansky, Lo, et al (2004) define a "smoothing profile" as a vector of coefficients for an MLE fit on returns using a two-period moving-average process. The moving-average process of order $k=2$ gives $R_t = _{0} R_{t} + _1 R_{t -1} + _2 R_{t-2}$, under the constraint that the sum of the coefficients is equal to 1. In R, the arima
function allows us to create an MA(2) model using an "ARIMA(p,d,q)" model, where $p$ is the number of autoregressive terms (AR), $d$ is the degree of differencing, and $q$ is the number of lagged forecast errors (MA) in the prediction equation. The order
parameter allows us to specify the three components (p, d, q) as an argument, e.g., order = c(0, 0, 2)
. The method
specifies how to fit the model, in this case using maximum likelihood estimation (MLE) in a fashion similar to the estimation of standard moving-average time series models, using:
arima(ra, order=c(0,0,2), method="ML", transform.pars=TRUE, include.mean=FALSE)
include.mean
: Getmansky, et al. (2004) p 555 "By applying the above procedure
to observed de-meaned returns...", so we set that parameter to 'FALSE'.
transform.pars
: ibid, "we impose the additional restriction that the estimated MA(k)
process be invertible," so we set the parameter to 'TRUE'.
The coefficients, $_{j}$, are then normalized to sum to interpreted as a "weighted average of the fund's true returns over the most recent $k + 1$ periods, including the current period."
If these weights are disproportionately centered on a small number of lags, relatively little serial correlation will be induced. However, if the weights are evenly distributed among many lags, this would show higher serial correlation.
The paper notes that because $_j [0, 1]$, $$ is also confined to the unit interval, and is minimized when all the $_j$'s are identical. That implies a value of $1/(k + 1)$ for $$, and a maximum value of $= 1$ when one coefficient is 1 and the rest are 0. In the context of smoothed returns, a lower value of $$ implies more smoothing, and the upper bound of 1 implies no smoothing.
The "smoothing index," represented as $$, is calculated the same way the Herfindahl index. The Herfindal measure is well known in the industrial organization literature as a measure of the concentration of firms in a given industry where $y_j$ represents the market share of firm $j$.
This method (as well as the implementation described in the paper), does not enforce $_j [0, 1]$, so $$ is not limited to that range either. All we can say is that lower values are "less liquid" and higher values are "more liquid" or mis-specified. In this function, setting the parameter neg.thetas = FALSE does enforce the limitation, eliminating negative autocorrelation coefficients from the calculation (the papers below do not make an economic case for eliminating negative autocorrelation, however).
Interpretation of the resulting value is difficult. All we can say is that lower values
appear to have autocorrelation structure like we might expect of "less liquid" instruments. Higher values appear "more liquid" or are poorly fit or mis-specified.
Chan, Nicholas, Mila Getmansky, Shane M. Haas, and Andrew W. Lo. 2005. Systemic Risk
and Hedge Funds. NBER Working Paper Series (11200).
Getmansky, Mila, Andrew W. Lo, and Igor Makarov. 2004. An Econometric Model of Serial
Correlation and Illiquidity in Hedge Fund Returns. Journal of Financial Economics (74):
529-609.
[object Object]
data(managers)
data(edhec)
indexes=merge(managers[,8:10],edhec)
indexes=na.omit(indexes)
sapply(indexes,FUN=SmoothingIndex)
SmoothingIndex(managers[,8,drop=FALSE])ts
multivariate
distribution
models