Carries out model-based clustering using the mixture of coalesced generalized hyperbolic distributions.
MCGHD(data=NULL,gpar0=NULL,G=2,max.iter=100,eps=1e-2,label=NULL,
method="km",scale=TRUE,nr=10, modelSel="AIC")
A n x p matrix or data frame such that rows correspond to observations and columns correspond to variables.
(optional) A list containing the initial parameters of the mixture model. See the 'Details' section.
The range of values for the number of clusters.
(optional) A numerical parameter giving the maximum number of iterations each EM algorithm is allowed to use.
(optional) A number specifying the epsilon value for the convergence criteria used in the EM algorithms. For each algorithm, the criterion is based on the difference between the log-likelihood at an iteration and an asymptotic estimate of the log-likelihood at that iteration. This asymptotic estimate is based on the Aitken acceleration.
( optional) A n dimensional vector, if label[i]=k then observation i belongs to group k, If label[i]=0 then observation i has no known group, if NULL then the data has no known groups.
( optional) A string indicating the initialization criterion, if not specified kmeans clustering is used. Alternative methods are: hierarchical "hierarchical", random "random", and model based "modelBased"
( optional) A logical value indicating whether or not the data should be scaled, true by default.
( optional) A number indicating the number of starting value when random is used, 10 by default.
( optional) A string indicating the model selection criterion, if not specified AIC is used. Alternative methods are: BIC,ICL, and AIC3
A S4 object of class MixGHD with slots:
Value of the index used for model selection (AIC or ICL or BIC or AIC3) for each G,the index used is specified by the user, if not specified AIC is used.
Bayesian information criterion.
Integrated completed likelihood..
Akaike information criterion.
Akaike information criterion 3.
A list of the model parameters in the rotated space.
The log-likelihood values.
A vector of integers indicating the maximum a posteriori classifications for the best model.
A list of the model parameters.
A matrix giving the raw values upon which map is based.
The arguments gpar0, if specified, has to be a list structure containing as much element as the number of components G. Each element must include the following parameters: one p dimensional vector mu, alpha and phi, a pxp matrix gamma, a px2 vector cpl containing the vectors omega and lambda, and a 2-dimensional vector containing the omega0 and lambda0.
C. Tortora, B.C. Franczak, R.P. Browne, and P.D. McNicholas (2019). A Mixture of Coalesced Generalized Hyperbolic Distributions. Journal of Classification 36(1) 26-57.\ C. Tortora, R. P. Browne, A. ElSherbiny, B. C. Franczak, and P. D. McNicholas (2021). Model-Based Clustering, Classification, and Discriminant Analysis using the Generalized Hyperbolic Distribution: MixGHD R package, Journal of Statistical Software 98(3) 1--24, <doi:10.18637/jss.v098.i03>.
# NOT RUN {
##loading banknote data
data(banknote)
##model estimation
model=MCGHD(banknote[,2:7],G=2,max.iter=20)
#result
#summary(model)
#plot(model)
table(banknote[,1],model@map)
# }
Run the code above in your browser using DataLab