datamsaeDBns: Sample Data for Multivariate Small Area Estimation with Difference Benchmarking with clustering

Description

Dataset to simulate difference benchmarking of Multivariate Fay Herriot model for non-sampled area using clustering This data is generated base on multivariate Fay Herriot model by these following steps:

Generate explanatory variables X1 and X2. Take \(\mu_{X1}\) = \(\mu_{X1}\) = 10, \(\sigma_{X11}\)=1, \(\sigma_{X2}\)=2, and \(\rho_{x}\)= 1/2. Sampling error e is generated with the following \(\sigma_{e11}\) = 0.15, \(\sigma_{e22}\) = 0.25, \(\sigma_{e33}\) = 0.35, and \(\rho_{e}\) = 1/2. For random effect u, we set \(\sigma_{u11}\)= 0.2, \(\sigma_{u22}\)= 0.6, and \(\sigma_{u33}\)= 1.8. For the weight we generate w1 w2 w3 by set the w1 ~ U(25,30) , w2 ~ U(25,30), w3 ~ U(25,30) Calculate direct estimation Y1 Y2 Y3 where \(Y_{i}\) = \(X * \beta + u_{i} + e_{i}\) cl1 cl2 cl3 were obtained using K-Means clustering from the explanatory variables.
Then combine the direct estimations Y1 Y2 Y3, explanatory variables X1 X2, weights w1 w2 w3, and sampling varians covarians v1 v12 v13 v2 v23 v3 in a data frame then named as datamsaeDB

Usage

datamsaeDBns

Arguments

Format

A data frame with 30 rows and 18 variables:

clY1: cluster information of Y1
clY2: cluster information of Y2
clY3: cluster information of Y3
nonsample: A column with logical values, TRUE if the area is non-sampled
Y1: Direct Estimation of Y1
Y2: Direct Estimation of Y2
Y3: Direct Estimation of Y3
X1: Auxiliary variable of X1
X2: Auxiliary variable of X2
w1: Known proportion of units in small areas of Y1
w2: Known proportion of units in small areas of Y2
w3: Known proportion of units in small areas of Y3
v1: Sampling Variance of Y1
v12: Sampling Covariance of Y1 and Y2
v13: Sampling Covariance of Y1 and Y3
v2: Sampling Variance of Y2
v23: Sampling Covariance of Y2 and Y3
v3: Sampling Variance of Y3