This data set is synthetically generated from real Austrian EU-SILC (European Union Statistics on Income and Living Conditions) data.
data(eusilcP)
A data.frame with 58 654 observations on the following 28 variables:
hid
integer; the household ID.
region
factor; the federal state in which the household is located (levels Burgenland, Carinthia, Lower Austria, Salzburg, Styria, Tyrol, Upper Austria, Vienna and Vorarlberg).
hsize
integer; the number of persons in the household.
eqsize
numeric; the equivalized household size according to the modified OECD scale.
eqIncome
numeric; a simplified version of the equivalized household income.
pid
integer; the personal ID.
id
the household ID combined with the personal ID. The first five digits represent the household ID, the last two digits the personal ID (both with leading zeros).
age
integer; the person's age.
gender
factor; the person's gender (levels male and female).
ecoStat
factor; the person's economic status (levels 1 = working full time, 2 = working part time, 3 = unemployed, 4 = pupil, student, further training or unpaid work experience or in compulsory military or community service, 5 = in retirement or early retirement or has given up business, 6 = permanently disabled or/and unfit to work or other inactive person, 7 = fulfilling domestic tasks and care responsibilities).
citizenship
factor; the person's citizenship (levels AT, EU and Other).
py010n
numeric; employee cash or near cash income (net).
py050n
numeric; cash benefits or losses from self-employment (net).
py090n
numeric; unemployment benefits (net).
py100n
numeric; old-age benefits (net).
py110n
numeric; survivor's benefits (net).
py120n
numeric; sickness benefits (net).
py130n
numeric; disability benefits (net).
py140n
numeric; education-related allowances (net).
hy040n
numeric; income from rental of a property or land (net).
hy050n
numeric; family/children related allowances (net).
hy070n
numeric; housing allowances (net).
hy080n
numeric; regular inter-household cash transfer received (net).
hy090n
numeric; interest, dividends, profit from capital investments in unincorporated business (net).
hy110n
numeric; income received by people aged under 16 (net).
hy130n
numeric; regular inter-household cash transfer paid (net).
hy145n
numeric; repayments/receipts for tax adjustment (net).
main
logical; indicates the main income holder (i.e., the person with the highest income) of each household.
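The combined identifier described above (five digits of household ID followed by two digits of personal ID, both zero-padded) can be illustrated with a small sketch. The values below are made up and not taken from eusilcP, and storing the result as a character string is an assumption made here so that the leading zeros stay visible; the storage type used in the data set itself is not stated.

```r
# Toy household and personal IDs (made-up values, not taken from eusilcP)
hid <- c(1L, 1L, 23L, 24999L)
pid <- c(1L, 2L, 1L, 10L)

# Compose: five-digit household ID followed by two-digit personal ID,
# both padded with leading zeros (kept as character; an assumption)
id <- sprintf("%05d%02d", hid, pid)

# Decompose again by character position
hid_back <- as.integer(substr(id, 1, 5))
pid_back <- as.integer(substr(id, 6, 7))
```

Splitting by position works only because both parts have a fixed width.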
The data set is used as population data in some of the examples in package simFrame. Note that it is included for illustrative purposes only. It consists of 25 000 households, hence it does not represent the true population sizes of Austria and its regions.
Only a few of the large number of variables in the original survey are included in this example data set. Some variable names differ from the standardized names used by the statistical agencies, as the latter are rather cryptic codes. Furthermore, the variables hsize, eqsize, eqIncome and age are not included in the standardized format of EU-SILC data, but have been derived from other variables for convenience.
Moreover, some very sparse income components were not included in the generation of this synthetic data set. Thus the equivalized household income is computed from the available income components only.
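As a rough illustration of how derived variables such as hsize and eqsize relate to person-level data, the sketch below applies the modified OECD scale (weight 1.0 for the first household member aged 14 or over, 0.5 for each further member aged 14 or over, 0.3 for each child under 14) to a toy data frame. The data frame `persons` and its values are hypothetical, and this is not necessarily how the variables in eusilcP were produced.

```r
# Toy person-level data (hypothetical values, not taken from eusilcP)
persons <- data.frame(
  hid = c(1L, 1L, 1L, 2L, 3L, 3L),
  age = c(40L, 38L, 9L, 71L, 29L, 14L)
)

# hsize: number of persons in each household
hsize <- tapply(persons$age, persons$hid, length)

# eqsize: modified OECD scale; assumes every household has at least
# one member aged 14 or over
eqsize <- tapply(persons$age, persons$hid, function(a) {
  n14plus <- sum(a >= 14)  # members aged 14 or over
  nchild  <- sum(a < 14)   # children under 14
  1 + 0.5 * (n14plus - 1) + 0.3 * nchild
})
```

An equivalized income would then divide a household's total income by its eqsize; which income components enter that total in eusilcP is described only as "simplified", so that step is left out here.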
Eurostat (2004) Description of target variables: Cross-sectional and longitudinal. EU-SILC 065/04, Eurostat.
library("simFrame")  # provides eusilcP and stratify()
data(eusilcP)
summary(eusilcP)
# stratify the population by federal state and gender
strata <- stratify(eusilcP, c("region", "gender"))
summary(strata)