gpg: Gender pay (wage) gap.

Description

Estimate the gender pay (wage) gap.

Usage

gpg(
  inc,
  gender = NULL,
  method = c("mean", "median"),
  weights = NULL,
  sort = NULL,
  years = NULL,
  breakdown = NULL,
  design = NULL,
  cluster = NULL,
  data = NULL,
  var = NULL,
  alpha = 0.05,
  na.rm = FALSE,
  ...
)

Value

A list of class "gpg" (which inherits from the class "indicator") with the following components:

value: a numeric vector containing the overall value(s).
valueByStratum: a data.frame containing the values by domain, or NULL.
varMethod: a character string specifying the type of variance estimation used, or NULL if variance estimation was omitted.
var: a numeric vector containing the variance estimate(s), or NULL.
varByStratum: a data.frame containing the variance estimates by domain, or NULL.
ci: a numeric vector or matrix containing the lower and upper endpoints of the confidence interval(s), or NULL.
ciByStratum: a data.frame containing the lower and upper endpoints of the confidence intervals by domain, or NULL.
alpha: a numeric value giving the significance level used for computing the confidence interv al(s) (i.e., the confidence level is \(1 - \)alpha), or NULL.
years: a numeric vector containing the different years of the survey.
strata: a character vector containing the different domains of the breakdown.

Arguments

inc: either a numeric vector giving the equivalized disposable income, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data.
gender: either a factor giving the gender, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data.
method: a character string specifying the method to be used. Possible values are "mean" for the mean, and "median" for the median. If weights are provided, the weighted mean or weighted median is estimated.
weights: optional; either a numeric vector giving the personal sample weights, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data.
sort: optional; either a numeric vector giving the personal IDs to be used as tie-breakers for sorting, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data.
years: optional; either a numeric vector giving the different years of the survey, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data. If supplied, values are computed for each year.
breakdown: optional; either a numeric vector giving different domains, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data. If supplied, the values for each domain are computed in addition to the overall value.
design: optional and only used if var is not NULL; either an integer vector or factor giving different strata for stratified sampling designs, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data.
cluster: optional and only used if var is not NULL; either an integer vector or factor giving different clusters for cluster sampling designs, or (if data is not NULL) a character string, an integer or a logical vector specifying the corresponding column of data.
data: an optional data.frame.
var: a character string specifying the type of variance estimation to be used, or NULL to omit variance estimation. See variance for possible values.
alpha: numeric; if var is not NULL, this gives the significance level to be used for computing the confidence interval (i.e., the confidence level is \(1 - \)alpha).
na.rm: a logical indicating whether missing values should be removed.
...: if var is not NULL, additional arguments to be passed to variance.

Author

Matthias Templ and Alexander Haider, using code for breaking down estimation by Andreas Alfons

Details

The implementation strictly follows the Eurostat definition (with default method "mean" and alternative method "median"). If weights are provided, the weighted mean or weighted median is estimated.

References

A. Alfons and M. Templ (2013) Estimation of Social Exclusion Indicators from Complex Surveys: The R Package laeken. Journal of Statistical Software, 54(15), 1--25. tools:::Rd_expr_doi("10.18637/jss.v054.i15")

Working group on Statistics on Income and Living Conditions (2004) Common cross-sectional EU indicators based on EU-SILC; the gender pay gap. EU-SILC 131-rev/04, Eurostat, Luxembourg.

Examples

Run this code

data(ses)

# overall value with mean
gpg("earningsHour", gender = "sex", weigths = "weights",
    data = ses)

# overall value with median
gpg("earningsHour", gender = "sex", weigths = "weights",
    data = ses, method = "median")

# values by education with mean
gpg("earningsHour", gender = "sex", weigths = "weights",
    breakdown = "education", data = ses)

# values by education with median
gpg("earningsHour", gender = "sex", weigths = "weights",
    breakdown = "education", data = ses, method = "median")

Run the code above in your browser using DataLab