Learn R Programming

asbio (version 1.9-2)

ci.strat: Confidence intervals for stratified random samples.

Description

A statistical estimate along with its associated confidence interval can be considered to be an inferential statement about the sampled population. However this statement will only be correct if the method of sampling is considered in the computations of standard errors. The function ci.strat provides appropriate computations given stratified random sampling.

Usage

ci.strat(data, strat, N.h, conf = 0.95, summarized = FALSE, use.t = FALSE, 
n.h = NULL, x.bar.h = NULL, var.h = NULL)

Value

Returns a list with two items:

strat.summary

A matrix with columns: N.h,n.h,x.bar.h and var.h

CI

Confidence intervals for \(\mu\) and T

Arguments

data

A vector of quantitative data. Required if summarized=FALSE.

strat

A vector describing strata.

N.h

A vector describing the number of experimental units for each of the k strata.

conf

Level of confidence; 1 - P(type I error).

summarized

Logical. Indicates whether summarized data are to be used.

use.t

Logical. Indicates whether t or z confidence intervals should be built.

n.h

A vector indicating the number of experimental units sampled in each of the k strata. Required if summarized=TRUE.

x.bar.h

A vector containing the sample means for each of the k strata. Required if summarized=TRUE .

var.h

A vector containing the sample variances for each of the k strata. Required if summarized=TRUE.

Author

Ken Aho

Details

the conventional formula for the sample standard error assumes simple random sampling. There are two other general types of sampling designs: stratified random sampling and cluster sampling. Since cluster sampling is generally used for surveys involving human demographics we will only describe corrections for stratified random sampling here. For more information on sample standard error adjustments for cluster sampling see Lohr (1999).

For a stratified random sampling design let N be the known total number of units in the defined population of interest, and assume that the population can be logically divided into k strata; \(N=N_1+N_2+N_3+\dots+N_k\) (i.e. we are assuming that we know both the total population size, and the population size of each stratum). We sample each of the k strata with \(n_h\) observations; \(h=1,2,\dots,k\).

We estimate the variance in the hth stratum as: $$S^{2}_{h}=\frac{1}{n_h-1}\sum_{i=1}^{n_k}(X_{hi}-\bar{X}_h)^2$$

where \(X_{hi}\) is the ith observation from the hth strata and \(\bar{X}_h\) is the hth sample mean. We estimate the true population total, T, with: $$\hat{T}=\sum_{h=i}^{k}N_h\bar{X}_h$$

We estimate the population mean, \(\mu\), with: $$\bar{X}_{str}=\frac{\hat{T}}{N}$$

An unbiased estimator for the standard error of \(\bar{X}_{str}\) is: $$S_{\bar{X}_{str}}=\sqrt{\sum_{h=1}^{k}\left(1-\frac{n_h}{N_h}\right)\left(\frac{N_h}{N}\right)^2\left(\frac{S_h^2}{n_h}\right)}$$

The standard error of \(\hat{T}\) is also of interest. Here is an unbiased estimator: $$S_{\hat{T}}=\sqrt{\sum_{h=1}^{k}\left(1-\frac{n_h}{N_h}\right)N_h^2\left(\frac{S_h^2}{n_h}\right)}$$

Note that these standard errors have both a finite population correction and adjustments for stratification built into them. Assuming that sample sizes within each stratum are large or that the sampling design has a large number of strata, a 100(1 - \(\alpha\))percent confidence interval for \(\mu\) and T can be constructed using:

$$\bar{X}_{str}\pm z_{1-\alpha/2}S_{\bar{X}_{str}}$$ $$\hat{T}\pm z_{1-\alpha/2}S_{\hat{T}}$$

In situations where sample sizes or the number of strata are small, a \(t(n - k)\) distribution can (and should) be used for calculation of confidence intervals, where \(n=n_1+n_2+\dots+n_k\).

References

Lohr, S. L. (1999) Sampling: Design and Analysis. Duxbury Press. Pacific Grove, USA.

Siniff, D. B., and Skoog, R. O. (1964) Aerial censusing of caribou using stratified random sampling. Journal of Wildlife Management 28: 391-401.

See Also

ci.mu.z

Examples

Run this code
#Data from Siniff and Skoog (1964)
Caribou<-data.frame(Stratum=c("A","B","C","D","E","F"),N.h=c(400,30,61,18,70,120),
n.h=c(98,10,37,6,39,21),x.bar.h=c(24.1,25.6,267.6,179,293.7,33.2),
var.h=c(5575,4064,347556,22798,123578,9795))
attach(Caribou)
ci.strat(data,strat=Stratum,N.h=N.h,conf=.95,summarized=TRUE,use.t=FALSE,n.h=n.h,
x.bar.h=x.bar.h,var.h=var.h)

Run the code above in your browser using DataLab