
Compute the value of predIntNormK
is called by predIntNorm
.
predIntNormK(n, df = n - 1, n.mean = 1, k = 1,
method = "Bonferroni", pi.type = "two-sided",
conf.level = 0.95)
a positive integer greater than 2 indicating the sample size upon which the prediction interval is based.
the degrees of freedom associated with the prediction interval. The default is
df=n-1
.
positive integer specifying the sample size associated with the n.mean=1
(i.e., individual observations). Note that all
future averages must be based on the same sample size.
positive integer specifying the number of future observations or averages the
prediction interval should contain with confidence level conf.level
.
The default value is k=1
.
character string specifying the method to use if the number of future observations
(k
) is greater than 1. The possible values are method="Bonferroni"
(approximate method based on Bonferonni inequality; the default), and
method="exact"
(exact method due to Dunnett, 1955). See the DETAILS section for
more information. This argument is ignored if k=1
.
character string indicating what kind of prediction interval to compute.
The possible values are pi.type="two-sided"
(the default), pi.type="lower"
,
and pi.type="upper"
.
a scalar between 0 and 1 indicating the confidence level of the prediction interval.
The default value is conf.level=0.95
.
A numeric scalar equal to
A prediction interval for some population is an interval on the real line constructed
so that it will contain
Let mean=
sd=
n.mean=
For a normal distribution, the form of a two-sided tolIntNorm
).
Similarly, the form of a one-sided lower prediction interval is:
predIntNormK
computes the value of predIntNorm
.
The Derivation of K for One Future Observation or Average (k = 1)
Let mean=
sd=
A true two-sided
More generally, a true two-sided
Because the values of
For a two-sided prediction interval (pi.type="two-sided"
),
the constant pi.type="lower"
or pi.type="lower"
), the prediction interval
is given by:
The formulas for these prediction intervals are derived as follows. Let
The Derivation of K for More than One Future Observation or Average (k >1)
When predIntNormK
allows for two ways to compute
method="exact"
), and
an approximate (conservative) method based on the Bonferroni inequality
(method="Bonferroni"
; see Miller, 1981a, pp.8, 67-70;
Gibbons et al., 2009, p.4). Each of these methods is explained below.
Exact Method Due to Dunnett (1955) (method="exact"
)
Dunnett (1955) derived the value of
When pi.type="lower"
or pi.type="upper"
, the value of
When pi.type="two-sided"
, the value of
Approximate Method Based on the Bonferroni Inequality (method="Bonferroni"
)
As shown above, when
Berthouex, P.M., and L.C. Brown. (2002). Statistics for Environmental Engineers. Lewis Publishers, Boca Raton.
Dunnett, C.W. (1955). A Multiple Comparisons Procedure for Comparing Several Treatments with a Control. Journal of the American Statistical Association 50, 1096-1121.
Dunnett, C.W. (1964). New Tables for Multiple Comparisons with a Control. Biometrics 20, 482-491.
Gibbons, R.D., D.K. Bhaumik, and S. Aryal. (2009). Statistical Methods for Groundwater Monitoring, Second Edition. John Wiley & Sons, Hoboken.
Hahn, G.J. (1969). Factors for Calculating Two-Sided Prediction Intervals for Samples from a Normal Distribution. Journal of the American Statistical Association 64(327), 878-898.
Hahn, G.J. (1970a). Additional Factors for Calculating Prediction Intervals for Samples from a Normal Distribution. Journal of the American Statistical Association 65(332), 1668-1676.
Hahn, G.J. (1970b). Statistical Intervals for a Normal Population, Part I: Tables, Examples and Applications. Journal of Quality Technology 2(3), 115-125.
Hahn, G.J. (1970c). Statistical Intervals for a Normal Population, Part II: Formulas, Assumptions, Some Derivations. Journal of Quality Technology 2(4), 195-206.
Hahn, G.J., and W.Q. Meeker. (1991). Statistical Intervals: A Guide for Practitioners. John Wiley and Sons, New York.
Hahn, G., and W. Nelson. (1973). A Survey of Prediction Intervals and Their Applications. Journal of Quality Technology 5, 178-188.
Helsel, D.R., and R.M. Hirsch. (1992). Statistical Methods in Water Resources Research. Elsevier, New York.
Helsel, D.R., and R.M. Hirsch. (2002). Statistical Methods in Water Resources. Techniques of Water Resources Investigations, Book 4, chapter A3. U.S. Geological Survey. (available on-line at: http://pubs.usgs.gov/twri/twri4a3/).
Millard, S.P., and Neerchal, N.K. (2001). Environmental Statistics with S-PLUS. CRC Press, Boca Raton, Florida.
Miller, R.G. (1981a). Simultaneous Statistical Inference. McGraw-Hill, New York.
USEPA. (2009). Statistical Analysis of Groundwater Monitoring Data at RCRA Facilities, Unified Guidance. EPA 530/R-09-007, March 2009. Office of Resource Conservation and Recovery Program Implementation and Information Division. U.S. Environmental Protection Agency, Washington, D.C.
USEPA. (2010). Errata Sheet - March 2009 Unified Guidance. EPA 530/R-09-007a, August 9, 2010. Office of Resource Conservation and Recovery, Program Information and Implementation Division. U.S. Environmental Protection Agency, Washington, D.C.
predIntNorm
, predIntNormSimultaneous
,
predIntLnorm
, tolIntNorm
,
Normal, estimate.object
, enorm
, eqnorm
.
# NOT RUN {
# Compute the value of K for a two-sided 95% prediction interval
# for the next observation given a sample size of n=20.
predIntNormK(n = 20)
#[1] 2.144711
#--------------------------------------------------------------------
# Compute the value of K for a one-sided upper 99% prediction limit
# for the next 3 averages of order 2 (i.e., each of the 3 future
# averages is based on a sample size of 2 future observations) given a
# samle size of n=20.
predIntNormK(n = 20, n.mean = 2, k = 3, pi.type = "upper",
conf.level = 0.99)
#[1] 2.258026
#----------
# Compare the result above that is based on the Bonferroni method
# with the exact method.
predIntNormK(n = 20, n.mean = 2, k = 3, method = "exact",
pi.type = "upper", conf.level = 0.99)
#[1] 2.251084
#--------------------------------------------------------------------
# Example 18-1 of USEPA (2009, p.18-9) shows how to construct a 95%
# prediction interval for 4 future observations assuming a
# normal distribution based on arsenic concentrations (ppb) in
# groundwater at a solid waste landfill. There were 4 years of
# quarterly monitoring, and years 1-3 are considered background,
# So the sample size for the prediciton limit is n = 12,
# and the number of future samples is k = 4.
predIntNormK(n = 12, k = 4, pi.type = "upper")
#[1] 2.698976
# }
Run the code above in your browser using DataLab