pvrank (version 1.1.2)

Berk: Effect of outliers

Description

Average number of births and deaths by hour of the day at a particular hospital in Brussels. The data cover a 30-year period in the nineteenth century.

Usage

data(Berk)

Format

A data frame with 24 observations on the following two variables.

Births

Number of births by hour.

Deaths

Number of deaths by hour.

Details

Twenty-two of the observations form a cluster and show little association. The two remaining observations (for noon and midnight) are dramatically smaller in both the x-direction and the y-direction. With these two included, the data show a clear positive correlation. A direct association between the two variables is nevertheless doubtful: once the outliers are removed, all correlations decrease and the associated p-values increase to the point where the null hypothesis of independence cannot be rejected at any reasonable level.
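The comparison described above can be sketched in a few lines. This is only an illustration under stated assumptions: it uses base R's cor.test() rather than any pvrank routine, compare_without() is a hypothetical helper written for this example, and the two outliers are assumed to be the rows with the smallest Births values (the row positions of noon and midnight are not stated here).

```r
# Sketch: rank correlation with and without suspected outliers.
# compare_without() is a hypothetical helper, not part of pvrank.
compare_without <- function(df, drop_rows, method = "spearman") {
  # Test on all rows (first column vs. second column)
  full <- cor.test(df[[1]], df[[2]], method = method, exact = FALSE)
  # Same test after dropping the suspected outliers
  reduced <- cor.test(df[-drop_rows, 1], df[-drop_rows, 2],
                      method = method, exact = FALSE)
  data.frame(
    subset   = c("all", "outliers removed"),
    n        = c(nrow(df), nrow(df) - length(drop_rows)),
    estimate = unname(c(full$estimate, reduced$estimate)),
    p.value  = c(full$p.value, reduced$p.value)
  )
}

# Runs only if pvrank is installed
if (requireNamespace("pvrank", quietly = TRUE)) {
  data(Berk, package = "pvrank")
  # Assumption: the outliers are the two rows with the smallest Births
  drop_rows <- order(Berk$Births)[1:2]
  print(compare_without(Berk, drop_rows))
}
```

Passing method = "kendall" repeats the comparison with Kendall's tau. A scatter plot, for example plot(Berk$Births, Berk$Deaths), is a quick way to check that the rows chosen really are the two isolated points.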

Examples

library(pvrank)  # provides the Berk data set
data(Berk)
head(Berk)       # first six hourly observations
