Learn R Programming

imputeR (version 2.2)

tic: Insurance Company Benchmark (COIL 2000) Data Set

Description

This data set used in the CoIL 2000 Challenge contains information on customers of an insurance company. The data consists of 86 variables and includes product usage data and socio-demographic data. Detailed information, please refer to the Source. For imputation study, this dataset can be treated as a mixed-type data.

Arguments

Format

A data frame with 266 rows and 23 variables

Details

  • V1. a numeric variable

  • V2. a categorical variable

  • ...

References

P. van der Putten and M. van Someren (eds). CoIL Challenge 2000: The Insurance Company Case. Published by Sentient Machine Research, Amsterdam. Also a Leiden Institute of Advanced Computer Science Technical Report 2000-09. June 22, 2000.