dprep (version 3.0.2)

breastw: The Breast Wisconsin dataset

Description

This is the Breast Wisconsin dataset from the UCI Machine Learning Repository. It has 699 instances (16 of them with missing values), 9 predictor attributes, and one class attribute taking the values 1 (benign tumor) and 2 (malignant tumor).

Usage

data(breastw)

Format

A data frame with 699 observations on the following 10 variables.
V1: Clump Thickness
V2: Uniformity of Cell Size
V3: Uniformity of Cell Shape
V4: Marginal Adhesion
V5: Single Epithelial Cell Size
V6: Bare Nuclei
V7: Bland Chromatin
V8: Normal Nucleoli
V9: Mitoses
V10: Class (1 for benign, 2 for malignant)

Details

All the features take values in the range 1-10. The dataset contains 699 observations, 16 of which have missing values. It is recommended to impute these values before further analysis.
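
As a rough illustration of the recommendation above, the following base R sketch counts missing values and fills them with the column mean. It uses a small made-up data frame (toy) that only mimics the layout of breastw, and a hypothetical helper impute_mean; neither is part of dprep, and plain column-mean imputation is an assumption here, not necessarily what ce.impute does internally.

```r
# Toy stand-in mimicking the breastw layout (predictors on a 1-10 scale,
# class in V10). The values are invented for illustration only.
toy <- data.frame(
  V1  = c(5, 3, 8, 1, 10, 4),
  V6  = c(1, NA, 10, 2, NA, 1),
  V10 = c(1, 1, 2, 1, 2, 1)
)

# Count missing values per column and rows with at least one NA.
na_per_column <- colSums(is.na(toy))
rows_with_na  <- sum(!complete.cases(toy))

# Hypothetical helper: replace each NA in the given columns
# with the mean of the observed values in that column.
impute_mean <- function(df, cols) {
  for (col in cols) {
    missing <- is.na(df[[col]])
    df[[col]][missing] <- mean(df[[col]], na.rm = TRUE)
  }
  df
}

toy_imputed <- impute_mean(toy, c("V1", "V6"))
```

On the real data, the same checks could be run on breastw itself (for example colSums(is.na(breastw))) once the dataset has been loaded.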

Examples

# Impute the missing values in the nine predictors (columns 1 to 9) using the mean
library(dprep)
data(breastw)
ce.impute(breastw,"mean",1:9)