Learn R Programming

dslabs (version 0.8.0)

brca: Breast Cancer Wisconsin Diagnostic Dataset from UCI Machine Learning Repository

Description

Biopsy features for classification of 569 malignant (cancer) and benign (not cancer) breast masses.

Usage

brca

Arguments

Format

An object of class list.

Details

Features were computationally extracted from digital images of fine needle aspirate biopsy slides. Features correspond to properties of cell nuclei, such as size, shape and regularity. The mean, standard error, and worst value of each of 10 nuclear parameters is reported for a total of 30 features.

This is a classic dataset for training and benchmarking machine learning algorithms.

  • y. The outcomes. A factor with two levels denoting whether a mass is malignant ("M") or benign ("B").

  • x. The predictors. A matrix with the mean, standard error and worst value of each of 10 nuclear measurements on the slide, for 30 total features per biopsy:

    • radius. Nucleus radius (mean of distances from center to points on perimeter).

    • texture. Nucleus texture (standard deviation of grayscale values).

    • perimeter. Nucleus perimeter.

    • area. Nucleus area.

    • smoothness. Nucleus smoothness (local variation in radius lengths).

    • compactness. Nucleus compactness (perimeter^2/area - 1).

    • concavity, Nucleus concavity (severity of concave portions of the contour).

    • concave_pts. Number of concave portions of the nucleus contour.

    • symmetry. Nucleus symmetry.

    • fractal_dim. Nucleus fractal dimension ("coastline approximation" -1).

Examples

Run this code
table(brca$y)
dim(brca$x)
head(brca$x)

Run the code above in your browser using DataLab