A data set with descriptions of hypothetical samples corresponding to 23 species of gilled mushrooms in the Agaricus and Lepiota Family, classified according to their edibility as (definitely) ‘edible’ or ‘poisonous’ (definitely poisonous, or of unknown edibility and not recommended).
data("Mushroom")
A data frame with 8124 observations on the following 23 variables.
class: a factor with levels edible and poisonous.
cap-shape: a factor with levels bell, conical, convex, flat, knobbed, and sunken.
cap-surface: a factor with levels fibrous, grooves, scaly, and smooth.
cap-color: a factor with levels brown, buff, cinnamon, gray, green, pink, purple, red, white, and yellow.
bruises?: a factor with levels bruises and no.
odor: a factor with levels almond, anise, creosote, fishy, foul, musty, none, pungent, and spicy.
gill-attachment: a factor with levels attached and free.
gill-spacing: a factor with levels close and crowded.
gill-size: a factor with levels broad and narrow.
gill-color: a factor with levels black, brown, buff, chocolate, gray, green, orange, pink, purple, red, white, and yellow.
stalk-shape: a factor with levels enlarging and tapering.
stalk-root: a factor with levels bulbous, club, equal, and rooted.
stalk-surface-above-ring: a factor with levels fibrous, scaly, silky, and smooth.
stalk-surface-below-ring: a factor with levels fibrous, scaly, silky, and smooth.
stalk-color-above-ring: a factor with levels brown, buff, cinnamon, gray, orange, pink, red, white, and yellow.
stalk-color-below-ring: a factor with levels brown, buff, cinnamon, gray, orange, pink, red, white, and yellow.
veil-type: a factor with the single level partial.
veil-color: a factor with levels brown, orange, white, and yellow.
ring-number: a factor with levels none, one, and two.
ring-type: a factor with levels evanescent, flaring, large, none, and pendant.
spore-print-color: a factor with levels black, brown, buff, chocolate, green, orange, purple, white, and yellow.
population: a factor with levels abundant, clustered, numerous, scattered, several, and solitary.
habitat: a factor with levels grasses, leaves, meadows, paths, urban, waste, and woods.
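Several of the variable names listed above (for example cap-shape and bruises?) are not syntactic R names, so they need backticks or string indexing. The following is a minimal sketch of that access pattern. It uses a small toy data frame named toy whose rows are invented for illustration only; the same indexing should carry over to Mushroom, assuming its columns are named exactly as documented here.

```r
# Toy stand-in with a few of the documented column names. The rows are
# made up for illustration and are NOT taken from the Mushroom data.
toy <- data.frame(
  class       = factor(c("edible", "poisonous", "edible")),
  `cap-shape` = factor(c("bell", "convex", "flat")),
  `bruises?`  = factor(c("bruises", "no", "no")),
  check.names = FALSE  # keep "cap-shape" and "bruises?" unchanged
)

# Non-syntactic names need backticks when used with `$` ...
toy$`cap-shape`

# ... or a character string when used with `[[`.
toy[["bruises?"]]

# Once selected, the columns behave like any other factor,
# e.g. in a cross-tabulation against the class label.
bruise_by_class <- table(toy[["bruises?"]], toy$class)
bruise_by_class
```

Without check.names = FALSE, data.frame() would rewrite these names into syntactic ones (cap.shape, bruises.), which would no longer match the names documented here.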
The records are drawn from G. H. Lincoff (1981) (Pres.), The Audubon Society Field Guide to North American Mushrooms. New York: Alfred A. Knopf. (See pages 500–525 for the Agaricus and Lepiota Family.)
The Guide clearly states that there is no simple rule for determining the edibility of a mushroom; no rule like “leaflets three, let it be” for Poisonous Oak and Ivy.
Unused levels in the original data were dropped.
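What dropping unused levels means for a factor can be sketched with base R's droplevels(). The factor below is hypothetical (the level "other" is made up), and this is not necessarily how the data set itself was prepared.

```r
# Hypothetical factor: "other" is declared as a level but never observed.
spacing <- factor(c("close", "crowded", "close"),
                  levels = c("close", "crowded", "other"))
levels(spacing)   # all three declared levels, including the unused one

# droplevels() keeps only the levels that actually occur in the data.
trimmed <- droplevels(spacing)
levels(trimmed)
```

After dropping, summaries such as table() or summary() no longer report zero-count categories for the removed levels.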
The current version of the UC Irvine Machine Learning Repository Mushroom data set is available from doi:10.24432/C5959T.
Blake, C.L. & Merz, C.J. (1998). UCI Repository of Machine Learning Databases. Irvine, CA: University of California, Department of Information and Computer Science. Formerly available from http://www.ics.uci.edu/~mlearn/MLRepository.html.
data("Mushroom")
summary(Mushroom)