A data set with descriptions of hypothetical samples corresponding to 23 species of gilled mushrooms in the Agaricus and Lepiota Family, classified according to their edibility as (definitely) ‘edible’ or ‘poisonous’ (definitely poisonous, or of unknown edibility and not recommended).
data("Mushroom")
A data frame with 8124 observations on the following 23 variables.
class
a factor with levels edible and poisonous.
cap-shape
a factor with levels bell, conical, convex, flat, knobbed, and sunken.
cap-surface
a factor with levels fibrous, grooves, scaly, and smooth.
cap-color
a factor with levels brown, buff, cinnamon, gray, green, pink, purple, red, white, and yellow.
bruises?
a factor with levels bruises and no.
odor
a factor with levels almond, anise, creosote, fishy, foul, musty, none, pungent, and spicy.
gill-attachment
a factor with levels attached and free.
gill-spacing
a factor with levels close and crowded.
gill-size
a factor with levels broad and narrow.
gill-color
a factor with levels black, brown, buff, chocolate, gray, green, orange, pink, purple, red, white, and yellow.
stalk-shape
a factor with levels enlarging and tapering.
stalk-root
a factor with levels bulbous, club, equal, and rooted.
stalk-surface-above-ring
a factor with levels fibrous, scaly, silky, and smooth.
stalk-surface-below-ring
a factor with levels fibrous, scaly, silky, and smooth.
stalk-color-above-ring
a factor with levels brown, buff, cinnamon, gray, orange, pink, red, white, and yellow.
stalk-color-below-ring
a factor with levels brown, buff, cinnamon, gray, orange, pink, red, white, and yellow.
veil-type
a factor with the single level partial.
veil-color
a factor with levels brown, orange, white, and yellow.
ring-number
a factor with levels none, one, and two.
ring-type
a factor with levels evanescent, flaring, large, none, and pendant.
spore-print-color
a factor with levels black, brown, buff, chocolate, green, orange, purple, white, and yellow.
population
a factor with levels abundant, clustered, numerous, scattered, several, and solitary.
habitat
a factor with levels grasses, leaves, meadows, paths, urban, waste, and woods.
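Several of the variable names listed above, such as cap-shape and bruises?, are not syntactic R names, so they cannot be written after `$` without quoting. The following is a minimal sketch of the two usual access forms. It assumes the columns are stored under exactly the names listed here, and `toy` is a hypothetical three-row stand-in with made-up values so that the snippet runs without the package installed.

```r
# Hypothetical stand-in with the same column-name style as Mushroom.
# The values are illustrative only and are not rows from the data set.
toy <- data.frame(
  class       = factor(c("edible", "poisonous", "edible")),
  `cap-shape` = factor(c("bell", "convex", "convex")),
  `bruises?`  = factor(c("bruises", "no", "no")),
  check.names = FALSE  # keep "cap-shape" and "bruises?" unchanged
)

# Backticks are needed with `$` because of the hyphen and the question mark.
shape_counts <- table(toy$`cap-shape`)

# `[[` with a character string works for any column name.
bruise_counts <- table(toy[["bruises?"]])

print(shape_counts)
print(bruise_counts)
```

The same forms should apply to the real data once it is loaded, for example Mushroom$`cap-shape` or Mushroom[["bruises?"]].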
The records are drawn from G. H. Lincoff (1981) (Pres.), The Audubon Society Field Guide to North American Mushrooms. New York: Alfred A. Knopf. (See pages 500--525 for the Agaricus and Lepiota Family.)
The Guide clearly states that there is no simple rule for determining the edibility of a mushroom; no rule like “leaflets three, let it be” for poison oak and poison ivy.
Unused levels in the original data were dropped.
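Dropping unused levels means that levels() on each factor reports only values that actually occur. Subsetting rows can bring unused levels back, and base R's droplevels() removes them again. The sketch below illustrates this, along with a check for single-level factors, on a hypothetical stand-in data frame called `toy`; its values are made up for illustration and are not rows from Mushroom.

```r
# Hypothetical stand-in; values are illustrative, not rows from Mushroom.
toy <- data.frame(
  class       = factor(c("edible", "poisonous", "edible", "poisonous")),
  odor        = factor(c("almond", "foul", "none", "foul")),
  `veil-type` = factor(rep("partial", 4)),
  check.names = FALSE
)

# Subsetting keeps the full level set, so "foul" lingers as an unused level.
edible <- toy[toy$class == "edible", ]
print(levels(edible$odor))

# droplevels() removes levels that no longer occur in the subset.
edible <- droplevels(edible)
print(levels(edible$odor))

# Factors with a single level cannot distinguish between observations.
n_levels <- vapply(toy, nlevels, integer(1))
constant_cols <- names(n_levels)[n_levels == 1L]
print(constant_cols)
```

Judging from the level listing on this page, veil-type is the one column such a check would be expected to flag on Mushroom itself.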
The current version of the UC Irvine Machine Learning Repository Mushroom data set is available from doi:10.24432/C5959T.
Blake, C.L. & Merz, C.J. (1998). UCI Repository of Machine Learning Databases. Irvine, CA: University of California, Department of Information and Computer Science. Formerly available from http://www.ics.uci.edu/~mlearn/MLRepository.html.
data("Mushroom")
summary(Mushroom)
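Beyond summary(), a cross-tabulation of a single attribute against class is a quick way to see how that attribute's levels split between the two classes. The sketch below is only illustrative: `class_by` is a hypothetical helper name, not a function from any package, and `toy` is a made-up stand-in so the code runs without the data set.

```r
# Cross-tabulate one attribute against the class column of a data frame.
# `class_by` is a hypothetical helper, not part of any package.
class_by <- function(df, var) {
  table(df[[var]], df[["class"]], dnn = c(var, "class"))
}

# Hypothetical stand-in; values are illustrative, not rows from Mushroom.
toy <- data.frame(
  class = factor(c("edible", "poisonous", "edible", "poisonous")),
  odor  = factor(c("almond", "foul", "none", "foul"))
)

tab <- class_by(toy, "odor")
print(tab)

# Row-wise proportions show how each level splits across the classes.
print(prop.table(tab, margin = 1))
```

With the data loaded, a call such as class_by(Mushroom, "odor") would produce the corresponding table for the real observations.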