A dataset containing 1990 U.S. census data on first names.
data(NAMES)
A data frame with 5493 rows and 7 variables:
name. A first name.
per.freq. Frequency in percent of the name by gender.
cum.freq. Cumulative frequency in percent of the name by gender.
rank. Rank of the name by gender.
gender. Gender of the combined male/female list (M/F).
gender2. Gender of the combined male/female list with "B" in place of overlapping (M/F) names.
pred.sex. Predicted gender of the names with B's in gender2 replaced with the gender that had a higher per.freq.
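
The relationship between gender, gender2, and pred.sex can be illustrated with a small sketch. This is not the code used to build NAMES; it is one plausible reading of the variable descriptions above, applied to a hypothetical `toy` data frame whose names and frequencies are made up for illustration. It assumes that a name found in both the male and female lists appears once per gender in the combined list.

```r
# Hypothetical rows with made-up frequencies (NOT real census values).
toy <- data.frame(
  name     = c("ALEX", "ALEX", "MARY", "JOHN"),
  per.freq = c(0.100, 0.010, 2.000, 3.000),
  gender   = c("M", "F", "F", "M"),
  stringsAsFactors = FALSE
)

# gender2: names that occur under both genders are marked "B".
both <- unique(toy$name[duplicated(toy$name)])
toy$gender2 <- ifelse(toy$name %in% both, "B", toy$gender)

# For each name, find the gender of the row with the highest per.freq.
best <- sapply(
  split(toy, toy$name),
  function(d) d$gender[which.max(d$per.freq)]
)

# pred.sex: replace each "B" with that higher-frequency gender.
toy$pred.sex <- ifelse(toy$gender2 == "B", unname(best[toy$name]), toy$gender2)

print(toy)
```

In this toy data, both ALEX rows have gender2 equal to "B", and pred.sex resolves them to "M" because the male row has the larger per.freq. Names found under only one gender keep that gender in all three columns.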