A data set that contains the spectra of six different cultivars of the same fruit (cantaloupe - Cucumis melo L. Cantaloupensis group) obtained from Colin Greensill (Faculty of Engineering and Physical Systems, Central Queensland University, Rockhampton, Australia). The total data set contained 2818 spectra measured in 256 wavelengths. For illustrative purposes are considered only three cultivars out of it, named D, M and HA with sizes 490, 106 and 500, respectively. Thus the data set thus contains 1096 observations. For more details about this data set see the references below.
data(fruit)
A data frame with 1096 rows and 257 variables (one grouping variable -- cultivar
-- and 256 measurement variables).
Hubert, M. and Van Driessen, K., (2004). Fast and robust discriminant analysis. Computational Statistics and Data Analysis, 45(2):301--320. tools:::Rd_expr_doi("10.1016/S0167-9473(02)00299-2").
Vanden Branden, K and Hubert, M, (2005). Robust classification in high dimensions based on the SIMCA Method. Chemometrics and Intelligent Laboratory Systems, 79(1-2), pp. 10--21. tools:::Rd_expr_doi("10.1016/j.chemolab.2005.03.002").
Hubert, M, Rousseeuw, PJ and Verdonck, T, (2012). A Deterministic Algorithm for Robust Location and Scatter. Journal of Computational and Graphical Statistics, 21(3), pp 618--637. tools:::Rd_expr_doi("10.1080/10618600.2012.672100").
data(fruit)
table(fruit$cultivar)
Run the code above in your browser using DataLab