subset_correction: Subset Correction of a predicted result
Description
This method restrict a multi-label learner to predict only label combinations
whose existence is present in the (training) data. To this all labelsets
that are predicted but are not found on training data is replaced by the most
similar labelset.
Usage
subset_correction(mlresult, train_y, probability = FALSE)
Arguments
mlresult
An object of mlresult that contain the scores and bipartition
values.
train_y
A matrix/data.frame with all labels values of the training
dataset or a mldr train dataset.
probability
A logical value. If TRUE the predicted values are
the score between 0 and 1, otherwise the values are bipartition 0 or 1.
(Default: FALSE)
Value
A new mlresult where all results are present in the training
labelsets.
Details
If the most similar is not unique, those label combinations with higher
frequency in the training data are preferred. The Hamming loss distance is
used to determine the difference between the labelsets.
References
Senge, R., Coz, J. J. del, & Hullermeier, E. (2013). Rectifying classifier
chains for multi-label classification. In Workshop of Lernen, Wissen &
Adaptivitat (LWA 2013) (pp. 162-169). Bamberg, Germany.