Post-processing alternatives to deal with multiples meta-features values. This method is used by the meta-features characterization methods to summarize the obtained values.
post.processing(measure, summary = c("mean", "sd"), multiple = TRUE, ...)
A list with the meta-features values.
The functions to post processing the data. See the details
to more information. Default: c("mean", "sd")
A logical value defining if the measure should return
multiple values. (Default: TRUE
)
Extra values used to the functions of summarization.
A list with the post-processed meta-features
The post processing functions are used to summarize the meta-features. They are organized into three groups: non-aggregated, descriptive statistic and distribution. Currently, the hypothesis testing post processing are not supported.
In practice, there are no difference among the types, so that more than one
type and functions can be combined. Usually, these function are used to
summarize a set of values for each meta-features. For instance, a measure
computed for each attribute can be summarized using the "mean"
and/or
"sd"
. Necessarily, a single value always use the
"non.aggregated"
function.
In addition to the native functions available in R, the following functions can be used:
Computes a histogram of the given data value. The extra
parameters 'bins
' can be used to define the number of values to
be returned. The parameters 'max
' and 'min
' are used to
define the range of the data. The default value for these parameters
are respectively 10, min(x)
and max(x)
.
See kurtosis
See max
See mean
See median
See min
See quantile
See sd
See skewness
See var
Returns the original value(s) of the meta-feature.
These functions are not restrictive, thus another functions can be applied as post-processing summarization function.
Fabio Pinto, Carlos Soares, and Joao Mendes-Moreira. Towards Automatic Generation of Metafeatures. In 20th Pacific-Asia Conference on Knowledge Discovery and Data Mining (PAKDD), pages 215 - 226, 2016.
# NOT RUN {
post.processing(runif(15))
post.processing(runif(15), c("min", "max"))
post.processing(runif(15), c("quantiles", "skewness"))
post.processing(runif(15), "histogram", bins=5, min=0, max=1)
# }
Run the code above in your browser using DataLab