- x
a matrix or data frame with data to search in.
- condition
a tidyselect expression (see
tidyselect syntax)
specifying the columns to use as condition predicates
- xvars
a tidyselect expression (see
tidyselect syntax)
specifying the columns to use for computation of correlations
- yvars
a tidyselect expression (see
tidyselect syntax)
specifying the columns to use for computation of correlations
- disjoint
an atomic vector of size equal to the number of columns of x
that specifies the groups of predicates: if some elements of the disjoint
vector are equal, then the corresponding columns of x
will NOT be
present together in a single condition. If x
is prepared with
partition()
, using the var_names()
function on x
's column names
is a convenient way to create the disjoint
vector.
- method
a character string indicating which correlation coefficient is
to be used for the test. One of "pearson"
, "kendall"
, or "spearman"
- alternative
indicates the alternative hypothesis and must be one of
"two.sided"
, "greater"
or "less"
. "greater"
corresponds to
positive association, "less"
to negative association.
- exact
a logical indicating whether an exact p-value should be computed.
Used for Kendall's tau and Spearman's rho. See stats::cor.test()
for
more information.
- min_length
the minimum size (the minimum number of predicates) of the
condition to be generated (must be greater or equal to 0). If 0, the empty
condition is generated in the first place.
- max_length
The maximum size (the maximum number of predicates) of the
condition to be generated. If equal to Inf, the maximum length of conditions
is limited only by the number of available predicates.
- min_support
the minimum support of a condition to trigger the callback
function for it. The support of the condition is the relative frequency
of the condition in the dataset x
. For logical data, it equals to the
relative frequency of rows such that all condition predicates are TRUE on it.
For numerical (double) input, the support is computed as the mean (over all
rows) of multiplications of predicate values.
- max_support
the maximum support of a condition to trigger the callback
function for it. See argument min_support
for details of what is the
support of a condition.
- max_results
the maximum number of generated conditions to execute the
callback function on. If the number of found conditions exceeds
max_results
, the function stops generating new conditions and returns
the results. To avoid long computations during the search, it is recommended
to set max_results
to a reasonable positive value. Setting max_results
to Inf
will generate all possible conditions.
- verbose
a logical scalar indicating whether to print progress messages.
- threads
the number of threads to use for parallel computation.