ml_fpgrowth

ml_association_rules

ml_freq_itemsets

A parallel FP-growth algorithm to mine frequent itemsets.

R interface to Apache Spark, a fast and general
engine for big data processing, see <https://spark.apache.org/>. This
package supports connecting to local and remote Apache Spark clusters,
provides a 'dplyr' compatible back-end, and provides an interface to
Spark's built-in machine learning algorithms.

Edgar Ruiz

sparklyr

R Interface to Apache Spark

Javier Luraschi

Kevin Kuo

Kevin Ushey

JJ Allaire

Samuel Macedo

Hossein Falaki

Lu Wang

Andy Zhang

Yitao Li

Jozef Hajnala

Maciej Szymkiewicz

Wil Davis

 RStudio

 The Apache Software Foundation

ml_fpgrowth function

<dl><dt>x</dt>
<dd>A <code>spark_connection</code>, <code>ml_pipeline</code>, or a <code>tbl_spark</code>.</dd>
<dt>items_col</dt>
<dd>Items column name. Default: "items"</dd>
<dt>min_confidence</dt>
<dd>Minimum confidence for generating association rules.
<code>min_confidence</code> does not affect the mining of frequent itemsets;
it only affects the generation of association rules. Default: 0.8</dd>
<dt>min_support</dt>
<dd>Minimum support level for a frequent pattern, in the range [0.0, 1.0].
Any pattern that appears more than (min_support * size-of-the-dataset) times
is output in the frequent itemsets. Default: 0.3</dd>
<dt>prediction_col</dt>
<dd>Prediction column name.</dd>
<dt>uid</dt>
<dd>A character string used to uniquely identify the ML estimator.</dd>
<dt>...</dt>
<dd>Optional arguments; currently unused.</dd>
<dt>model</dt>
<dd>A fitted FPGrowth model returned by <code>ml_fpgrowth()</code></dd></dl>
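
The roles of <code>min_support</code> and <code>min_confidence</code> can be illustrated with a small base R sketch that needs neither Spark nor sparklyr. The toy transactions and the helper functions `support()` and `confidence()` are hypothetical names introduced only for this illustration; they are not part of the sparklyr API and do not reproduce the parallel FP-growth implementation. How values exactly on a threshold are handled is an assumption here, so the example avoids boundary cases.

```r
# Toy transactions (hypothetical data); each element is one basket of items.
transactions <- list(
  c("bread", "milk"),
  c("bread", "butter"),
  c("bread", "milk", "butter"),
  c("milk"),
  c("bread", "milk")
)

# Fraction of transactions that contain every item in `itemset`.
support <- function(itemset, transactions) {
  mean(vapply(transactions, function(t) all(itemset %in% t), logical(1)))
}

# Confidence of the rule antecedent => consequent:
# support of both sides together divided by support of the antecedent.
confidence <- function(antecedent, consequent, transactions) {
  support(c(antecedent, consequent), transactions) /
    support(antecedent, transactions)
}

min_support <- 0.3     # default quoted in the argument list
min_confidence <- 0.8  # default quoted in the argument list

# {bread, milk} appears in 3 of 5 baskets.
sup_bread_milk <- support(c("bread", "milk"), transactions)

# bread => milk: 3 baskets with both, 4 baskets with bread.
conf_bread_milk <- confidence("bread", "milk", transactions)

# butter => bread: both baskets containing butter also contain bread.
conf_butter_bread <- confidence("butter", "bread", transactions)

# Compare against the thresholds (boundary handling is an assumption).
is_frequent <- sup_bread_milk > min_support
rule_bread_milk_kept <- conf_bread_milk >= min_confidence
rule_butter_bread_kept <- conf_butter_bread >= min_confidence
```

In this sketch the itemset {bread, milk} clears the support threshold, yet the rule bread => milk is dropped because its confidence falls below 0.8, while butter => bread is kept. This mirrors the statement above that <code>min_confidence</code> filters rules without changing which itemsets count as frequent.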

Arguments

Frequent Pattern Mining -- FPGrowth — ml_fpgrowth

