sdf_distinct

Invoke distinct on a Spark DataFrame

R interface to Apache Spark, a fast and general
engine for big data processing, see <https://spark.apache.org/>. This
package supports connecting to local and remote Apache Spark clusters,
provides a 'dplyr' compatible back-end, and provides an interface to
Spark's built-in machine learning algorithms.

Edgar Ruiz

sparklyr

R Interface to Apache Spark

Javier Luraschi

Kevin Kuo

Kevin Ushey

JJ Allaire

Samuel Macedo

Hossein Falaki

Lu Wang

Andy Zhang

Yitao Li

Jozef Hajnala

Maciej Szymkiewicz

Wil Davis

 RStudio

 The Apache Software Foundation

sdf_distinct function

<dl><dt>x</dt>
<dd>A Spark DataFrame.</dd>
<dt>...</dt>
<dd>Optional variables to use when determining uniqueness.
If there are multiple rows for a given combination of inputs,
only the first row will be preserved. If omitted, will use all
variables.</dd>
<dt>name</dt>
<dd>A name to assign this table. Passed to [sdf_register()].</dd></dl>

Arguments

Invoke distinct on a Spark DataFrame — sdf_distinct

<dl>

<dt>x</dt>
<dd>A Spark DataFrame.</dd>


<dt>...</dt>
<dd>Optional variables to use when determining uniqueness.
If there are multiple rows for a given combination of inputs,
only the first row will be preserved. If omitted, will use all
variables.</dd>


<dt>name</dt>
<dd>A name to assign this table. Passed to [sdf_register()].</dd>

</dl>

Invoke distinct on a Spark DataFrame

sdf_distinct: Invoke distinct on a Spark DataFrame

Description

Usage

Arguments

See Also