stream_write_table

Writes a Spark dataframe stream into a table.

R interface to Apache Spark, a fast and general
engine for big data processing, see <https://spark.apache.org/>. This
package supports connecting to local and remote Apache Spark clusters,
provides a 'dplyr' compatible back-end, and provides an interface to
Spark's built-in machine learning algorithms.

Edgar Ruiz

sparklyr

R Interface to Apache Spark

Javier Luraschi

Kevin Kuo

Kevin Ushey

JJ Allaire

Samuel Macedo

Hossein Falaki

Lu Wang

Andy Zhang

Yitao Li

Jozef Hajnala

Maciej Szymkiewicz

Wil Davis

 RStudio

 The Apache Software Foundation

stream_write_table function

<dl><dt>x</dt>
<dd>A Spark DataFrame or dplyr operation</dd>
<dt>path</dt>
<dd>The path to the file. Needs to be accessible from the cluster.
Supports the <samp>"hdfs://"</samp>, <samp>"s3a://"</samp> and <samp>"file://"</samp> protocols.</dd>
<dt>format</dt>
<dd>Specifies format of data written to table E.g.
<code>"delta"</code>, <code>"parquet"</code>. Defaults to <code>NULL</code> which will use
system default format.</dd>
<dt>mode</dt>
<dd>Specifies how data is written to a streaming sink. Valid values are
<code>"append"</code>, <code>"complete"</code> or <code>"update"</code>.</dd>
<dt>checkpoint</dt>
<dd>The location where the system will write all the checkpoint
information to guarantee end-to-end fault-tolerance.</dd>
<dt>options</dt>
<dd>A list of strings with additional options.</dd>
<dt>partition_by</dt>
<dd>Partitions the output by the given list of columns.</dd>
<dt>...</dt>
<dd>Optional arguments; currently unused.</dd></dl>

Arguments

Write Stream to Table — stream_write_table

<dl>

<dt>x</dt>
<dd>A Spark DataFrame or dplyr operation</dd>


<dt>path</dt>
<dd>The path to the file. Needs to be accessible from the cluster.
Supports the <samp>"hdfs://"</samp>, <samp>"s3a://"</samp> and <samp>"file://"</samp> protocols.</dd>


<dt>format</dt>
<dd>Specifies format of data written to table E.g.
<code>"delta"</code>, <code>"parquet"</code>. Defaults to <code>NULL</code> which will use
system default format.</dd>


<dt>mode</dt>
<dd>Specifies how data is written to a streaming sink. Valid values are
<code>"append"</code>, <code>"complete"</code> or <code>"update"</code>.</dd>


<dt>checkpoint</dt>
<dd>The location where the system will write all the checkpoint
information to guarantee end-to-end fault-tolerance.</dd>


<dt>options</dt>
<dd>A list of strings with additional options.</dd>


<dt>partition_by</dt>
<dd>Partitions the output by the given list of columns.</dd>


<dt>...</dt>
<dd>Optional arguments; currently unused.</dd>

</dl>

Write Stream to Table

stream_write_table: Write Stream to Table

Description

Usage

Arguments

See Also