ml_pca

An object coercable to a Spark DataFrame (typically, a
<code>tbl_spark</code>).

The columns to use in the principal components
analysis. Defaults to all columns in <code>x</code>.

features

The number of principal components.

Optional arguments, used to affect the model generated. See
<code><a rd-options="" href="/link/ml_options?package=sparklyr&version=0.6.4" data-mini-rdoc="sparklyr::ml_options">ml_options</a></code> for more details.

ml.options

Optional arguments. The <code>data</code> argument can be used to
specify the data to be used when <code>x</code> is a formula; this allows calls
of the form <code>ml_linear_regression(y ~ x, data = tbl)</code>, and is
especially useful in conjunction with <code><a rd-options="" href="/link/do?package=sparklyr&version=0.6.4" data-mini-rdoc="sparklyr::do">do</a></code>.

Perform principal components analysis on a Spark DataFrame.

R interface to Apache Spark, a fast and general engine for big data
processing, see <http://spark.apache.org>. This package supports connecting to
local and remote Apache Spark clusters, provides a 'dplyr' compatible back-end,
and provides an interface to Spark's built-in machine learning algorithms.

Javier Luraschi

sparklyr

R Interface to Apache Spark

Kevin Ushey

JJ Allaire

 RStudio

 The Apache Software Foundation

ml_pca function

Optional arguments, used to affect the model generated. See
<code><a rd-options='' href='ml_options'>ml_options</a></code> for more details.

Optional arguments. The <code>data</code> argument can be used to
specify the data to be used when <code>x</code> is a formula; this allows calls
of the form <code>ml_linear_regression(y ~ x, data = tbl)</code>, and is
especially useful in conjunction with <code><a rd-options='' href='do'>do</a></code>.

ml_pca: Spark ML -- Principal Components Analysis

Description

Usage

Arguments

See Also