simil

dist

a <a rd-options="" href="/link/matrix?package=proxyC&version=0.1.0" data-mini-rdoc="proxyC::matrix">matrix</a> or <a rd-options="" href="/link/Matrix?package=proxyC&version=0.1.0" data-mini-rdoc="proxyC::Matrix">Matrix</a> object

if a <a rd-options="" href="/link/matrix?package=proxyC&version=0.1.0" data-mini-rdoc="proxyC::matrix">matrix</a> or <a rd-options="" href="/link/Matrix?package=proxyC&version=0.1.0" data-mini-rdoc="proxyC::Matrix">Matrix</a> object is provided, proximity
between documents or features in <code>x</code> and <code>y</code> is computed.

integer indicating margin of similiarty/distance computation. 1
indicates rows or 2 indicates columns.

margin

method to compute similarity or distance

method

the minimum similiarty value to be recoded.

min_simil

an integer value specifying top-n most similiarty values to be
recorded.

rank

Fast similarity/distance computation function for large sparse matrices. You
can floor small similairty value to to save computation time and storage
space by an arbitrary threashold (<code>min_simil</code>) or rank (<code>rank</code>).
Please increase the numbner of threads for better perfromance using
<code><a rd-options="RcppParallel" href="/link/setThreadOptions?package=proxyC&version=0.1.0&to=RcppParallel" data-mini-rdoc="RcppParallel::setThreadOptions">setThreadOptions</a></code>.


Computes proximity between rows or columns of large matrices efficiently in C++.
Functions are optimized for large sparse matrices using the Armadillo and Intel TBB libraries.
Among several built-in similarity/distance measures, computation of correlation,
cosine similarity and Euclidean distance is particularly fast.

Kohei Watanabe

proxyC

Computes Proximity in Large Sparse Matrices

simil function

a <a rd-options='' href='matrix'>matrix</a> or <a rd-options='' href='Matrix'>Matrix</a> object

if a <a rd-options='' href='matrix'>matrix</a> or <a rd-options='' href='Matrix'>Matrix</a> object is provided, proximity
between documents or features in <code>x</code> and <code>y</code> is computed.

Fast similarity/distance computation function for large sparse matrices. You
can floor small similairty value to to save computation time and storage
space by an arbitrary threashold (<code>min_simil</code>) or rank (<code>rank</code>).
Please increase the numbner of threads for better perfromance using
<code><a rd-options='RcppParallel' href='setThreadOptions'>setThreadOptions</a></code>.

simil: Compute similiarty/distance between raws or columns of large matrices

Description

Usage

Arguments

Examples