If only one sample is provided, this function computes an adjacency matrix,
i.e., a binary matrix whose \((i,j)\) element is one if and only if
elements \(i\) and \(j\) in the partition have the same cluster label. If
multiple samples are provided (as rows of the <code>x</code> matrix), this function
computes the \(n\)-by-\(n\) matrix whose \((i,j)\) element gives the
relative frequency (i.e., estimated probability) that items \(i\) and
\(j\) are in the same subset (i.e., cluster). This is the mean of the
adjacency matrices of the provided samples.

The SALSO algorithm is an efficient randomized greedy search method to find a point estimate for a random partition based on a loss function and posterior Monte Carlo samples. The algorithm is implemented for many loss functions, including the Binder loss and a generalization of the variation of information loss, both of which allow for unequal weights on the two types of clustering mistakes. Efficient implementations are also provided for Monte Carlo estimation of the posterior expected loss of a given clustering estimate. See Dahl, Johnson, Müller (2022) <doi:10.1080/10618600.2022.2069779>.

David B Dahl

salso

Search Algorithms and Loss Functions for Bayesian Clustering

David B. Dahl

Devin J. Johnson

Peter Müller

Andrés Felipe Barrientos

Garritt Page

David Dunson

Alex Crichton

Brendan Zabarauskas

David Tolnay

Jim Turner

Josh Stone

R. Janis Goldschmidt

Sean McArthur

Stefan Lankes

The Cranelift Project Developers 

The CryptoCorrosion Contributors 

The Rand Project Developers 

The Rust Project Developers 

Ulrik Sverdrup "bluss"

bluss 

psm function

<dl><dt>x</dt>
<dd>A \(B\)-by-\(n\) matrix, where each of the \(B\) rows
represents a clustering of \(n\) items using cluster labels. For the
\(b\)th clustering, items \(i\) and \(j\) are in the same cluster if
<code>x[b,i] == x[b,j]</code>.</dd>
<dt>nCores</dt>
<dd>The number of CPU cores to use, i.e., the number of
simultaneous runs at any given time. A value of zero indicates to use all
cores on the system.</dd></dl>

Arguments

Compute an Adjacency or Pairwise Similarity Matrix — psm

<dl>

<dt>x</dt>
<dd>A \(B\)-by-\(n\) matrix, where each of the \(B\) rows
represents a clustering of \(n\) items using cluster labels. For the
\(b\)th clustering, items \(i\) and \(j\) are in the same cluster if
<code>x[b,i] == x[b,j]</code>.</dd>


<dt>nCores</dt>
<dd>The number of CPU cores to use, i.e., the number of
simultaneous runs at any given time. A value of zero indicates to use all
cores on the system.</dd>

</dl>

Compute an Adjacency or Pairwise Similarity Matrix

psm: Compute an Adjacency or Pairwise Similarity Matrix

Description

Usage

Value

Arguments

Examples