powered by
It calculates a similarity/distance matrix from either an incidence data frame/matrix or a coin object.
sim(input, procedures="Jaccard", level=.95, distance=FALSE, minimum=1, maximum=Inf, sort=FALSE, decreasing=FALSE, weight = NULL, pairwise = FALSE)
A similarity/distance matrix.
a binary data frame or a coin object, let's say an R list composed by a number of scenarios ($n) and a coincidence matrix with frequencies ($f).
$n
$f
a vector of statistics of similarity. See details below.
confidence level
convert the similarity matrix into a distance matrix
minimum frequency to obtain a similarity/distance measure.
maxium frequency to obtain a similarity/distance measure.
sort the list according to the values of a statistic. See details below
order in a decreasing way.
a vector of weights. Optimal for data.framed tables
Pairwise mode of handling missing values if TRUE. Listwise by default.
Modesto Escobar, Department of Sociology and Communication, University of Salamanca. See https://sociocav.usal.es/blog/modesto-escobar/
Possible measures in procedures are
Frequencies (f), Relative frequencies (x), Conditional frequencies (i), Coincidence degree (cc), Probable degree (cp),
Expected (e), Confidence interval (con)
Matching (m), Rogers & Tanimoto (t), Gower (g), Sneath (s), Anderberg (and),
Jaccard (j), Dice (d), antiDice (a), Ochiai (o), Kulczynski (k),
Hamann (ham), Yule (y), Pearson (p), odds ratio (od), Rusell (r),
Haberman (h), Z value of Haberman (z).
Hypergeometric p greater value (hyp).
# From a random incidence matrix I(25X4) I<-matrix(rbinom(100,1,.5),nrow=25,ncol=4, dimnames=list(NULL,c("A","B","C","D"))) sim(I) #Same results C<-coin(I) sim(C)
Run the code above in your browser using DataLab