Given a dtm and a similarity (adjacency) matrix, create a new column for each nonzero cell in the similarity matrix. For the term combinations (everything except the diagonal) the column names will be pasted together with a "&" separator (read as AND)
term_intersect(dtm, simmat, as_dfm = T, verbose = F, sep = " & ", par = NA)
A dgCMatrix or quanteda dfm
A quanteda dfm or a dgCMatrix.
A similarity matrix in dgCMatrix format. For instance, created with term_char_sim
If True, return as quanteda dfm
If True, report progress
The separator used for pasting the terms
If TRUE, add parentheses to colnames before combining. This is mainly for internal use, as it allows specification if OR (term_union) and AND (term_intersect) operations are combined. If NA, this is based on whether parenthese are present.