spanPunct = FALSE
) as adjacent and hence are not eligible to
be collocations.collocations2(x, method = c("lr", "chi2", "pmi", "dice"), features = "*",
valuetype = c("glob", "regex", "fixed"), case_insensitive = TRUE,
min_count = 1, size = 2, ...)
"lr"
"chi2"
"pmi"
"dice"
"all"
"glob"
for
"glob"-style wildcard expressions; "regex"
for regular expressions;
or "fixed"
for exact matching. See valuetype for details.TRUE
n=2
) and trigram
(n=3
) collocations are currently implemented. Can be c(2,3)
(or 2:3
) to return both bi- and tri-gram collocations.tokens