Calculate N-Gram-wise Dice coefficients for a range of N-Gram lengths.
multi_dice_coefficient_matching(document_1, document_2, ngram_sizes = c(1:50),
remove_duplicates = TRUE)
A vector of strings (one per line or one per sentence), or a list of vectors of tokens (one per line or one per sentence).
Same format as document_1; used for comparison.
A numeric vector of N-Gram lengths for use in calculating Dice coefficients.
Logical indicating whether duplicate N-Grams should be removed before matching. Defaults to TRUE.
A data.frame with Dice coefficients based on different N-Gram lengths.
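To illustrate the kind of result described above, here is a minimal base R sketch of a set-based Dice coefficient, 2 * |A and B in common| / (|A| + |B|), computed over word N-Grams of several lengths. It is not the package's implementation: the helpers make_ngrams and dice_coefficient, the column names ngram_size and dice, and the choice to always remove duplicate N-Grams are assumptions made for illustration, and the function documented here may compute or label its output differently.

```r
# Hypothetical helper: build word N-Grams of length n from a token vector.
make_ngrams <- function(tokens, n) {
  if (length(tokens) < n) return(character(0))
  starts <- seq_len(length(tokens) - n + 1)
  vapply(starts,
         function(i) paste(tokens[i:(i + n - 1)], collapse = " "),
         character(1))
}

# Hypothetical helper: Dice coefficient on the sets of unique N-Grams.
# Assumption: duplicates are always removed before matching.
dice_coefficient <- function(tokens_1, tokens_2, n) {
  a <- unique(make_ngrams(tokens_1, n))
  b <- unique(make_ngrams(tokens_2, n))
  # Undefined when neither document is long enough to yield an N-Gram.
  if (length(a) + length(b) == 0) return(NA_real_)
  2 * length(intersect(a, b)) / (length(a) + length(b))
}

# Two toy documents, already tokenized.
doc_1 <- c("the", "quick", "brown", "fox")
doc_2 <- c("the", "quick", "red", "fox")

# One Dice coefficient per N-Gram length, collected in a data.frame.
sizes <- 1:3
result <- data.frame(
  ngram_size = sizes,
  dice = vapply(sizes,
                function(n) dice_coefficient(doc_1, doc_2, n),
                numeric(1))
)
print(result)
```

In this toy case the coefficient falls as the N-Gram length grows (0.75 for single words, 1/3 for bigrams, 0 for trigrams), because longer N-Grams are less likely to be shared between the two documents.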