Calculates the positions of n-grams in two document versions which match an ngram in the other version.
ngram_sequence_matching(document_1, document_2, ngram_size,
use_hashmap = FALSE, tokenized_strings_provided = FALSE)
A string (or a character vector) representing the earlier document version.
A string (or a character vector) representing the later document version.
The length of n-grams to be compared
Defaults to FALSE. If TRUE, then a hashmap is used for faster lookup and comparisons.
Defaults to FALSE. If TRUE, then pre-tokenized strings are expected as character vectors.
A List object.