powered by
Cosine similarity for text documents
COS_TEXT( text_vector1 = NULL, text_vector2 = NULL, threads = 1, separator = " " )
a numeric vector
a character string vector representing text documents (it should have the same length as the text_vector2)
a character string vector representing text documents (it should have the same length as the text_vector1)
a numeric value specifying the number of cores to run in parallel
specifies the separator used between words of each character string in the text vectors
The function calculates the cosine distance between pairs of text sequences of two character string vectors
library(textTinyR) vec1 = c('use this', 'function to compute the') vec2 = c('cosine distance', 'between text sequences') out = COS_TEXT(text_vector1 = vec1, text_vector2 = vec2, separator = " ")
Run the code above in your browser using DataLab