powered by
Implementation of the SnowballC stemmer. Note that punctuation and capitals letters are also removed.
pr_stem_words(df, col, language = "french")
the data.frame containing the sentences
the column with the sentences
the language of the words Defaut is french. See SnowballC::getStemLanguages() function for a list of supported languages.
a tibble
# NOT RUN { a <- data.frame(words = c("matin", "heure", "fatigu<U+00E9>","sonn<U+00E9>","lois", "tests","fusionner")) pr_stem_words(a, words) # }
Run the code above in your browser using DataLab