powered by
remove Punctuations, remove Numbers, Translate characters to lower or upper case, remove stopwords, remove user specified words, Stemming words.
cleanAbstracts(abstracts, rmNum = TRUE, tolw = TRUE, toup = FALSE, rmWords = TRUE, yrWords = NULL, stemDoc = FALSE)
output of getAbstracts, or just a paragraph of text
Remove the text document with any numbers in it or not
Translate characters in character vectors to lower case or not
Translate characters in character vectors to upper case or not
Remove a set of English stopwords (e.g., 'the') or not
A character vector listing the words to be removed.
Stem words in a text document using Porter's stemming algorithm.
getAbstracts
# NOT RUN { # Abs=getAbstracts(c("22693232", "22564732")) # cleanAbs=cleanAbstracts(Abs) # text="Jobs received a number of honors and public recognition." # cleanD=cleanAbstracts(text) # }
Run the code above in your browser using DataLab