Read a corpus vector and generate the dictionary data frame.
createDict(trainvec, dicfile = NULL, wordsplit = "\\s+", natruesplit = "/")
trainvec: A character vector containing the corpus.
dicfile: The path of the output file. Default is NULL.
wordsplit: A regular expression used to split the text into words.
natruesplit: A regular expression used to split each word from its nature (part-of-speech) tag.
A data frame of:
Word.
Frequency.
Nature.
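To make the roles of the two split patterns concrete, here is a minimal base-R sketch of the general idea: split each corpus line into tokens, split each token into a word and its nature, then count the pairs. It is not the package's implementation. The sample corpus, the column names (word, freq, nature), and the sort order are assumptions chosen for illustration; only the two default patterns are taken from the usage line.

```r
# Hypothetical tagged corpus: each token has the form "word/nature".
trainvec <- c("the/r cat/n sat/v", "the/r dog/n sat/v")

# Split every line into tokens on whitespace (the default `wordsplit`).
tokens <- unlist(strsplit(trainvec, "\\s+"))
tokens <- tokens[nzchar(tokens)]

# Split every token into word and nature on "/" (the default `natruesplit`).
parts  <- strsplit(tokens, "/")
word   <- vapply(parts, function(p) p[1], character(1))
nature <- vapply(parts, function(p) p[2], character(1))

# Count how often each (word, nature) pair occurs.
pairs <- data.frame(word = word, nature = nature, freq = 1L,
                    stringsAsFactors = FALSE)
dict  <- aggregate(freq ~ word + nature, data = pairs, FUN = sum)

# Assumed column order and sorting; the real output may differ.
dict <- dict[order(-dict$freq, dict$word), c("word", "freq", "nature")]
print(dict)
```

Tokens without a "/" separator get a nature of NA in this sketch, and aggregate() drops them by default; how createDict treats such tokens is not stated here.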
# NOT RUN {
data(PD980105)
d1 <- createDict(PD980105[1:10])
head(d1)
# }