Edit the default user dictionary.
edit_dict(name = "user")
The name of the dictionary to edit: "user", "system", or "stop_word".
There are three columns in the system dictionary, separated by spaces. The first column is the word, the second column is the frequency of the word, and the third column is the speech tag, using labels compatible with ictclas.
There are two columns in the user dictionary. The first column is the word,
and the second column is the speech tag, using labels compatible with ictclas.
Frequencies of words in the user dictionary are set by user_weight in the worker
function.
If you want to provide the frequency of a new word,
you can put it in the system dictionary.
The stop words dictionary has only one column, which contains the stop words.
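The three layouts described above can be sketched in base R. This is only an illustration of the column structure: the entries, the helper names write_dict and read_dict, and the temporary files are all hypothetical and are not part of the package, and ASCII words stand in for real dictionary entries.

```r
# Illustrative entries only; the words, frequencies, and tags are made up.
system_lines <- c("apple 120 n", "run 85 v")   # word, frequency, speech tag
user_lines   <- c("jiebaR nz", "segmenter n")  # word, speech tag
stop_lines   <- c("the", "of")                 # word only

# Hypothetical helper: write entries to a temporary file, one per line.
write_dict <- function(lines) {
  path <- tempfile(fileext = ".utf8")
  writeLines(lines, path)
  path
}

# Hypothetical helper: read a space-separated dictionary into a data frame.
read_dict <- function(path, columns) {
  read.table(path, sep = " ", col.names = columns,
             stringsAsFactors = FALSE, quote = "", comment.char = "")
}

system_dict <- read_dict(write_dict(system_lines), c("word", "freq", "tag"))
user_dict   <- read_dict(write_dict(user_lines), c("word", "tag"))
stop_dict   <- read_dict(write_dict(stop_lines), "word")

print(system_dict)
print(user_dict)
print(stop_dict)
```

Only the system dictionary carries a frequency column, which is why a word that needs an explicit frequency belongs there rather than in the user dictionary.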
The ictclas speech tags: http://t.cn/RAEj7e1