Rdocumentation
powered by
Learn R Programming
jiebaR (version 0.11)
Chinese Text Segmentation
Description
Chinese text segmentation, keyword extraction and speech tagging For R.
Copy Link
Link to current version
Version
Version
0.11
0.10.99
0.9.99
0.9.1
0.8.1
0.7
0.6
0.5
0.4
0.3
0.2
Install
install.packages('jiebaR')
Monthly Downloads
2,933
Version
0.11
License
MIT + file LICENSE
Issues
36
Pull Requests
0
Stars
344
Forks
108
Repository
https://github.com/qinwf/jiebaR/
Maintainer
Qin Wenfeng
Last Published
December 13th, 2019
Functions in jiebaR (0.11)
Search all functions
DICTPATH
The path of dictionary
apply_list
Apply list input to a worker
get_tuple
get tuple from the segmentation result
<=.keywords
Keywords symbol
keywords
Keyword extraction
new_user_word
Add user word
tagging
Speech Tagging
print.inv
Print worker settings
simhash_dist
Compute Hamming distance of Simhash value
jiebaR
A package for Chinese text segmentation
<=.segment
Text segmentation symbol
<=.qseg
Quick mode symbol
<=.simhash
Simhash symbol
vector_tag
Tag the a character vector
tobin
simhash value to binary
<=.tagger
Tagger symbol
worker
Initialize jiebaR worker
get_qsegmodel
Set quick mode model
show_dictpath
Show default path of dictionaries
segment
Chinese text segmentation function
simhash
Simhash computation
freq
The frequency of words
get_idf
generate IDF dict
distance
Hamming distance of words
edit_dict
Edit default user dictionary
file_coding
Files encoding detection
filter_segment
Filter segmentation result