jiebaR (version 0.11)

Chinese Text Segmentation

Description

Chinese text segmentation, keyword extraction, and speech tagging for R.

Install

install.packages('jiebaR')

Monthly Downloads

2,933

Version

0.11

License

MIT + file LICENSE

Last Published

December 13th, 2019

Functions in jiebaR (0.11)

DICTPATH

The path of dictionary

apply_list

Apply list input to a worker

get_tuple

Get tuple from the segmentation result

<=.keywords

Keywords symbol

keywords

Keyword extraction

new_user_word

Add user word

tagging

Speech tagging

print.inv

Print worker settings

simhash_dist

Compute Hamming distance of Simhash value

jiebaR

A package for Chinese text segmentation

<=.segment

Text segmentation symbol

<=.qseg

Quick mode symbol

<=.simhash

Simhash symbol

vector_tag

Tag a character vector

tobin

Convert Simhash value to binary

<=.tagger

Tagger symbol

worker

Initialize jiebaR worker

get_qsegmodel

Set quick mode model

show_dictpath

Show default path of dictionaries

segment

Chinese text segmentation function

simhash

Simhash computation

freq

The frequency of words

get_idf

Generate IDF dictionary

distance

Hamming distance of words

edit_dict

Edit default user dictionary

file_coding

File encoding detection

filter_segment

Filter segmentation result
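
Example

The listing above gives only function names, so a short sketch of how a few of them might fit together may help. It assumes jiebaR is installed and can load its bundled dictionaries. The sample sentence and all variable names are illustrative and do not come from the package documentation. No output is shown because results depend on the dictionary version.

```r
# Minimal sketch, assuming jiebaR is installed with its default dictionaries.
library(jiebaR)

# Illustrative sample sentence (an assumption, not taken from the package docs).
text <- "我喜欢用R语言处理中文文本"

# worker() initializes a segmentation worker with default settings.
seg_worker <- worker()

# segment() splits the sentence into a character vector of words.
words <- segment(text, seg_worker)

# freq() tabulates how often each word occurs in the segmentation result.
word_freq <- freq(words)

# A "keywords" worker extracts the top-ranked keywords; topn caps how many.
key_worker <- worker("keywords", topn = 3)
top_keywords <- keywords(text, key_worker)

# A "tag" worker labels each segmented word with a part-of-speech tag.
tag_worker <- worker("tag")
tagged <- tagging(text, tag_worker)

print(words)
print(word_freq)
print(top_keywords)
print(tagged)
```

Each task gets its own worker here because the worker type ("keywords", "tag", or the default) is set when the worker is created.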