jiebaR (version 0.11.1)

Chinese Text Segmentation

Description

Chinese text segmentation, keyword extraction, and part-of-speech tagging for R.

Install

install.packages('jiebaR')

Monthly Downloads

3,123

Version

0.11.1

License

MIT + file LICENSE

Maintainer

ORPHANED

Last Published

March 29th, 2025

Functions in jiebaR (0.11.1)

<=.segment

Text segmentation symbol
worker

Initialize jiebaR worker
simhash

Simhash computation
show_dictpath

Show default path of dictionaries
vector_tag

Tag a character vector
tobin

Convert a Simhash value to binary
get_qsegmodel

Set quick mode model
simhash_dist

Compute Hamming distance of Simhash value
tagging

Speech Tagging
freq

The frequency of words
file_coding

Files encoding detection
DICTPATH

The path of dictionary
filter_segment

Filter segmentation result
apply_list

Apply list input to a worker
get_idf

Generate an IDF dictionary
edit_dict

Edit default user dictionary
get_tuple

Get tuples from the segmentation result
distance

Hamming distance of words
jiebaR

A package for Chinese text segmentation
<=.simhash

Simhash symbol
print.inv

Print worker settings
new_user_word

Add user word
<=.tagger

Tagger symbol
segment

Chinese text segmentation function
<=.keywords

Keywords symbol
keywords

Keyword extraction
<=.qseg

Quick mode symbol
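
The functions listed above are typically used together: a worker is created once and then passed to segment, tagging, or keywords. The following is a minimal sketch rather than an authoritative recipe. It assumes jiebaR is installed with its bundled default dictionaries; the sample sentence and the variable names (seg_worker, tag_worker, key_worker) are chosen here for illustration only.

```r
# Sketch only: assumes jiebaR and its default dictionaries are installed.
library(jiebaR)

# Illustrative input; any Chinese sentence would do.
text <- "我爱自然语言处理"

# worker() with no arguments creates a segmentation worker with default settings.
seg_worker <- worker()
words <- segment(text, seg_worker)
print(words)

# The `<=` symbol from the function list applies a worker to text directly.
words_again <- seg_worker <= text

# A worker of type "tag" performs speech tagging on the same text.
tag_worker <- worker("tag")
tags <- tagging(text, tag_worker)
print(tags)

# A worker of type "keywords" extracts the top-scoring keywords;
# topn = 3 is an arbitrary choice for this example.
key_worker <- worker("keywords", topn = 3)
top_keywords <- keywords(text, key_worker)
print(top_keywords)
```

Creating each worker once and reusing it avoids reloading the dictionaries on every call, which is why the worker is built separately from the call that processes the text.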