Rwordseg (version 0.3-2)

Chinese Word Segmentation

Description

Provides interfaces and useful tools for Chinese word segmentation. Implements a segmentation algorithm based on a Hidden Markov Model (HMM) in native R code. The methods for the HHMM-based Chinese lexical analyzer are as described in Hua-Ping Zhang et al. (2003).
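
As a rough illustration of what HMM-based segmentation involves, the sketch below tags every character as B (begin), M (middle), E (end) or S (single-character word), estimates probabilities by counting over a tiny pre-segmented corpus, and decodes with the Viterbi algorithm. It is written in base R and does not call Rwordseg. The function names (tag_word, train_hmm, viterbi_segment), the three-sentence corpus, and the add-one smoothing are all assumptions made for this example, not the package's actual implementation or API.

```r
# Toy HMM word segmenter in base R. Illustrative only: this is NOT Rwordseg's code.
states <- c("B", "M", "E", "S")  # Begin / Middle / End of a word, Single-character word

# Label each character of one word with its position tag.
tag_word <- function(w) {
  n <- nchar(w)
  if (n == 1) "S" else c("B", rep("M", n - 2), "E")
}

# Estimate log-probabilities by counting over pre-segmented sentences.
# Allowed events start at a count of 1 (add-one smoothing, an assumption);
# structurally impossible events (e.g. B -> B) stay at 0, i.e. log-probability -Inf.
train_hmm <- function(corpus) {
  chars <- unique(unlist(strsplit(unlist(corpus), "")))
  init  <- c(B = 1, M = 0, E = 0, S = 1)
  trans <- matrix(c(0, 1, 1, 0,    # B -> M, E
                    0, 1, 1, 0,    # M -> M, E
                    1, 0, 0, 1,    # E -> B, S
                    1, 0, 0, 1),   # S -> B, S
                  4, 4, byrow = TRUE, dimnames = list(states, states))
  emit  <- matrix(1, 4, length(chars) + 1,
                  dimnames = list(states, c(chars, "<unk>")))
  for (sent in corpus) {
    tags <- unlist(lapply(sent, tag_word))
    obs  <- unlist(strsplit(sent, ""))
    init[tags[1]] <- init[tags[1]] + 1
    for (i in seq_along(tags)) {
      emit[tags[i], obs[i]] <- emit[tags[i], obs[i]] + 1
      if (i > 1) trans[tags[i - 1], tags[i]] <- trans[tags[i - 1], tags[i]] + 1
    }
  }
  list(init  = log(init / sum(init)),
       trans = log(trans / rowSums(trans)),
       emit  = log(emit / rowSums(emit)))
}

# Viterbi decoding: find the most probable tag path, then cut after every E or S.
viterbi_segment <- function(text, hmm) {
  chars <- strsplit(text, "")[[1]]
  n <- length(chars)
  if (n == 0) return(character(0))
  obs <- chars
  obs[!obs %in% colnames(hmm$emit)] <- "<unk>"   # unseen characters
  delta <- matrix(-Inf, 4, n, dimnames = list(states, NULL))
  back  <- matrix(0L, 4, n)
  delta[, 1] <- hmm$init + hmm$emit[, obs[1]]
  for (i in seq_len(n)[-1]) {
    for (j in 1:4) {
      cand <- delta[, i - 1] + hmm$trans[, j]
      back[j, i]  <- which.max(cand)
      delta[j, i] <- max(cand) + hmm$emit[j, obs[i]]
    }
  }
  final <- delta[, n]
  final[c("B", "M")] <- -Inf        # a sentence must end on a word boundary
  path <- integer(n)
  path[n] <- which.max(final)
  for (i in rev(seq_len(n)[-1])) path[i - 1] <- back[path[i], i]
  tags <- states[path]
  word_id <- cumsum(c(TRUE, head(tags, -1) %in% c("E", "S")))
  unname(vapply(split(chars, word_id), paste, character(1), collapse = ""))
}

# Hypothetical three-sentence training corpus, already split into words.
corpus <- list(
  c("我", "爱", "北京"),
  c("我", "在", "北京", "学习"),
  c("他", "爱", "学习")
)
hmm   <- train_hmm(corpus)
words <- viterbi_segment("我爱学习", hmm)
print(words)
```

A corpus this small only shows the mechanics. A usable model needs a large annotated corpus, and a hierarchical HMM (HHMM) analyzer of the kind cited above adds further levels beyond this single-layer sketch.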

Install

install.packages('Rwordseg')

Monthly Downloads

11

Version

0.3-2

License

LGPL

Maintainer

Jian Li

Last Published

August 23rd, 2019

Functions in Rwordseg (0.3-2)

uninstallDict

Uninstall a dictionary.

parseSentence

Parse a string of text.

listDict

List the installed dictionaries.

PD980105

Corpus of Multi-level Processing for Modern Chinese

createDict

Create a dictionary file from a corpus.

setAnalyzer

Set the default analyzer.

segmentCN

Segment Chinese text.

insertWords

Insert new words into the analyzer.

installDict

Install a new dictionary.

importSogouScel

Import a Sogou dictionary.

createHMM

Create an HMM from a corpus.

setAppDir

Set the application path.