KoNLP

POS tagger and morphological analyzer for Korean text-based research. It provides tools for corpus linguistics research, such as a keystroke converter, Hangul automata, concordance, and mutual information. It also provides a convenient interface that lets users selectively apply, edit, and add morphological dictionaries.

  • Interfaces with the open-source Hannanum analyzer.
  • Applies some tweaks to the Hannanum analyzer to support larger or more flexible user dictionaries for the Sejong project and NIADic.
  • Provides many other functions for Korean text analysis, such as keystroke conversion, is.jamo, is.hangul, and Hangul automata.
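
As a rough illustration of the analyzer interface listed above, the following sketch runs noun extraction and POS tagging on a single sentence. The input sentence is made up for this example, and the block assumes KoNLP and a working Java runtime (via rJava) are available; it skips the analysis otherwise.

```r
# Minimal sketch, not an official example: the sentence below is illustrative.
sentence <- "한국어 텍스트에서 명사를 추출합니다."

# KoNLP depends on rJava, so loading can fail on machines without Java.
if (requireNamespace("KoNLP", quietly = TRUE)) {
  library(KoNLP)

  nouns <- extractNoun(sentence)    # nouns found in the sentence
  pos9  <- SimplePos09(sentence)    # tagging with the 9 KAIST tags
  pos22 <- SimplePos22(sentence)    # tagging with the 22 KAIST tags
  morph <- MorphAnalyzer(sentence)  # Hannanum morphological analysis

  print(nouns)
  str(pos9)
} else {
  message("KoNLP is not available; skipping the analysis.")
}
```

SimplePos09 and SimplePos22 differ only in the granularity of the KAIST tag set, so the coarser one is usually enough for a first look at a text.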

Some Korean tutorials are on my blog; English pages are mainly on the wiki.

To install from CRAN, use

install.packages('KoNLP')

To install from GitHub, use

install.packages('devtools')
devtools::install_github('haven-jeon/KoNLP')
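
Once the package is installed, a dictionary is typically set up before analysis. The sketch below is an assumption-laden example rather than the package's prescribed workflow: it assumes buildDictionary accepts the ext_dic and user_dic arguments as described in the package README, that the Sejong dictionary data can be loaded on your machine, and the term and its KAIST tag ("ncn") are purely illustrative.

```r
# Hypothetical user entries: one term with a KAIST tag (illustrative values).
user_terms <- data.frame(
  term = "전작권",
  tag  = "ncn",
  stringsAsFactors = FALSE
)

if (requireNamespace("KoNLP", quietly = TRUE)) {
  library(KoNLP)

  # Combine the Sejong dictionary with the user-supplied terms.
  # Assumption: ext_dic = "sejong" is available in the installed version.
  buildDictionary(ext_dic = "sejong", user_dic = user_terms)

  # The added term should now be recognized as a noun.
  print(extractNoun("전작권 환수에 대한 논의"))
} else {
  message("KoNLP is not available; skipping dictionary setup.")
}
```

Note that building a dictionary modifies the user dictionary on disk; backupUsrDic and restoreUsrDic exist for saving and restoring it.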

Monthly Downloads

173

Version

0.80.2

License

GPL-3

Last Published

March 1st, 2020

Functions in KoNLP (0.80.2)

  • editweights: Keystroke misspelling cost table
  • convertTag: Tag name converter
  • SimplePos09: POS tagging using 9 KAIST tags
  • install_NIADic: install_NIADic
  • useSystemDic: Use the system default dictionary
  • is.ascii: Check whether a sentence is all ASCII
  • restoreUsrDic: Restore a backed-up dic_user.txt
  • scala_library_install: scala_library_install
  • is.hangul: Check whether a sentence is all Hangul
  • is.jaeum: Check whether a sentence is all Jaeum (consonants)
  • is.jamo: Check whether a sentence is all Jamo
  • extractNoun: Noun extractor for Hangul
  • mergeUserDic: Append to or replace the user dictionary with a new data.frame
  • get_dictionary: Get a dictionary
  • statDic: Summary of dictionaries
  • mutualinformation: Mutual information for input text
  • is.moeum: Check whether a sentence is all Moeum (vowels)
  • convertHangulStringToKeyStrokes: Convert a Hangul string to keystrokes
  • convertHangulStringToJamos: Convert a Hangul string to Jamos
  • tags: Tag names
  • reloadAllDic: Reload all Hannanum analyzer dictionaries
  • reloadUserDic: Reload dictionaries for specific functions
  • useNIADic: Use the Insighter and Woorimalsam dictionaries
  • useSejongDic: Use the Sejong noun dictionary
  • HangulAutomata: Run the Hangul automata
  • concordance_file: Concordance for an input text file
  • concordance_str: Concordance for an input text vector
  • MorphAnalyzer: Hannanum morphological analyzer interface function
  • backupUsrDic: Utility function that backs up the current dic_user.txt file to the backup directory
  • buildDictionary: buildDictionary
  • KtoS: KAIST tag to Sejong tag
  • StoK: Sejong tag to KAIST tag
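
To give a feel for the Hangul utilities in the list above, here is a small sketch that checks a string's script, decomposes it into Jamos and keystrokes, and reassembles it with the automata. The input word is illustrative, the calls rely on default arguments only, and the block is skipped when KoNLP (or its Java dependency) cannot be loaded.

```r
word <- "한글"  # illustrative input

if (requireNamespace("KoNLP", quietly = TRUE)) {
  library(KoNLP)

  print(is.hangul(word))  # is the string entirely Hangul?
  print(is.ascii(word))   # is the string entirely ASCII?

  # Decompose each syllable into its Jamos, and into keyboard keystrokes.
  jamos <- convertHangulStringToJamos(word)
  keys  <- convertHangulStringToKeyStrokes(word)

  # Feed the concatenated Jamos back through the automata to rebuild syllables.
  restored <- HangulAutomata(paste(jamos, collapse = ""))
  print(restored)
} else {
  message("KoNLP is not available; skipping the Hangul utilities.")
}
```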