Learn R Programming

imagerExtra (version 1.3.2)

OCR: Optical Character Recognition with tesseract

Description

OCR and OCR_data are wrappers for ocr and ocr_data of tesseract package. You need to install tesseract package to use these functions.

Usage

OCR(imorpx, engine = tesseract::tesseract("eng"), HOCR = FALSE)

OCR_data(imorpx, engine = tesseract::tesseract("eng"))

Arguments

imorpx

a grayscale image of class cimg or a pixel set

engine

a tesseract engine. See the reference manual of tesseract for detail.

HOCR

if TRUE return results as HOCR xml instead of plain text

Examples

Run this code
# NOT RUN {
hello <- DenoiseDCT(papers, 0.01) %>% ThresholdAdaptive(., 0.1, range = c(0,1))
if (requireNamespace("tesseract", quietly = TRUE))
{
  OCR(hello) %>% cat
  OCR_data(hello)
}
# }

Run the code above in your browser using DataLab