
Tesseract OCR

Installation

Homebrew

brew install tesseract tesseract-lang

Ubuntu

sudo apt install tesseract-ocr tesseract-ocr-kor tesseract-ocr-kor-vert gscan2pdf
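
After either install, a quick check along these lines can confirm that the binary is on PATH and that the Korean language pack is visible. This is only a minimal sketch: check_tesseract is a hypothetical helper name, and it assumes `tesseract --list-langs` prints one language code per line.

```shell
# Hypothetical helper: report whether tesseract is on PATH and whether
# a given language (default: kor) appears in its language list.
check_tesseract() {
    lang="${1:-kor}"
    if ! command -v tesseract >/dev/null 2>&1; then
        echo "tesseract: not found"
        return 1
    fi
    # Older releases print the list on stderr, so merge both streams.
    if tesseract --list-langs 2>&1 | grep -qx "$lang"; then
        echo "$lang: available"
    else
        echo "$lang: missing"
        return 2
    fi
}
```

Calling `check_tesseract kor` prints a single status line; a non-zero return code means the install needs another look.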

Recognition data

# To fetch every model, clone the whole repository:
# cd ~/.config
# git clone --recursive --depth=1 https://github.com/tesseract-ocr/tessdata_best.git
# To fetch only the models you need:
 
mkdir -p ~/.config/tessdata_best
 
wget -O ~/.config/tessdata_best/kor.traineddata https://github.com/tesseract-ocr/tessdata_best/raw/main/kor.traineddata
wget -O ~/.config/tessdata_best/eng.traineddata https://github.com/tesseract-ocr/tessdata_best/raw/main/eng.traineddata
 
export TESSDATA_PREFIX="$HOME/.config/tessdata_best"
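
Once the downloads finish, it may help to confirm that the expected model files actually landed in the data directory before running OCR. The sketch below assumes the directory layout used above; missing_models is a hypothetical helper, and sample.png is a placeholder image name rather than a file from this page.

```shell
# Hypothetical helper: print each language from the remaining arguments
# whose .traineddata file is absent from the directory given first.
missing_models() {
    dir="$1"; shift
    for lang in "$@"; do
        [ -f "$dir/$lang.traineddata" ] || echo "$lang"
    done
}

# Lists any of kor/eng that are not present; prints nothing if both exist.
missing_models "${TESSDATA_PREFIX:-$HOME/.config/tessdata_best}" kor eng

# If nothing is reported missing, an OCR run could look like this
# (writes the recognized text to out.txt):
# tesseract sample.png out -l kor+eng
```

Checking for the files first gives a clearer message than a failed OCR run when a download was interrupted or TESSDATA_PREFIX points somewhere else.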

Recognition accuracy