목차
version 4
LSTM(Long Short Term Memory) 기반 엔진 구현
* RNN(Recurrent Neural Network)의 일종
다른 딥러닝 기술
* CNN(Convolutional Neural Network): 단일문자가 포함된 이미지를 인식하는 경우
설치
공식문서 https://github.com/tesseract-ocr/tesseract/wiki#installation
데비안 계열
$ sudo apt install tesseract-ocr tesseract-ocr-kor
명령어 사용법
$ tesseract 영문텍스트.png stdout -l eng --oem 1 --psm 3
여러언어
$ tesseract myscan.png out -l eng+deu
파라미터
oem (OCR Engine modes)
0 Legacy engine only. 1 Neural nets LSTM engine only. 2 Legacy + LSTM engines. 3 Default, based on what is available.
psm (Page segmentation modes)
0 Orientation and script detection (OSD) only. 1 Automatic page segmentation with OSD. 2 Automatic page segmentation, but no OSD, or OCR. 3 Fully automatic page segmentation, but no OSD. (Default) 4 Assume a single column of text of variable sizes. 5 Assume a single uniform block of vertically aligned text. 6 Assume a single uniform block of text. 7 Treat the image as a single text line. 8 Treat the image as a single word. 9 Treat the image as a single word in a circle. 10 Treat the image as a single character. 11 Sparse text. Find as much text as possible in no particular order. 12 Sparse text with OSD. 13 Raw line. Treat the image as a single text line, bypassing hacks that are Tesseract-specific.
결과 향상시키기
- 300DPI이미지 사용하기