바뀜

이동: 둘러보기, 검색

Tesseract

1,443 바이트 추가됨, 2019년 10월 29일 (화) 00:16
새 문서: # version 4 ## LSTM(Long Short Term Memory) 기반 엔진 구현 * RNN(Recurrent Neural Network)의 일종 ## 다른 딥러닝 기술 * CNN(Convolutional Neural Network): 단일문...
# version 4
## LSTM(Long Short Term Memory) 기반 엔진 구현
* RNN(Recurrent Neural Network)의 일종
## 다른 딥러닝 기술
* CNN(Convolutional Neural Network): 단일문자가 포함된 이미지를 인식하는 경우
## 설치
https://github.com/tesseract-ocr/tesseract/wiki#installation
## 명령어 사용법
$ tesseract 영문텍스트.png stdout -l eng --oem 1 --psm 3
### 파라미터
#### oem (OCR Engine modes)
0 Legacy engine only.
1 Neural nets LSTM engine only.
2 Legacy + LSTM engines.
3 Default, based on what is available.
#### psm (Page segmentation modes)
0 Orientation and script detection (OSD) only.
1 Automatic page segmentation with OSD.
2 Automatic page segmentation, but no OSD, or OCR.
3 Fully automatic page segmentation, but no OSD. (Default)
4 Assume a single column of text of variable sizes.
5 Assume a single uniform block of vertically aligned text.
6 Assume a single uniform block of text.
7 Treat the image as a single text line.
8 Treat the image as a single word.
9 Treat the image as a single word in a circle.
10 Treat the image as a single character.
11 Sparse text. Find as much text as possible in no particular order.
12 Sparse text with OSD.
13 Raw line. Treat the image as a single text line, bypassing hacks that are Tesseract-specific.
### 결과 향상시키기
* 300DPI이미지 사용하기
편집
2,440

둘러보기 메뉴