주 메뉴 열기

wwiki β

Tesseract

Jhkim (토론 | 기여)님의 2019년 10월 29일 (화) 00:16 판 (새 문서: # version 4 ## LSTM(Long Short Term Memory) 기반 엔진 구현 * RNN(Recurrent Neural Network)의 일종 ## 다른 딥러닝 기술 * CNN(Convolutional Neural Network): 단일문...)
(차이) ← 이전 판 | 최신판 (차이) | 다음 판 → (차이)
  1. version 4
    1. LSTM(Long Short Term Memory) 기반 엔진 구현
* RNN(Recurrent Neural Network)의 일종
    1. 다른 딥러닝 기술
* CNN(Convolutional Neural Network): 단일문자가 포함된 이미지를 인식하는 경우
    1. 설치
 https://github.com/tesseract-ocr/tesseract/wiki#installation
    1. 명령어 사용법
$ tesseract 영문텍스트.png stdout -l eng --oem 1 --psm 3
      1. 파라미터
        1. oem (OCR Engine modes)
 0    Legacy engine only.
 1    Neural nets LSTM engine only.
 2    Legacy + LSTM engines.
 3    Default, based on what is available.
        1. psm (Page segmentation modes)
 0    Orientation and script detection (OSD) only.
 1    Automatic page segmentation with OSD.
 2    Automatic page segmentation, but no OSD, or OCR.
 3    Fully automatic page segmentation, but no OSD. (Default)
 4    Assume a single column of text of variable sizes.
 5    Assume a single uniform block of vertically aligned text.
 6    Assume a single uniform block of text.
 7    Treat the image as a single text line.
 8    Treat the image as a single word.
 9    Treat the image as a single word in a circle.
10    Treat the image as a single character.
11    Sparse text. Find as much text as possible in no particular order.
12    Sparse text with OSD.
13    Raw line. Treat the image as a single text line, bypassing hacks that are Tesseract-specific.
      1. 결과 향상시키기
* 300DPI이미지 사용하기