Membuat Optical Character Recognition (OCR) Sederhana menggunakan Python

Create Simple Optical Character Recognition (OCR) with Python

sudo apt-get install tesseract-ocr
brew install tesseract
$ pip install pytesseract
from PIL import Image
import pytesseract
Image sample
filename = 'image_01.png'
img1 = Image.open(filename)
text = pytesseract.image_to_string(img1)
Image sample dengan noise
filename = 'image_02.png'
img2 = Image.open(filename)
text = pytesseract.image_to_string(img2)
print(text)
import numpy as np
import cv2
norm_img = np.zeros((img.shape[0], img.shape[1]))
img = cv2.normalize(img, norm_img, 0, 255, cv2.NORM_MINMAX)
img = cv2.threshold(img, 100, 255, cv2.THRESH_BINARY)[1]
img = cv2.GaussianBlur(img, (1, 1), 0)
Preprocessed image

Membuat Text Localization dan Detection pada Gambar Menggunakan Tesseract OCR

Tetap Terhubung dengan Kami
Share this
×