2021年1月27日星期三

How to process this captcha image for Pytesseract?

I want to solve automatically captchas like this one (all of them with red background and white letters) with Pytesseract

Captcha image

I have been trying processing image to make Pytesseract be able to read it, but no success. Would be great to receive your ideas to process this image. Here my code:

import cv2  import pytesseract    tessdata_dir_config = '--tessdata-dir "C:\\Program Files\\Tesseract-OCR\\tessdata"'  pytesseract.pytesseract.tesseract_cmd = 'C:\\Program Files\\Tesseract-OCR\\tesseract.exe'    img = cv2.imread("captcha.png")  img = cv2.resize(img, None, fx=2, fy=2)  gray = cv2.cvtColor(img, cv2.COLOR_BGR2GRAY)  adaptive = cv2.adaptiveThreshold(      gray, 255, cv2.ADAPTIVE_THRESH_GAUSSIAN_C, cv2.THRESH_BINARY, 85, 20)  print((pytesseract.image_to_string(img, config=tessdata_dir_config)).strip())  print((pytesseract.image_to_string(gray, config=tessdata_dir_config)).strip())  print((pytesseract.image_to_string(adaptive, config=tessdata_dir_config)).strip())    cv2.imshow("Captcha", img) # Output: IMQW  cv2.imshow("Gray", gray) # Output: IMOW  cv2.imshow("Adaptive", adaptive) # Output: IMOW,    cv2.waitKey(7000)
https://stackoverflow.com/questions/65930463/how-to-process-this-captcha-image-for-pytesseract January 28, 2021 at 11:05AM

没有评论:

发表评论