How can I separate lines in this text for OCR?


I want to use OCR on this block of text:

It works well on some lines, but on others it detects nothing or returns gibberish. I'm pretty sure this is caused by the skew of the text: if I change the angle of the block even slightly, the results get better or worse for particular lines.

Normally I would use contours to deskew the whole block. However, each line has a different skew, so I thought it would be best to separate the lines and then deskew and OCR each one independently. I wanted to use a Hough transform to detect the horizontal gaps separating the text lines, but it only seems to detect vertical lines. Do you have any idea how to fix this, or an entirely different idea for deskewing the image?
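For reference, one common alternative to Hough lines for the separation step is a horizontal projection profile: count the ink pixels in every row and treat runs of empty rows as the gaps between text lines. The sketch below is only an illustration under stated assumptions, not a tested solution for this image. The function name `split_text_rows` and the `min_ink` parameter are hypothetical, the input is assumed to be a binary image with text pixels greater than zero on a zero background, and strongly skewed lines may overlap vertically and merge into one run.

```python
import numpy as np

def split_text_rows(binary, min_ink=1):
    """Return (start, end) row ranges that contain ink; end is exclusive.

    Assumes `binary` is a 2-D array with text pixels > 0 on a zero background.
    `min_ink` is a hypothetical noise threshold: rows with fewer ink pixels
    are treated as empty.
    """
    ink_per_row = np.count_nonzero(binary, axis=1)
    has_ink = ink_per_row >= min_ink
    # Pad with False so every run of text rows has a rising and a falling edge.
    padded = np.concatenate(([False], has_ink, [False]))
    changes = np.flatnonzero(padded[1:] != padded[:-1])
    # The change positions alternate between run starts and exclusive run ends.
    return list(zip(changes[0::2].tolist(), changes[1::2].tolist()))

# Synthetic example: two "text lines" separated by blank rows.
page = np.zeros((12, 20), dtype=np.uint8)
page[2:5, 3:17] = 255
page[8:11, 1:15] = 255
rows = split_text_rows(page)
print(rows)  # [(2, 5), (8, 11)]
```

Each `(start, end)` pair can then be used to crop one line, for example `page[start:end, :]`, before deskewing and running OCR on it separately.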

Here's the code for the Hough transform:

import cv2
import imutils
import numpy as np

def hough_lines2(cvImage):
    img = cvImage.copy()
    # The input image is already pre-processed, so no binarization is needed here
    edges = cv2.Canny(img, 50, 150, apertureSize=3)
    # I invert the edges since I want to detect lines where there is no text,
    # i.e. the space between the text lines
    inv = np.invert(edges)
    # I use maxLineGap=1 since I only want to detect lines where there is no
    # text in the way
    linesP = cv2.HoughLinesP(inv, 1, np.pi / 180, threshold=200,
                             minLineLength=150, maxLineGap=1)
    # Draw the detected segments in green on a BGR copy of the inverted edges
    img2 = cv2.cvtColor(inv, cv2.COLOR_GRAY2BGR)
    if linesP is not None:
        for x1, y1, x2, y2 in linesP[:, 0]:
            cv2.line(img2, (int(x1), int(y1)), (int(x2), int(y2)), (0, 255, 0), 2)
    # Display the lines in the image
    cv2.namedWindow('Resized', cv2.WINDOW_NORMAL)
    cv2.resizeWindow('Resized', 600, 900)
    cv2.imshow('Resized', imutils.resize(img2, width=500))
    cv2.waitKey(0)
    return 0

And these are the detected lines:

asked Jul 12, 2023 at 10:35 by anon

edited Jul 12, 2023 at 10:47 by Christoph Rackwitz

Almost all the text is detected in my environment. – user16612111, Jul 12, 2023 at 10:40

Some of the text is not clear, and that is why Tesseract OCR doesn't recognize those characters. – user16612111, Jul 12, 2023 at 10:40

This line separation is supposed to be done by OCR, not by you or OpenCV. – Christoph Rackwitz, Jul 12, 2023 at 10:47

Try converting the image to a binary version in which the white pixels become black and the black pixels become white, and then feed that image to the OCR. – user16612111, Jul 12, 2023 at 10:49

Deskewing that receipt is not for beginners. You'd best skip that. – Christoph Rackwitz, Jul 12, 2023 at 10:49
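
To illustrate the per-line deskewing idea from the question, here is a minimal sketch of estimating the skew of one already-cropped text line with a least-squares fit through its ink pixels. It is an assumption-laden illustration, not a method proposed by anyone in this thread. The name `estimate_skew_degrees` is hypothetical, the crop is assumed to contain a single line with ink pixels greater than zero, and descenders, noise, or stray marks can bias the fit on a real receipt.

```python
import numpy as np

def estimate_skew_degrees(line_img):
    """Estimate the skew of one cropped text line via a least-squares line fit.

    Assumes ink pixels are > 0 and that the crop holds a single text line.
    Image rows grow downward, so a positive result means the line drops
    toward the right.
    """
    ys, xs = np.nonzero(line_img)
    if xs.size < 2 or xs.min() == xs.max():
        return 0.0  # not enough horizontal extent to fit a slope
    slope, _intercept = np.polyfit(xs, ys, 1)
    return float(np.degrees(np.arctan(slope)))

# Synthetic line that drops one row every ten columns (slope of about 0.1).
img = np.zeros((40, 200), dtype=np.uint8)
for x in range(200):
    y = 10 + x // 10
    img[y:y + 3, x] = 255
angle = estimate_skew_degrees(img)
print(round(angle, 1))
```

The estimated angle could then be passed to a rotation routine such as `cv2.getRotationMatrix2D` followed by `cv2.warpAffine`; the sign of the correction should be checked against that routine's rotation convention before relying on it.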
