Skip to main content
Filter by
Sorted by
Tagged with
0 votes
1 answer
49 views

I am using docling and trying to get images with scanned text to parse with Tesseract OCR (could be any OCR, but tesseract is preferred if possible). My code is: pipeline_options = PdfPipelineOptions()...
Paul Gibson's user avatar
-1 votes
2 answers
182 views

Image contains single document printed in white paper. Background of image can be different. Tried to get document using code from https://scanbot.io/techblog/document-edge-detection-with-opencv/ with ...
Andrus's user avatar
  • 28.2k
0 votes
2 answers
74 views

Pytesseract cannot understand very simple and clear text. I've tried nearest neighbor, bilinear, gaussian blur, and everything else and cannot get tesseract to read the text consistently, the best I ...
RvBVakama's user avatar
  • 117
1 vote
1 answer
78 views

I am struggling with tesseract package (5.3.2 version) for R, trying to have a XML ALTO as output of the ocr() function. I read the documentation which states that this has something to do with the ...
PierreGeorge's user avatar
-1 votes
1 answer
142 views

Receipt clip contains structured background: Tried to remove it using textcleaner ImageMagic wrapper script from Remove receipt image border using ImageMagick answer. Used code from answer How to use ...
Andrus's user avatar
  • 28.2k
2 votes
1 answer
210 views

I am trying to extract numbers from dotted LED-style digits (0–9) using Tesseract OCR in a MAUI/Xamarin app on Android and iOS, fully offline. My boss wants a local solution that works on mobile ...
boss's user avatar
  • 1,638
2 votes
3 answers
162 views

My code works as python file but I am struggling to make it work using pyscript.I am sharing the code which I tried. main.py import pytesseract pytesseract.pytesseract.tesseract_cmd = r"Tesseract-...
nasrin begum pathan's user avatar
1 vote
0 answers
184 views

I’m using Docling to OCR scanned PDFs. I want to control Tesseract’s page-segmentation mode (PSM), e.g. --psm 6. Docling exposes both TesseractOcrOptions and TesseractCliOcrOptions, but neither ...
Pamudu Ranasinghe's user avatar
2 votes
1 answer
70 views

I'm attempting to perform OCR on a set of single letters inside an image using Python. I'm new to this so apologies if I get the terminology wrong, but I've filtered and have obtained (I think) quite ...
user201341's user avatar
0 votes
1 answer
154 views

Tried to use https://github.com/Sicos1977/TesseractOCR Nuget package in Debian 12. It looks that it requires new version of leptonica libleptonica-1.85.0.dll.so which is not avaliable in Debian: #apt ...
Andrus's user avatar
  • 28.2k
0 votes
1 answer
78 views

I am trying to OCR a specific area of a PDF page in a multi-page document (total page count varies between 600-10,000 pages). I initially receive the data as .pcl files in batches of 500 records, ...
bearskies's user avatar
1 vote
1 answer
83 views

I am currently using tesseract 5.0 and am training a model. I have generated the png, box and the ground truth files for a thousand images. However, when I run the command: make training MODEL_NAME=...
Akshay NN's user avatar
0 votes
0 answers
82 views

I followed the steps for fine-tuning Tesseract for handwriting recognition. I have the character images and the corresponding box files. Then I generated the .lstmf files, followed by the lstm_train....
TestING's user avatar
0 votes
0 answers
35 views

I’m trying to automate the process of selecting a WiFi network by its name during the initial setup of an Android device — the setup flow that includes selecting language, region, and connecting to ...
sharath chandra's user avatar
0 votes
1 answer
167 views

I'm trying to get the data out of this image: and no matter what I try I can't get a good result. I have tried ImageEnhance and cv2 I got the most promising result using cv2 and adaptive Treshold: ...
Cyclo's user avatar
  • 3
1 vote
1 answer
330 views

I've been building a Dockerfile which has to build my ASP.NET Core Web API and requires some libraries to work, such as OpenCV and Tesseract. My Dockerfile builds containers and images properly, but ...
bdvazquez17's user avatar
1 vote
1 answer
81 views

I have a PDF document that I want to scan with pytesseract, but the page numbers are not recognized. The page number is not recognized on any of the pages. The PDF is written with Latex. I ried ...
mike3467's user avatar
0 votes
1 answer
125 views

I need support to treat a captcha image, I made an algorithm for testing to treat some images that come from a website's captcha and for some types I was able to treat and use tesseract to extract the ...
Mateus Epifanio Linhares's user avatar
0 votes
2 answers
237 views

I am using Tesseract.js using a realtime video stream. I want to draw a box around the words that have been recognised. I found the below code but perhaps it's old so I can't get the bbox nor data....
chongzixin's user avatar
  • 1,995
2 votes
0 answers
87 views

I’m building a C# application that reads single numbers (1–13) from clean, binary‐style images using Tesseract OCR. Everything works perfectly except the number 11, which Tesseract consistently ...
Ali Ihsan Elmas's user avatar
0 votes
1 answer
62 views

I'm using pytesseract to read tabular data out of an image but I'm having trouble with the software making "educated guesses" about characters and word splitting based on context. I have a ...
SpliFF's user avatar
  • 39.1k
0 votes
0 answers
62 views

I am trying to fine-tune an Optical Character Recognition (OCR) model on Tesseract's provided tesstrain repository for Japanese . I tried encoding the bash commands into Python in VSCode as I wanted ...
Jiansen Chan's user avatar
0 votes
0 answers
65 views

I am trying to use the setLanguage method of TesseractOCRParser in Apache Tika in Java. When I pass any Indian language, such as Hindi, Marathi, Tamil, etc., it doesn't work and still displays data in ...
Kunal Goel's user avatar
-2 votes
1 answer
152 views

I am using Tesseract OCR (v5.5.0) in C++ to extract text from images, but I’m encountering issues when the images are rotated. Tesseract’s PSM_AUTO_OSD (PageSegMode::PSM_AUTO_OSD) works well for ...
OMKAR GULAMBE's user avatar
2 votes
0 answers
111 views

I’m working on a project where I have to detect objects in a PDF document. After detecting the objects, I need to read the text at this location since it will be used as the object's name. Example: I’...
Jonathan Fischer's user avatar
1 vote
1 answer
120 views

I'm working on scanned documents (registers) that contain both French and Arabic text. When I run Tesseract OCR with lang='fra', all the French text is extracted perfectly. But when I use lang='ara+...
nevermiind's user avatar
0 votes
0 answers
79 views

I’m working with a scanned PDF that contains a table with two columns, where each column has two lines of text. When I convert the scanned PDF using OCRmyPDF, I'm encountering an issue with the ...
Bikram Kumar Nayak's user avatar
0 votes
0 answers
70 views

I want to use the HOCR that I get from TesseractJS (I work on Javascript) and somehow transform it to be readable by a LLM. The goal is to reade technical documents with prices, tabs, header, lines, ...
Blovnar's user avatar
  • 55
0 votes
0 answers
147 views

ExitCodeException _common.py:271 Traceback (most recent call last): File "C:\<USER>\apps\python\...
Username's user avatar
0 votes
0 answers
28 views

I have a large collection of structured text, in Hebrew & English, mixed in every sentence. Tesseract is re-ordering the words in these sentence, without success. How can I tell Tesseract just to ...
Berry Tsakala's user avatar
1 vote
1 answer
159 views

I’ve been following this tutorial from YouTube: Guide to Tesseract Training https://www.youtube.com/watch?v=KE4xEzFGSU8&t=13s and its corresponding GitHub repository: astutejoe/tesseract_tutorial. ...
Impetus's user avatar
-1 votes
1 answer
68 views

I'm trying to convert the attached image using the pytesseract and opencv libraries in python, but the conversion is not satisfactory, since many characters are converted incorrectly. Does anyone have ...
Cristi Garcia's user avatar
-1 votes
1 answer
58 views

I am working with a Django application, there for some purpose i need to solve captcha i am already saving temporary captcha file but when i try to read captcha using pytesseract it return nothing ...
Mohit Prajapat's user avatar
0 votes
2 answers
94 views

I try to replace the turquoise part (words) from the image with white background to have a clear source for tesseract-ocr. The picture is loaded ok, image mask is created ok. My question is how to ...
HEP's user avatar
  • 63
0 votes
0 answers
62 views

I'm having trouble training Tesseract with my custom dataset. Despite double-checking that my box file coordinates are correct and setting the image resolution to 300 DPI, I keep getting the error &...
Impetus's user avatar
0 votes
0 answers
67 views

I'm also new to flutter and tesseract so might be a problem with my understanding. I am wanting to make an app that takes an image from my gallery and can read the text from them. I'm using Google ML ...
TC_neva's user avatar
  • 61
0 votes
0 answers
67 views

I'm trying to build a small application (C#) that can OCR process some images, extracting the raw text with layout roughly intact (using tabs, spaces or whatever, to position the text in the output ...
Aidal's user avatar
  • 869
1 vote
1 answer
154 views

I am trying to run a PoC for extracting text from an image file using tesseract-ocr in a FastAPI python code file installed in a DigitalOcean FARM Linux droplet and I get an error, while running the ...
Panos Kontopoulos's user avatar
0 votes
1 answer
246 views

I get a strange error message when running a OCRmyPDF command My setup: macOS Sequoia 15.2 OCRmyPDF 16.8.0 (installed by Brew) tesseract 5.5.0 (installed by Brew) Command: ocrmypdf -l deu+fra+eng --...
user9803071's user avatar
0 votes
1 answer
104 views

I'm still new to Python, and I've been using ChatGBT as my tutor, I'm trying to automate a mini game, it needs to read the single character (letter or number) on my screen, and then when the timing is ...
Christian Findninge's user avatar
2 votes
0 answers
69 views

import org.bytedeco.tesseract.TessBaseAPI; public class TesseractInitExample { public static void main(String[] args) { // Create a new Tesseract API instance TessBaseAPI api = ...
Anonymous Creator's user avatar
1 vote
0 answers
66 views

I am using Java - Tess4j-5.13.0.jar to read a pdf containing a table like image. Its the first time using Tess4j/tesseract. Tess4j is located here : https://github.com/nguyenq/tess4j The pdf I am ...
Renier's user avatar
  • 1,830
0 votes
0 answers
43 views

I'm building a Fiber Go application that uses the gosseract package for OCR. While Tesseract works perfectly from the command line, the application fails to compile with the following error: github....
KICH Yassine's user avatar
0 votes
0 answers
171 views

There’s a bot on Discord called Karuta that generates an image with 3 or 4 cards. I’m trying to extract all the information from these cards. I’ve made significant progress, but sometimes the results ...
user28922942's user avatar
0 votes
1 answer
40 views

I am trying to use pytesseract to extract numbers from images. It works for some of them (1, 2, 3, 5, 6, 20...) but I would like to make it work for all of them. Here is a sample of the data that I'm ...
User_123917425's user avatar
0 votes
0 answers
72 views

I need to recognize digits on 7 seg clocks(see picture below), so I use following python code: def detect_date(image: cv2.UMat, bbox:list) -> datetime: gry1 = cv2.cvtColor(image, ...
Sharov's user avatar
  • 460
0 votes
0 answers
51 views

I am trying to use Tesseract to create a small Windows application that allows the user to: Take a screenshot of the monitor and cut a smaller portion containing a table (the table always has the ...
Riccardo's user avatar
0 votes
0 answers
56 views

I'm using Tesseract both on my Windows 10 machine and in the Docker container based on the alpine:3.21 image. Windows tesseract --version is tesseract v5.5.0.20241111 leptonica-1.85.0 libgif 5.2.2 :...
Roman Khomyshynets's user avatar
0 votes
0 answers
442 views

I'm trying to write a simple application in C# that uses Naps2.sdk to scan text documents and save a searchable PDF using the OCR feature. However, the output is a PDF with non-selectable text. I'm ...
azathoth's user avatar
0 votes
1 answer
107 views

I'm using FSCrawler 2.10 to crawl trough hundreds of PDF files, and with Tessarect OCR extract text from scanned documents/images. It works, successfully configured everything, however the folder I am ...
Petar Stankovic's user avatar

1
2 3 4 5
89