I am trying to OCR pdf file with tesseract, but it says:
Tesseract Open Source OCR Engine name_to_image_type:Error:Unrecognized image type:upload526.pdf IMAGE::read_header:Error:Can't read this image type:upload526.pdf tesseract:Error:Read of file failed:upload526.pdf Segmentation fault
I need it to make a database to search through pdfs that were scanned manually (to images)... What am I doing wrong? I read that it supports pdfs... No idea what version it is as tesseract --version or tesseract -v doesn't work at all.