Python3: Download PDF to memory and convert first page to image

I am trying to do the following:

Download a PDF file to memory
Convert the first page to an image
Use that image with tweepy

I tried the following code, but ran into an error.

from PIL import Image
from pdf2image import convert_from_path
from urllib.request import urlopen
from io import StringIO, BytesIO

url = 'http://somedomain.com/assets/applets/internet.pdf'
scrape = urlopen(url) # for external files
pdfFile = BytesIO(scrape.read())
pdfFile.seek(0)
pages = convert_from_path(pdfFile,last_page=1, dpi=100)

for page in pages:
    page.save('/home/out.jpg', 'JPEG')

Here is the error:

TypeError: Can't convert '_io.BytesIO' object to str implicitly

The generated image will later be uploaded to Twitter with tweepy. I don't need to store it on disk, which is why I am trying to do everything in memory. Could anybody help me, please?

asked Jun 8, 2018 at 15:41

Lionking


You need to use the convert_from_bytes method instead of convert_from_path.

– kip, Jun 8, 2018 at 15:44

@kip I changed the code to pages = convert_from_bytes(pdfFile, last_page=1, dpi=100) and imported the function as well, but I still get an error: TypeError: a bytes-like object is required, not '_io.BytesIO'

– Lionking, Jun 8, 2018 at 21:54

Try passing the bytes from the BytesIO object to that method, for example with getvalue(): convert_from_bytes(pdfFile.getvalue(), ...). That said, I think scrape.read() already returns a bytes object, so convert_from_bytes(scrape.read(), ...) should work directly.

– kip, Jun 9, 2018 at 3:29
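
The distinction the comments above rely on can be sketched with the standard library alone. A BytesIO is a file-like wrapper, not a path string and not a bytes object, which matches both error messages quoted in the thread. The placeholder payload below is an assumption that stands in for the real download.

```python
from io import BytesIO

# Placeholder payload (an assumption); in the question this is scrape.read().
raw = b"%PDF-1.4 placeholder bytes"

# File-like wrapper around the payload, as built in the question.
pdf_file = BytesIO(raw)

# A BytesIO object is not itself bytes-like, hence the second TypeError.
is_bytes_like = isinstance(pdf_file, (bytes, bytearray))

# Its contents can be recovered as bytes in two ways.
from_getvalue = pdf_file.getvalue()  # whole buffer, regardless of cursor position
from_read = pdf_file.read()          # from the current cursor position to the end
```

Note that an HTTP response body can only be read once: if scrape.read() has already been consumed to build the BytesIO, a second scrape.read() returns an empty bytes object, so either keep the first result in a variable or use getvalue().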

Using scrape.read() directly did the trick. Thanks a lot.

– Lionking, Jun 9, 2018 at 9:25
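
Putting the thread together, a minimal in-memory sketch might look like the following. It assumes pdf2image (with poppler) and Pillow are installed; the helper names download_pdf and first_page_as_jpeg are hypothetical and not part of any library. The JPEG is written into a BytesIO rather than to disk, as the question asks.

```python
from io import BytesIO
from urllib.request import urlopen


def download_pdf(url):
    """Return the raw bytes of the PDF at `url` without writing to disk."""
    with urlopen(url) as response:
        return response.read()


def first_page_as_jpeg(pdf_bytes, dpi=100):
    """Render page 1 of an in-memory PDF; return the JPEG data in a BytesIO."""
    # Imported lazily so the module still loads where pdf2image is missing.
    from pdf2image import convert_from_bytes

    # convert_from_bytes expects a bytes object, not a BytesIO or a path.
    pages = convert_from_bytes(pdf_bytes, dpi=dpi, first_page=1, last_page=1)

    buffer = BytesIO()
    # JPEG has no alpha channel, so normalise the mode before saving.
    pages[0].convert("RGB").save(buffer, format="JPEG")
    buffer.seek(0)  # rewind so a consumer can read from the start
    return buffer
```

With the URL from the question, first_page_as_jpeg(download_pdf(url)) would return a file-like object that holds the image. Whether tweepy accepts such an object directly depends on the tweepy version in use, so check its media upload documentation before relying on it.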
