Python - Scraping a PDF file from a URL

Question

I want to scrape pdf files from this site https://www.sigmaths.net/Reader.php?var=manuels/ph/physique_pilote_7b.pdf I tried this code for that but it doesn't work. Can anybody tell me why, please?

res = requests.get('https://www.sigmaths.net/Reader.php?var=manuels/ph/physique_7b.pdf')
with open('C:\\Users\\sioud\\Desktop\\Manuels scolaires TN\\1\\test.pdf', 'wb') as f:
f.write(ress.content)

What doesn't work? What does this code do? What do you expect it to do? What output/errors do you see? Is your code properly formatted when you're trying to run it (is f.write() correctly indented in the with block)? — gen_Eric
– gen_Eric, Commented Jan 28, 2021 at 18:22

Ajay · Accepted Answer · 2021-01-28 18:35:52Z

1

res = requests.get('https://www.sigmaths.net/manuels/ph/physique_7b.pdf',stream=True)
with open('test.pdf', 'wb') as f:
    f.write(res.content)

your url is pointing to a reader https://www.sigmaths.net/Reader.php?var=manuels/ph/physique_7b.pdf, remove the 'reader.php?var= for the actual pdf

answered Jan 28, 2021 at 18:35

Ajay

5,3092 gold badges26 silver badges30 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

Steffi Keran Rani J · Accepted Answer · 2021-01-30 09:14:21Z

0

You can also use urlretrieve. Check out my solution code.

from urllib.request import urlretrieve
pdfurl = u"https://www.sigmaths.net/manuels/ph/physique_7b.pdf";
urlretrieve(pdfurl, "test.pdf")

And you will find the required pdf download under the name test.pdf

edited Jan 30, 2021 at 9:14

answered Jan 30, 2021 at 9:07

Steffi Keran Rani J

4,1734 gold badges41 silver badges63 bronze badges

Collectives™ on Stack Overflow

Python - Scraping a PDF file from a URL

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related