Web Scraping using python(Beautifulsoup)

Question

I am just started learning web scraping using python Beautifulsoup and requests library and using Pycharm tool.

import requests
from bs4 import BeautifulSoup
    
result1 = requests.get("https://www.grainger.com/")
print('result1 is '+ str(result1.status_code))

While I am using this website its keeps on loading and if I use google.com it's giving output.

I wonder why I didn't get output for the above website?

Cho'Gath · Accepted Answer · 2020-10-16 18:07:28Z

1

To get status 200 from this site, specify User-Agent HTTP header:

import requests
from bs4 import BeautifulSoup

headers = {'User-Agent': 'Mozilla/5.0 (X11; Ubuntu; Linux x86_64; rv:81.0) Gecko/20100101 Firefox/81.0'}

result1 = requests.get("https://www.grainger.com/", headers=headers)

print('result1 is '+ str(result1.status_code))

Prints:

result1 is 200

The reason why this is works is because some sites will ignore requests that don't appear to be made from a web browser. By default, requests uses the User-Agent python-requests, so the website can tell you are not requesting the website from a web browser. The reason why your request hangs and eventually times out is likely because their server is ignoring your request.

edited Oct 16, 2020 at 18:07

Cho'Gath

4483 silver badges9 bronze badges

answered Oct 16, 2020 at 17:57

Andrej Kesely

196k15 gold badges60 silver badges105 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

wyatt-stanke · Accepted Answer · 2020-10-16 17:51:44Z

0

Hmm... there are a couple of things.

The website might not exist
You're using http instead of https
That site blocks scraping (send a user agent header)
It might be a problem with requests. Try using a different library.

answered Oct 16, 2020 at 17:51

wyatt-stanke

586 bronze badges

Collectives™ on Stack Overflow

Web Scraping using python(Beautifulsoup)

2 Answers 2

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related