Python 3 Get HTTP page

Question

How can I get python to get the contents of an HTTP page? So far all I have is the request and I have imported http.client.

Greg Hewgill · Accepted Answer · 2010-01-07 21:53:54Z

56

Using urllib.request is probably the easiest way to do this:

import urllib.request
f = urllib.request.urlopen("http://stackoverflow.com")
print(f.read())

edited Jan 7, 2010 at 21:53

answered Jan 7, 2010 at 21:48

Greg Hewgill

1.0m192 gold badges1.2k silver badges1.3k bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

BiscottiGummyBears Over a year ago

Tried that and I got "AttributeError: 'module' object has no attribute 'urlopen'"

Greg Hewgill Over a year ago

Sorry, I just noticed that you were using Python 3. I've updated my example to match.

Greg Hewgill Over a year ago

@Davide Gualano: The Python 2.x urllib2 module has been rolled into the Python 3.x urllib set of modules: docs.python.org/library/urllib2.html

Davide Gualano Over a year ago

@Greg: my bad, I didn't read the question title carefully enough :)

Quentin · Accepted Answer · 2019-12-26 10:54:57Z

Usage built-in module "http.client"

import http.client

connection = http.client.HTTPSConnection("api.bitbucket.org", timeout=2)
connection.request('GET', '/2.0/repositories')
response = connection.getresponse()
print('{} {} - a response on a GET request by using "http.client"'.format(response.status, response.reason))
content = response.read().decode('utf-8')
print(content[:100], '...')

Result:

200 OK - a response on a GET request by using "http.client" {"pagelen": 10, "values": [{"scm": "hg", "website": "", "has_wiki": true, "name": "tweakmsg", "links ...

Usage third-party library "requests"

response = requests.get("https://api.bitbucket.org/2.0/repositories")
print('{} {} - a response on a GET request by using "requests"'.format(response.status_code, response.reason))
content = response.content.decode('utf-8')
print(content[:100], '...')

Result:

200 OK - a response on a GET request by using "requests" {"pagelen": 10, "values": [{"scm": "hg", "website": "", "has_wiki": true, "name": "tweakmsg", "links ...

Usage built-in module "urllib.request"

response = urllib.request.urlopen("https://api.bitbucket.org/2.0/repositories")
print('{} {} - a response on a GET request by using "urllib.request"'.format(response.status, response.reason))
content = response.read().decode('utf-8')
print(content[:100], '...')

Result:

200 OK - a response on a GET request by using "urllib.request" {"pagelen": 10, "values": [{"scm": "hg", "website": "", "has_wiki": true, "name": "tweakmsg", "links ...

Notes:

Python 3.4
Result from the responses most likely will be differ only content

dimsum88 · Accepted Answer · 2016-05-18 06:04:34Z

3

You can also use the requests library. I found this particularly useful because it was easier to retrieve and display the HTTP header.

import requests

source = 'http://www.pythonlearn.com/code/intro-short.txt'

r = requests.get(source)

print('Display actual page\n')
for line in r:
    print (line.strip())

print('\nDisplay all headers\n')
print(r.headers)

answered May 18, 2016 at 6:04

dimsum88

574 bronze badges

1 Comment

Nam G VU Over a year ago

Is this Python 3?

Anthony Awuley · Accepted Answer · 2018-11-09 19:08:18Z

1

pip install requests

import requests

r = requests.get('https://api.spotify.com/v1/search?type=artist&q=beyonce')
r.json()

answered Nov 9, 2018 at 19:08

Anthony Awuley

4,00334 silver badges20 bronze badges

Comments

kenorb · Accepted Answer · 2015-10-15 13:30:54Z

0

Add this code which can format data for human reading:

text = f.read().decode('utf-8')

edited Oct 15, 2015 at 13:30

kenorb

169k95 gold badges712 silver badges796 bronze badges

answered Oct 15, 2015 at 7:53

SKGoC

212 bronze badges

Collectives™ on Stack Overflow

Python 3 Get HTTP page

5 Answers 5

4 Comments

Comments

1 Comment

Comments

Comments

Your Answer

Linked

Hot Network Questions

Collectives™ on Stack Overflow

5 Answers 5

4 Comments

Comments

1 Comment

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Linked

Related