
I'm replacing requests.get() with pd.read_csv() and would like to write some exception logic if pandas does not get the equivalent of a status code 200.

With requests, I can write:

response = requests.get(report_url)
if response.status_code != 200:

How can I apply the same logic to pd.read_csv()? Are there any status codes I can check on?

  • Don't you get an error if read_csv fails with a URL? Commented Jul 18, 2022 at 20:07
  • I'm not sure how to test this outside of passing an incorrect URL, which doesn't test exactly what I want to check against. Commented Jul 18, 2022 at 20:09
  • You can't get a status code with read_csv() - it simply raises an error when it can't read the URL. You have to use requests.get() to check the status and fetch the data, and later use read_csv( io.StringIO( text ) ). Or you can use try/except to catch the error when it can't read the data. Commented Jul 18, 2022 at 20:11
  • Hmm, that's odd. I can pass an external URL to read_csv(), so I'd assume their goal with this feature was to replace any need for requests. Commented Jul 18, 2022 at 20:14
  • You can use a URL in read_csv(), but this function doesn't have a method that gives you the status code. It simply raises an error when it can't read the URL. Commented Jul 18, 2022 at 20:15

2 Answers


You can use a URL in read_csv(), but it has no method that gives you the status code. It simply raises an error when the response has a non-200 status code, and you have to use try/except to catch it. There is an example in the other answer.

But if you have to use requests, then you can later use io.StringIO to create a file-like object (a file in memory) and pass it to read_csv().

import io
import requests
import pandas as pd

response = requests.get("https://people.sc.fsu.edu/~jburkardt/data/csv/addresses.csv")

print('status_code:', response.status_code)

#if response.status_code == 200:
if response.ok:
    df = pd.read_csv( io.StringIO(response.text) )
else:
    df = None

print(df)

In the same way, you can use io.StringIO when you build a web page that receives a csv file uploaded through an HTML <form>.
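For example, a minimal sketch of that idea - the variable uploaded_bytes and the sample CSV content are made up here, standing in for whatever raw bytes a <form> upload handler gives you:

```python
import io
import pandas as pd

# hypothetical stand-in for the raw bytes received from an HTML <form> upload
uploaded_bytes = b"Name,Age\nAlice,30\nBob,25\n"

# decode to text and wrap in a file-like object for read_csv()
df = pd.read_csv(io.StringIO(uploaded_bytes.decode("utf-8")))

print(df)
```

io.BytesIO(uploaded_bytes) works too if you'd rather skip the explicit decode.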


As far as I know, read_csv(url) works in a similar way - it downloads the file data from the server (using urllib under the hood rather than requests) and then parses it.
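For completeness, a sketch of the try/except route mentioned in the comments - the helper name read_csv_or_none is made up, and it assumes pandas' default urllib-based fetching, so failures arrive as urllib.error exceptions rather than requests ones:

```python
import urllib.error
import pandas as pd

def read_csv_or_none(url):
    """Hypothetical helper: return a DataFrame, or None if the URL can't be read."""
    try:
        return pd.read_csv(url)
    except urllib.error.HTTPError as err:
        # err.code carries the HTTP status (e.g. 404) - the closest thing
        # to response.status_code you can get out of read_csv()
        print("HTTP error:", err.code)
        return None
    except urllib.error.URLError as err:
        # unreachable host / DNS failure - no status code available
        print("connection error:", err.reason)
        return None
```

Calling it with a valid URL (or any file-like object) returns the DataFrame as usual; a failing URL returns None instead of crashing.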


2 Comments

I had forgotten to add io to my answer, but you already did; I deleted mine because yours is a lot more complete.
Actually looks like this is the route I'm going to take. Thank you.

My suggestion is to write a custom reader that makes it possible to check that a URL is valid before reading it, although this somewhat defeats the purpose.

import urllib.error
import pandas as pd

def custom_read(url):
    try:
        return_file = pd.read_csv(url)
    except (urllib.error.HTTPError, urllib.error.URLError):
        # pandas fetches http(s) URLs with urllib, so failures surface as
        # urllib errors (HTTPError for a bad status code, URLError for an
        # unreachable host), not as requests exceptions
        raise
    else:
        return return_file

A valid URL will work

my_file = custom_read("https://people.sc.fsu.edu/~jburkardt/data/csv/addresses.csv")

This fails and raises an error

my_file1 = custom_read("https://uhoh.com")

Otherwise, there is no way to access the status code of a URL from a DataFrame object once it has been read.

7 Comments

Aw, that just seems inefficient having to run two http requests.
@Bonteq you can use read_csv( io.StringIO( response.text ) ) instead of running read_csv(url)
Sorry, I have edited; you only need to check that an error was raised. @Bonteq
@furas I think you're right, that's probably the best route here.
@Bonteq when read_csv gets a non-200 status code it raises an error, and you have to use try/except to catch it
