'str' object cannot be interpreted as an integer in python

Question

I am scrapping cricket test match details i have tested the results now i want to save it inside the file. while saving the html in file I am getting str object cannot be interedpreted as an integer

this is my code

for i in range(0, 2000):
    url = 'http://search.espncricinfo.com/ci/content/match/search.html?search=test;all=1;page=%s' %i
    html = requests.get(url)

    print ('Checking page %s of 2000' %(i+1))

    soupy = bs4.BeautifulSoup(html.text, 'html.parser')

    time.sleep(1)
    for new_host in soupy.findAll('a', {'class' : 'srchPlyrNmTxt'}):
        try:
            new_host = new_host['href']
        except:
            continue
        odiurl = BASE_URL + new_host
        new_host = odiurl
        print(new_host)
        html = requests.get(new_host).text
        with open('espncricinfo-fc/{0!s}'.format(str.split(new_host, "/")[4]), "wb") as f:
                f.write(html)

I am getting this error str object cannot be interedpreted as an integer

I am getting error in this line

with open('espncricinfo-fc/{0!s}'.format(str.split(new_host, "/")[4]), "wb") as f:

You're writing in byte mode ("wb"), but I'm guessing you're trying to write str data rather than bytes. What happens if you change requests.get(new_host).text to requests.get(new_host).text.encode()? — JaminSore
– JaminSore, Commented Nov 29, 2018 at 5:39

Shivam Singh · Accepted Answer · 2018-11-29 05:56:39Z

1

If you are using Python 3.x, try changing the last line to

f.write(bytes(html, 'UTF-8'))

Also try this,

new_host = str(new_host['href'])

edited Nov 29, 2018 at 5:56

answered Nov 29, 2018 at 5:43

Shivam Singh

1,6241 gold badge11 silver badges9 bronze badges

Sign up to request clarification or add additional context in comments.

6 Comments

Tayyab Vohra Over a year ago

I am getting the same error as before 'str' object cannot be interpreted as an integer

Shivam Singh Over a year ago

Updated the answer

Shivam Singh Over a year ago

You need to fix the split function, It should be str.split(new_host, "/")[6] and BASE_URL = "http://espncricinfo.com"

Tayyab Vohra Over a year ago

Shivam done both of these above edit but still it is not saving in the file.

Shivam Singh Over a year ago

What are you getting exactly? Does 'espncricinfo-fc' folder exist?

|

phil · Accepted Answer · 2018-11-29 05:40:59Z

0

The problem is your print statement. It should read

print('checking %d etc.' % (i + 1))

answered Nov 29, 2018 at 5:40

phil

5653 silver badges10 bronze badges

3 Comments

Tayyab Vohra Over a year ago

its working i have checked i have also removed this line but the error remains same..

phil Over a year ago

What line number, does new_host as a class have a __str__ method. You print it and use str.split in the final line

phil Over a year ago

What does soupy.findall('a', { ... }) return. A range or a list of strings. You set new_host to BASE_URL + new_host. After setting it to a dictionary lookup new_host['href']

Collectives™ on Stack Overflow

'str' object cannot be interpreted as an integer in python

2 Answers 2

6 Comments

3 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

2 Answers 2

6 Comments

3 Comments

Your Answer

Sign up or log in

Post as a guest

Related