Hello Stack Overflow contributors!
I want to scrape multiple pages of a news website; it shows an error message during this step
response = requests.get(page, headers = user_agent)
The error message is
AttributeError: 'int' object has no attribute 'get'
The lines of code are
user_agent = {'user-agent': 'Mozilla/5.0 (Windows NT 10.0; WOW64; Trident/7.0; Touch; rv:11.0) like Gecko'}
#controlling the crawl-rate
start_time = time()
request = 0
def scrape(url):
urls = [url + str(x) for x in range(0,10)]
for page in urls:
response = requests.get(page, headers = user_agent)
print(page)
print(scrape('https://nypost.com/search/China+COVID-19/page/'))
More specifically, this page and pages next to it are what I want to scrape:
https://nypost.com/search/China+COVID-19/page/1/?orderby=relevance
Any helps would be greatly appreciated!!
requestssomewhere else in your code with an integer value.