Get value of "data-..." attribute with .css selector with Scrapy

Question

I am trying to get the value of a data-attribute with scrapy:

response.css('.product-header-top div::attr("data-background-image")').get()

But I do not get the value of data-background-image and Python throws an error:

raise SelectorSyntaxError(cssselect.parser.SelectorSyntaxError: Got pseudo-element ::FunctionalPseudoElement[::attr(['data-background-image'])] not at the end of a selector

Here is the relevant HTML Code of the webpage:

<div data-background-image="/images/image.jpg" style="background-image: url("/images/image.jpg");"></div>

Thanks

UPDATE F.Hoque is right and it works fine. The website is dynamic and renders the data-background-image with JS. So the ::attr("data-...") is working. Thanks for your help @F.Hoque!

Md. Fazlul Hoque · Accepted Answer · 2022-05-29 15:33:49Z

2

Your CSS selection is working fine. There is a typo ); just remove it.

response.css('.product-header-top div::attr("data-background-image")').get()

Proven by Scrapy shell:

In [26]:  sel.css('div::attr("data-background-image")').get()
Out[26]: '/images/image.jpg'

edited May 29, 2022 at 15:33

answered May 29, 2022 at 12:44

Md. Fazlul Hoque

16.2k5 gold badges15 silver badges33 bronze badges

Sign up to request clarification or add additional context in comments.

8 Comments

Waldgeist Over a year ago

Sadly not: raise SelectorSyntaxError( cssselect.parser.SelectorSyntaxError: Expected ident or '*', got <STRING 'data-background-image' at 24>

Md. Fazlul Hoque Over a year ago

@Waldgeist, I've updated. Now it's working fine. Thanks

Waldgeist Over a year ago

Now it is not throwing an error anymore but returns None :-/ ?

Waldgeist Over a year ago

Oh, thanks. I corrected the typo .. I am adding the domain in front and had to put a str() around the response.css(). Interesting that you get the image url. I still get a none when trying to get the url: print(response.css('.product-header-top div::attr("data-background-image")').get())

Md. Fazlul Hoque Over a year ago

Because the website is most likely dynamic and scrapy can't render js

|

Collectives™ on Stack Overflow

Get value of "data-..." attribute with .css selector with Scrapy

1 Answer 1

8 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

8 Comments

Your Answer

Sign up or log in

Post as a guest

Related