Xpath to select text from a child node and current node at once

Question

I'm using scrapy and I got to this point where I'd like to extract the text from a list with the following HTML structure:

u'<div id="someId">'
u'<p><strong>Text1:</strong> next to text 1</p>'
u'<p><strong>Text2:</strong> next to text 2</p>'
u'<p><strong>Text3:</strong> next to text </p>'
u'</div>'

so I'd like to get just the text:

Text1: next to text1

Text2: next to text2

Text3: next to text3

I want to extract the text with XPath as much as possible, I've been trying to use some XPath predicates without resolving my issue.

with

response.xpath('//*[@id="someid"]/p/text()').extract()

I don't get the text for the strong tag within P

any help will be more than appreciated.

eLRuLL · Accepted Answer · 2016-12-10 20:36:33Z

4

you were close:

'//*[@id="someid"]/p//text()'

This will get you a list with all the text inside that p tag.

answered Dec 10, 2016 at 20:36

eLRuLL

18.8k9 gold badges79 silver badges106 bronze badges

Sign up to request clarification or add additional context in comments.

2 Comments

pedrommuller Over a year ago

Thanks, I wasn't aware of "//"

eLRuLL Over a year ago

my pleasure @jack.the.ripper

Collectives™ on Stack Overflow

Xpath to select text from a child node and current node at once

1 Answer 1

2 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

2 Comments

Your Answer

Sign up or log in

Post as a guest

Related