2

I found the HTML Agility Pack useful and easy to use for screen scraping web sites. What's the equivalent library for HTML screen scraping in Java, Ruby, Python?

2 Answers 2

5

Found what I was looking for: Options for HTML scraping?

Sign up to request clarification or add additional context in comments.

1 Comment

The link is broken, could you please share what did you find years ago?
3

BeautifulSoup is the standard Python screen scraping tool.

Recently, however, I used the (incomplete at the moment) pyQuery, which is more or less a rewrite of jQuery into python, and found it to be very useful.

1 Comment

I would also suggest Scrapy for a robust infrastructure.

Your Answer

By clicking “Post Your Answer”, you agree to our terms of service and acknowledge you have read our privacy policy.

Start asking to get answers

Find the answer to your question by asking.

Ask question

Explore related questions

See similar questions with these tags.