Suppose I have parsed a line of HTML that is the following...
<a href="http://www.google.com" class="blah"><img src="logo.png" border="0"></img><br><span class="red">Go to google!</span></a>
This is just an example...but how would I go about stripping everything EXCEPT the following:
http://www.google.com
logo.png
Go to google!
Also, is it possible to search for wildcards?