PHP - GET tag from url

Question

I want to get a specific tag from url, from example:

If I have this content:

<div id="hey">
   <div id="bla"></div>
</div>

<div id="hey">
   <div id="bla"></div>
</div>

And I want to get all divs with the id "hey", ( i think its with preg_match_all ), How can I do that?

The content inside the tag can be changed.

I'm not exactly sure what you're asking for here. Are you saying you want to pass in an ID via a query string parameter and then search for that parameter in the page content? Or are you passing that HTML string via a query parameter and you need to parse that? Also, your HTML markup is invalid so you'll likely have a tough time programmatically parsing it under any conditions. — Joe Landsman
– Joe Landsman, Commented Aug 23, 2011 at 22:47
To be clear "from URL", it appears you mean "Given a webpage (whose URL I know) how do I scrape the contents from inside a particular HTML tag?" As it is, it is hard to tell what is being asked, particularly because "GET" (in all caps) in relation to URLs normally refers to a method of form-data encoding. (e.g. http://example.org/?field1=value1 is a URL which could result from a GET form) — Conspicuous Compiler
– Conspicuous Compiler, Commented Aug 23, 2011 at 22:47
FYI, ids are supposed to be single use. If you want to apply styles to multiple elements, you should be defining them to have the same class. Having multiple elements with the same ID can cause issues with JavaScript, forms, etc. — anon
– anon, Commented Aug 23, 2011 at 22:48
To get url ( for example $url ) and to print only the content inside the divs which I want to print ( like "hey"). — Daniel
– Daniel, Commented Aug 23, 2011 at 22:57

leticia · Accepted Answer · 2011-08-23 23:26:04Z

3

I recommend use DOMDocument class instead of regular expressions (is less resource consumer and more clear IMHO).

$content = '<div id="hey">
   <div id="bla"></div>
</div>

<div id="hey">
   <div id="bla"></div>
</div>';

$doc = new DOMDocument();
@$doc->loadHTML($content); // @ for possible not standard HTML
$xpath = new DOMXPath($doc);
$elements = $xpath->query("//div[@id='hey']");

/*@var $elements DOMNodeList */
for ($i=0;$i<$elements->length;$i++) {
    /*@var $curr_element DOMElement */
    $curr_element = $elements->item($i);

    // Here do what you want with the element
    var_dump($curr_element);
}

If you want to get the content from an URL you can use this line instead to fill the variable $content:

$content = file_get_contents('http://yourserver/urls/page.php');

edited Aug 23, 2011 at 23:26

answered Aug 23, 2011 at 23:10

leticia

2,3885 gold badges30 silver badges41 bronze badges

Sign up to request clarification or add additional context in comments.

4 Comments

anon Over a year ago

You should probably also suppress errors on the loadHTML() call, otherwise the DOMDocument will complain loudly about the multiple elements with the same id.

leticia Over a year ago

@AgentConundrum, I test with the HTML with same ids and surprisingly none problems arise, but just in case I add the @ to that line.

anon Over a year ago

What are your error reporting settings? It issues an E_WARNING for me: Warning: DOMDocument::loadHTML() [domdocument.loadhtml]: ID hey already defined in Entity, line: 5

leticia Over a year ago

I set as the initial line error_reporting(E_ALL); and nothing appears. I will check again.

Collectives™ on Stack Overflow

PHP - GET tag from url

1 Answer 1

4 Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

1 Answer 1

4 Comments

Your Answer

Sign up or log in

Post as a guest

Related