How to scrape html contents of one div by id using php

Question

The page on another of my domains which I'd like to scrape one div from contains:

<div id="thisone">
    <p>Stuff</p>
</div>

<div id="notthisone">
    <p>More stuff</p>
</div>

Using this php...

<?php
    $page = file_get_contents('http://thisite.org/source.html');
    $doc = new DOMDocument();
    $doc->loadHTML($page);
    foreach ($doc->getElementsByTagName('div') as $node) {
        echo $doc->saveHtml($node), PHP_EOL;
    }
?>

...gives me all divs on http://thisite.org/source.html, with html. However, I only want to pull through the div with an id of "thisone" but using:

foreach ($doc->getElementById('thisone') as $node) {

doesn't bring up anything.

SoWhat · Accepted Answer · 2013-08-08 10:44:19Z

4

$doc->getElementById('thisone');// returns a single element with id this one

Try $node=$doc->getElementById('thisone'); and then print $node

On a side note, you can use phpQuery for a jquery like syntext: pq("#thisone")

answered Aug 8, 2013 at 10:44

SoWhat

5,6222 gold badges30 silver badges64 bronze badges

Sign up to request clarification or add additional context in comments.

Comments

x4rf41 · Accepted Answer · 2013-08-08 10:43:13Z

1

$doc->getElementById('thisone') returns a single DOMElement, not an array, so you can't iterate through it

just do:

$node = $doc->getElementById('thisone');
echo $doc->saveHtml($node), PHP_EOL;

answered Aug 8, 2013 at 10:43

x4rf41

5,3672 gold badges24 silver badges36 bronze badges

Comments

Anshul · Accepted Answer · 2013-08-08 10:49:35Z

1

Look at PHP manual http://php.net/manual/en/domdocument.getelementbyid.php getElementByID returns an element or NULL. Not an array and therefore you can't iterate over it.

Instead do this

<?php
    $page = file_get_contents('example.html');
    $doc = new DOMDocument();
    $doc->loadHTML($page);
    $node = $doc->getElementById('thisone');
     echo $doc->saveHtml($node), PHP_EOL;
?>

On running php edit.php you get something like this

<div id="thisone">
      <p>Stuff</p>
  </div>

answered Aug 8, 2013 at 10:49

Anshul

7201 gold badge5 silver badges19 bronze badges

Collectives™ on Stack Overflow

How to scrape html contents of one div by id using php

3 Answers 3

Comments

Comments

Comments

Your Answer

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

Comments

Comments

Comments

Your Answer

Sign up or log in

Post as a guest

Related