Extracting text between div tags - Simple Html Dom Parser [closed]

Problem description:

Code:

$html = file_get_html('http://url.com');
$ret = $html->find('div[samplediv]');
echo $ret;

The output I get is just Array, which I take to mean it is empty. I am sure the div is present on the page I am scraping.

Another thing I am trying to achieve is to take the text from the HTML. When I simply convert it to plain text, the result contains a lot of unwanted numbers and other noise. What I want is the text that I see in the browser, rather than all of the text in the HTML.

All suggestions are welcome.

Looks like you're echoing the array that find() returns rather than a single element, which is why you see Array. Try

echo $ret[0]->innertext;

to output just the contents of the first matching div.
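
Here is a rough sketch of the whole thing, in case it helps. It assumes simple_html_dom.php is on your include path, and it uses a made-up HTML string with str_get_html() instead of your real URL so it can be tried offline. I have also assumed that samplediv is the div's id (hence div#samplediv); if it is a class, the selector would be div.samplediv, and div[samplediv] only matches divs that have an attribute named samplediv. Passing 0 as the second argument to find() gives you a single element (or null) instead of an array.

```php
<?php
// Assumption: simple_html_dom.php from the Simple HTML DOM project is available.
require_once 'simple_html_dom.php';

// Hypothetical markup standing in for the page being scraped.
$markup = '<html><body><div id="samplediv">Hello <b>world</b></div></body></html>';

$html = str_get_html($markup);

// With an index as the second argument, find() returns one element or null,
// not an array of elements.
$div = $html->find('div#samplediv', 0);

if ($div !== null) {
    echo $div->innertext, "\n"; // inner HTML of the div, tags included
    echo $div->plaintext, "\n"; // text of the div with the tags stripped
} else {
    echo "No matching div found\n";
}
```

For the "text I see in the browser" part, plaintext is probably closer to what you want than innertext, since it drops the markup.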

PS: I just looked this up on Google and found http://simplehtmldom.sourceforge.net/manual.htm