Extract text between div tags - Simple Html Dom Parser [closed]
Question:
Code:
$html = file_get_html('http://url.com');
$ret = $html->find('div[samplediv]');
echo $ret;
The output I get is just Array, which I take to mean it is empty. I am sure the div is present on the page I am scraping.
Another thing I am trying to achieve is to take the text from the HTML. When I simply convert the page to plaintext, I get a lot of unwanted numbers and other clutter. What I want is only the text that I see in the browser, not all of the text in the HTML.
All suggestions are welcome.
Answer
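Looks like you're echoing the array that find() returns rather than a single element; Array doesn't mean it is empty. Pass an index to get just the first match. Try
$ret = $html->find('div[samplediv]', 0);
echo $ret->innertext;
to just output the contents of the div. Use $ret->plaintext instead if you only want the text without the tags.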
PS: I just looked this up on Google and found http://simplehtmldom.sourceforge.net/manual.htm
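For the second part of the question, here is a minimal sketch of pulling only the text of one div. It assumes that simple_html_dom.php sits next to the script and that samplediv is the div's id (hence the selector div#samplediv; a selector like div[samplediv] instead matches a div that has an attribute named samplediv). The helper name extractText and the sample markup are made up for illustration and are not part of the library.

```php
<?php
// Assumption: simple_html_dom.php is in the same directory; adjust the path.
require_once 'simple_html_dom.php';

/**
 * Return the text of the first element matching $selector, or null
 * when the markup cannot be parsed or nothing matches.
 * (extractText is a hypothetical helper, not a library function.)
 */
function extractText(string $markup, string $selector): ?string
{
    $html = str_get_html($markup);
    if (!$html) {
        return null; // empty or unparseable input
    }

    // The second argument (0) asks for the first match only,
    // so we get one element (or null) instead of an array.
    $node = $html->find($selector, 0);
    $text = $node ? trim($node->plaintext) : null;

    $html->clear(); // free the parser's memory
    return $text;
}

// Made-up sample markup standing in for the scraped page.
$sample = '<html><body><div id="samplediv">Hello</div></body></html>';
var_dump(extractText($sample, 'div#samplediv'));
var_dump(extractText($sample, 'div#missing'));
```

For a live page, the same lookup should work on the object returned by file_get_html('http://url.com'); checking for null before reading plaintext avoids an error when the div is not found.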