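file_get_html error, not working

Time: 2017-02-14 15:36:06

Tags: php html dom simple-html-dom scrape

I am using Simple HTML Dom to try to scrape an HTML table. I followed their instructions and looked at many other code examples, but file_get_html does not seem to work.

Here is my code: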

<?php

// Simple HTML Dom Parser
include('simple_html_dom.php');

//$worlds = ["Amera", "Antica", "Astera", "Aurera", "Aurora", "Bellona", "Belobra", "Beneva", "Calmera", "Calva", "Calvera", "Candia", "Celesta", "Chrona", "Danera", "Dolera", "Efidia", "Eldera", "Ferobra", "Fidera", "Fortera", "Garnera", "Guardia", "Harmonia", "Honera", "Hydera", "Inferna", "Iona", "Irmada", "Julera", "Justera", "Kenora", "Kronera", "Laudera", "Luminera", "Magera", "Menera", "Morta", "Mortera", "Neptera", "Nerana", "Nika", "Olympa", "Osera", "Pacera", "Premia", "Pythera", "Quilia", "Refugia", "Rowana", "Secura", "Serdebra", "Shivera", "Silvera", "Solera", "Tavara", "Thera", "Umera", "Unitera", "Veludera", "Verlana", "Xantera", "Xylana", "Yanara", "Zanera", "Zeluna"];

//foreach ($worlds as $world) {
  // All HTML from the online list
  $html = file_get_html('https://secure.tibia.com/community/?subtopic=worlds&world=Antica');

  // Search for the online list table content
  foreach ($html->find('tr[class=Table2]') as $row) {
    $name = $row->find('td', 0)->plaintext;
    $level = $row->find('td', 1)->plaintext;
    $vocation = $row->find('td', 2)->plaintext;

    echo $name . ' | ' . $level . ' | ' . $vocation . '<br>';
  }
//}

?>

I get these errors:

Warning: file_get_contents(): stream does not support seeking in D:\xampp\htdocs\simple_html_dom.php on line 76

Warning: file_get_contents(): Failed to seek to position -1 in the stream in D:\xampp\htdocs\simple_html_dom.php on line 76

Fatal error: Uncaught Error: Call to a member function find() on boolean in D:\xampp\htdocs\index.php:13 Stack trace: #0 {main} thrown in D:\xampp\htdocs\index.php on line 13

What am I doing wrong? The table I am trying to scrape is the "Players Online" table: https://secure.tibia.com/community/?subtopic=worlds&world=Antica

3 answers:

Answer 0: (score: 3)

This is a problem with the simple_html_dom library on recent versions of PHP. To fix it, change "$offset = -1," to "$offset = 0," in the parameter list of the file_get_html function in the simple_html_dom.php file.
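
To illustrate why that change helps, here is a minimal sketch; it is not the library's actual code. Since PHP 7.1, file_get_contents() treats a negative offset as counting back from the end of the stream, which requires seeking, and remote (HTTP) streams cannot seek. That is what the two warnings in the question report. The helper name read_from_start is hypothetical and only shows the same kind of call with an offset of 0.

```php
<?php
// Hypothetical helper (not part of simple_html_dom): reads a URL or file
// with an explicit offset of 0, so PHP never has to seek in the stream.
function read_from_start(string $source)
{
    // Arguments: filename, use_include_path, context, offset.
    // An offset of 0 means "start at the beginning"; -1 would ask PHP 7.1+
    // to seek relative to the end, which HTTP streams do not support.
    return file_get_contents($source, false, null, 0);
}
```

Called with the URL from the question, this should return the raw page source as a string, or false if the request fails.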

Answer 1: (score: 2)

Try this:

$html = str_get_html(file_get_contents($url));
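
A slightly more defensive variant of the same idea, as a sketch: the fatal error in the question occurs because find() is called on a boolean after the fetch failed, so it may help to check the fetched string before parsing it. fetch_body is a hypothetical helper name; str_get_html comes from simple_html_dom.php and appears only in a comment here, so the snippet runs without the library.

```php
<?php
// Hypothetical helper: returns the response body, or null when the fetch fails.
function fetch_body(string $url): ?string
{
    // @ suppresses the PHP warning; failure is reported through the null return.
    $body = @file_get_contents($url);
    return $body === false ? null : $body;
}

// Intended usage, assuming simple_html_dom.php has been included:
//
//   $body = fetch_body($url);
//   if ($body === null) {
//       exit('Could not download the page.');
//   }
//   $html = str_get_html($body);
```

Checking for null first means a download failure produces a clear message instead of a fatal error further down.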

Answer 2: (score: 0)

I don't know Simple HTML Dom very well, but I think you may need a more robust library, such as https://github.com/FriendsOfPHP/Goutte