从外部html源自定义对象读取

时间:2013-01-28 21:54:55

标签: php html

我从外部html源读取时遇到问题  我想要的只是在我的案例'HSDPA 2100'中读取自定义对象 但我的实际代码是从外部源读取所有nfo类。

外部HTML:

<table cellspacing="0">
<tbody><tr>
<th rowspan="8" scope="row">General</th>
<td class="ttl"><a href="network-bands.php3">2G Network</a></td>
<td class="nfo">CDMA 800 / 1900 </td>
</tr><tr>
<td class="ttl">&nbsp;</td>
<td class="nfo">GSM 850 / 900 / 1800 / 1900 </td>
</tr>
<tr>
<td class="ttl"><a href="network-bands.php3">3G Network</a></td>
<td class="nfo">HSDPA 2100 </td>
</tr>
<tr>
<td class="ttl">&nbsp;</td>
<td class="nfo">CDMA2000 1xEV-DO </td>
</tr>
<tr>
<td class="ttl"><a href="network-bands.php3">4G Network</a></td>
<td class="nfo">LTE 800 </td>
</tr>
<tr>
<td class="ttl"><a href="glossary.php3?term=sim">SIM</a></td>
<td class="nfo">Mini-SIM</td>
</tr><tr>
<td class="ttl"><a href="#" onclick="helpW('h_year.htm');">Announced</a></td>
<td class="nfo">2013, January</td>
</tr>
<tr>
<td class="ttl"><a href="#" onclick="helpW('h_status.htm');">Status</a></td>
<td class="nfo">Coming soon. Exp. release 2013, February</td>
</tr>
</tbody></table><table cellspacing="0">
<tbody><tr>
<th rowspan="2" scope="row">Body</th>
<td class="ttl"><a href="#" onclick="helpW('h_dimens.htm');">Dimensions</a></td>
<td class="nfo">-</td>
</tr><tr>
<td class="ttl"><a href="#" onclick="helpW('h_weight.htm');">Weight</a></td>
<td class="nfo">&nbsp;</td>
</tr>

</tbody></table><table cellspacing="0">
<tbody><tr>
<th rowspan="4" scope="row">Display</th>
<td class="ttl"><a href="glossary.php3?term=display-type">Type</a></td>
<td class="nfo">TFT capacitive touchscreen, 16M colors</td>
</tr><tr>
<td class="ttl"><a href="#" onclick="helpW('h_dsize.htm');">Size</a></td>
<td class="nfo">1080 x 1920 pixels, 5.9 inches (~373 ppi pixel density)</td>
</tr>
<tr>
<td class="ttl"><a href="glossary.php3?term=multitouch">Multitouch</a></td>
<td class="nfo">Yes</td>
</tr>
<tr><td class="ttl">&nbsp;</td><td class="nfo">- Flux UX UI</td> 

我正在尝试使用此代码:

  <?php
  include_once('/simple_html_dom.php');
  $dom = file_get_html("http://www.site.com/pantech_vega_no_6-5268.php");
  // alternatively use str_get_html($html) if you have the html string already...
  foreach ($dom->find('td[class=nfo]') as $node)
{
$result = $node->innertext;
$price = explode(",", $result);
echo $price[0];
} 
?>

我收到了这个:CDMA 800 / 1900 GSM 850 / 900 / 1800 / 1900 HSDPA 2100 CDMA2000 1xEV-DO LTE 800 Mini-SIM2013Coming soon. Exp. r... etc

我想要的是HSDPA 2100,但对于其他型号的手机,价值可以是HSDPA 1900或其他,而HSPDA始终是稳定的。

1 个答案:

答案 0 :(得分:1)

所有td都有相同的类名“nfo”,你遍历所有元素,所以你得到的结果是预期的。

如果您想要的数据始终位于第三行,则可以填充数组而不是获取变量,然后获取第三个值。喜欢这个$ result [2]

更新:如果HSDPA总是在那里检查它。

     <?php
      include_once('/simple_html_dom.php');
      $dom = file_get_html("http://www.site.com/pantech_vega_no_6-5268.php");
      // alternatively use str_get_html($html) if you have the html string already...
      foreach ($dom->find('td[class=nfo]') as $node)
    {
    $result = $node->innertext;
if (strpos($result, 'HSDPA') === false)
{
continue;
}
    $price = explode(",", $result);
    echo $price[0];
break;
    } 
    ?>