Question

I am trying to parse the links from this page.

I am using the following code:

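// Assumes the PHP Simple HTML DOM Parser file is available on the include path.
require_once 'simple_html_dom.php';

$link = 'https://ws-na.assoc-amazon.com/widgets/cm?t=websitcom05-20&o=1&p=26&l=ur1&category=primeent&banner=1XSEYPQA2R6RS2D2B802&f=ifr';
$pageContent = file_get_contents($link);
$html = str_get_html($pageContent);

$body = $html->find('body', 0, true);

// Print the href of every <area> element inside <body>.
foreach ($body->find('area') as $area)
{
    echo $area->href;
}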

The output is http://rcm-na.amazon-adsystem.com/e/cm/privacy-policy.html?o=1 https://www.amazon.com

The second link is truncated. Please help me get the complete link from the page.

I need this output: https://www.amazon.com/gp/video/offers/ref=dvm_us_dd_as_evgrn?ie=UTF8&linkCode=ur1&redirectToAsin=B01EN8ZQKS&tag=INSERTSTOREID&tag=websitcom05-20

Answer 1

You appear to be using "PHP Simple HTML DOM Parser" (at http://simplehtmldom.sourceforge.net/).

It is not a full DOMDocument implementation, so it may be limited in how it handles links.

For DOMDocument, there is a related question: Using domDocument, and parsing info, I would like to get the 'href' contents of an 'a' tag

It covers getting the href.
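
As a rough illustration of the DOMDocument approach, here is a minimal sketch that reads the href of each area element. The markup in $pageContent is a hypothetical, shortened stand-in for the fetched page (in the question it would come from file_get_contents), and whether this resolves the truncation on the real page has not been verified.

```php
<?php
// Hypothetical sample markup standing in for the fetched page content.
$pageContent = '<html><body><map name="m">'
    . '<area href="http://rcm-na.amazon-adsystem.com/e/cm/privacy-policy.html?o=1">'
    . '<area href="https://www.amazon.com/gp/video/offers/ref=dvm_us_dd_as_evgrn?ie=UTF8&amp;linkCode=ur1">'
    . '</map></body></html>';

// Collect parser warnings internally instead of printing them,
// since real-world HTML is rarely strictly valid.
libxml_use_internal_errors(true);

$doc = new DOMDocument();
$doc->loadHTML($pageContent);
libxml_clear_errors();

$links = [];
foreach ($doc->getElementsByTagName('area') as $area) {
    // getAttribute() returns the attribute value with entities decoded,
    // or an empty string when the attribute is missing.
    $links[] = $area->getAttribute('href');
}

echo implode("\n", $links), "\n";
```

With real input, $pageContent would be replaced by the string returned from file_get_contents($link); the rest of the sketch stays the same.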
