我使用Rollingcurl抓取各种页面。
滚动卷曲:https://github.com/LionsAd/rolling-curl
<?php
error_reporting(E_ALL);
ini_set('display_errors', 1);
require "RollingCurl.php";
require "tmdb_class.php";
$tmdb = new Tmdb;
if (isset($_GET['action']) || isset($_POST['action'])) {
$action = (isset($_GET['action'])) ? $_GET['action'] : $_POST['action'];
} else {
$action = "";
}
echo " Test<br /><br />";
/*function most_popular($response, $info)
{
$doc = new DOMDocument();
libxml_use_internal_errors(true); //disable libxml errors
if (!empty($response)) {
//if any html is actually returned
$doc->loadHTML($response);
libxml_clear_errors(); //remove errors for yucky html
$xpath = new DOMXPath($doc);
//get all the h2's with an id
$row = $xpath->query("//div[contains(@class, 'lister-item-image') and contains(@class, 'float-left')]/a/@href");
$nexts = $xpath->query("//a[contains(@class, 'lister-page-next') and contains(@class, 'next-page')]");
$names = $xpath->query('//img[@class="loadlate"]');
foreach ($nexts as $next) {
echo "Next URL: " . $next->getAttribute('href') . "<br/>";
}
foreach ($names as $name) {
echo "Release Name: " . $name->getAttribute('alt') . "<br/>";
}
if ($row->length > 0) {
foreach ($row as $row) {
echo $doc->saveHtml($row) . "<br/>";
}
}
}
}*/
if ($action == "most_popular") {
if (isset($_GET['date'])) {
$link = "https://www.imdb.com/search/title?title_type=feature,tv_movie&release_date=,".$_GET['date'];
} else {
$link = "https://www.imdb.com/search/title?title_type=feature,tv_movie&release_date=,2018";
}
$urls = array($link);
$rc = new RollingCurl("most_popular");
$rc->window_size = 20;
foreach ($urls as $url) {
$request = new RollingCurlRequest($url);
$rc->add($request);
}
$stream = $rc->execute();
}
以简单的方式,当然可以调用函数“ most_popular”。但是我想构建并调用自己的类以使用某些功能。
如果函数“ most_popular”在我的课程中,则调用它并不是那么容易:
我的课:
<?php
class Tmdb
{
public function __construct()
{
/* */
}
// SEARCH
public function most_popular($response, $info)
{
$doc = new DOMDocument();
libxml_use_internal_errors(true); //disable libxml errors
if (!empty($response)) {
//if any html is actually returned
$doc->loadHTML($response);
libxml_clear_errors(); //remove errors for yucky html
$xpath = new DOMXPath($doc);
//get all the h2's with an id
$row = $xpath->query("//div[contains(@class, 'lister-item-image') and contains(@class, 'float-left')]/a/@href");
$nexts = $xpath->query("//a[contains(@class, 'lister-page-next') and contains(@class, 'next-page')]");
$names = $xpath->query('//img[@class="loadlate"]');
foreach ($nexts as $next) {
echo "Next URL: " . $next->getAttribute('href') . "<br/>";
}
foreach ($names as $name) {
echo "Release Name: " . $name->getAttribute('alt') . "<br/>";
}
if ($row->length > 0) {
foreach ($row as $row) {
echo $doc->saveHtml($row) . "<br/>";
}
}
}
}
}
有人知道它是如何工作的吗?
这不起作用:
$rc = new RollingCurl($tmdb->most_popular);
或
$rc = new RollingCurl($tmdb->most_popular());
非常感谢您。
答案 0 :(得分:1)
要使用对象方法作为回调,请创建一个包含对象和方法名称的数组。
$rc = new RollingCurl([$tmdb, 'most_popular']);