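I am trying to find a way to scrape information from Alibaba's "Agriculture growing media" page. For every product on the page, I want to extract the product name (product_name), the company name (company_name), the minimum order (min_order), and the product image URL (URL_of_product_image).
I want to collect this information one product after another, and keep scraping until the last pagination link is reached.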
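To make the goal concrete, here is a rough sketch of the loop I have in mind, using only the Python standard library. The class names (product-item, product-title, company-name, min-order, next) are hypothetical placeholders, not Alibaba's real markup, so they would have to be replaced after inspecting the actual page. The fetch function is passed in as a parameter, which is my own assumption to keep the sketch testable offline.

```python
from html.parser import HTMLParser
from urllib.parse import urljoin
from urllib.request import Request, urlopen

# Hypothetical class names: inspect the real page and replace them.
ITEM_CLASS = "product-item"
NEXT_CLASS = "next"
FIELD_CLASSES = {
    "product-title": "product_name",
    "company-name": "company_name",
    "min-order": "min_order",
}


class ListingParser(HTMLParser):
    """Collects one dict per product card plus the 'next page' link."""

    def __init__(self):
        super().__init__()
        self.products = []
        self.next_href = None
        self._field = None  # field that the next non-blank text belongs to

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        classes = (attrs.get("class") or "").split()
        if ITEM_CLASS in classes:
            self.products.append({})  # a new product card starts here
            return
        if tag == "a" and NEXT_CLASS in classes:
            self.next_href = attrs.get("href")
        if not self.products:
            return
        if tag == "img" and attrs.get("src"):
            # Keep only the first image seen after the card opened.
            self.products[-1].setdefault("URL_of_product_image", attrs["src"])
        for cls in classes:
            if cls in FIELD_CLASSES:
                self._field = FIELD_CLASSES[cls]

    def handle_data(self, data):
        if self._field and data.strip():
            self.products[-1][self._field] = data.strip()
            self._field = None


def fetch_with_urllib(url):
    """Plain HTTP fetch; the real site may need more than this."""
    request = Request(url, headers={"User-Agent": "Mozilla/5.0"})
    with urlopen(request) as response:
        return response.read().decode("utf-8", errors="replace")


def scrape_all(start_url, fetch=fetch_with_urllib, max_pages=100):
    """Follow 'next' links from start_url and return all product dicts."""
    results, url, seen = [], start_url, set()
    while url and url not in seen and len(seen) < max_pages:
        seen.add(url)
        parser = ListingParser()
        parser.feed(fetch(url))
        for product in parser.products:
            if "URL_of_product_image" in product:
                # Resolve relative and protocol-relative image URLs.
                product["URL_of_product_image"] = urljoin(
                    url, product["URL_of_product_image"]
                )
        results.extend(parser.products)
        # Stop when the page has no 'next' link (the last page).
        url = urljoin(url, parser.next_href) if parser.next_href else None
    return results
```

Calling scrape_all(start_url) would then walk the pages in order. If the listing is rendered by JavaScript or the site rejects automated requests, this plain-HTML approach may return no products at all, and a browser-driven tool would probably be needed instead.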
Code
mysql> show variables like 'character_set_%';
+--------------------------+-----------------------------------------------------------+
| Variable_name            | Value                                                     |
+--------------------------+-----------------------------------------------------------+
| character_set_client     | utf8mb4                                                   |
| character_set_connection | utf8mb4                                                   |
| character_set_database   | utf8mb4                                                   |
| character_set_filesystem | binary                                                    |
| character_set_results    | utf8mb4                                                   |
| character_set_server     | latin1                                                    |
| character_set_system     | utf8                                                      |
| character_sets_dir       | /usr/local/mysql-5.7.23-macos10.13-x86_64/share/charsets/ |
+--------------------------+-----------------------------------------------------------+
8 rows in set (0.01 sec)
Here is an image that may show it more clearly: