废除阿里巴巴的产品

时间:2018-09-06 11:58:19

标签: python xpath web-scraping scrapy

我正在尝试找到一种方法来从阿里巴巴的Agriculture growing media页中抓取信息。我正在尝试抓取信息product_name,公司名称,min_order,company_name和URL_of_product_image。所有产品中。

我想连续获取一种产品的信息,并继续抓取直到最后一个分页链接。

代码

    mysql> show variables like 'character_set_%';
    +--------------------------+--------------------------------------- 
    --------------------+
    | Variable_name            | Value                                                     
    |
    +--------------------------+--------------------------------------- 
    --------------------+
    | character_set_client     | utf8mb4                                                   
    |
    | character_set_connection | utf8mb4                                                   
    |
    | character_set_database   | utf8mb4                                                   
    |
    | character_set_filesystem | binary                                                    
    |
    | character_set_results    | utf8mb4                                                   
    |
    | character_set_server     | latin1                                                    
    |
    | character_set_system     | utf8                                                      
    |
    | character_sets_dir       | /usr/local/mysql-5.7.23-macos10.13- 
    x86_64/share/charsets/ |
    +--------------------------+--------------------------------------- 
    --------------------+
    8 rows in set (0.01 sec)

以下是可能清晰可见的图片: Here is the picture that might gives the clear vision.

0 个答案:

没有答案