Scrapy FormRequest解码\\ u0022

时间:2020-03-16 10:05:49

标签: unicode scrapy

我正在尝试取消这种site

Json响应似乎是unicode with the "\u0022" instead of the quotation mark

我已经看过一些主题并找到答案的开头,但是如果使用解码方法,我必须处理字符串响应,因此解析起来并不方便。

from scrapy.http import FormRequest
import scrapy
import json

class MySpider(scrapy.Spider):
  name = 'expert'

#custom_settings={ 'FEED_URI': "export/%(time)s.csv",
#FEED_FORMAT': 'csv'}

def start_requests(self):
    url = "https://www.expertsantevisuelle.com/findOpticians"
    frmdata = {"user_position[lat]": "50.62925", "user_position[lng]":"3.057256"}
    headers = {"charset":"UTF-8", 
               "Content-Type": "application/x-www-form-urlencoded; charset=UTF-8", 
               "Accept-Encoding":"gzip, deflate, br",
               "X-Requested-With": "XMLHttpRequest",
               }
    return [scrapy.FormRequest(url,
                               formdata= frmdata,
                               callback=self.parse,
                               headers = headers)]

def parse(self, response):
    data = json.loads(response.body.decode())

    print ('-------------------------')
    print (type(data))
    print ('-------------------------')

您知道解析此JSON响应的更好方法吗?

0 个答案:

没有答案