我正在尝试取消这种site。
Json响应似乎是unicode with the "\u0022" instead of the quotation mark。
我已经看过一些主题并找到答案的开头,但是如果使用解码方法,我必须处理字符串响应,因此解析起来并不方便。
from scrapy.http import FormRequest
import scrapy
import json
class MySpider(scrapy.Spider):
name = 'expert'
#custom_settings={ 'FEED_URI': "export/%(time)s.csv",
#FEED_FORMAT': 'csv'}
def start_requests(self):
url = "https://www.expertsantevisuelle.com/findOpticians"
frmdata = {"user_position[lat]": "50.62925", "user_position[lng]":"3.057256"}
headers = {"charset":"UTF-8",
"Content-Type": "application/x-www-form-urlencoded; charset=UTF-8",
"Accept-Encoding":"gzip, deflate, br",
"X-Requested-With": "XMLHttpRequest",
}
return [scrapy.FormRequest(url,
formdata= frmdata,
callback=self.parse,
headers = headers)]
def parse(self, response):
data = json.loads(response.body.decode())
print ('-------------------------')
print (type(data))
print ('-------------------------')
您知道解析此JSON响应的更好方法吗?