我正在开展项目,在网页内容中搜索一些数据
from lxml import html
import requests
def tabletPhone(webAddress):
page = requests.get(webAddress)
tree = html.fromstring(page.content)
product = tree.xpath("""//h1[@class="product_title entry-\
title"]/text()""")
price = tree.xpath("""//span[@class='price-number']/text()""")
availability = tree.xpath("//n:link",namespaces={'n':'availability'})
return product,price,availability
我找到产品的可用性有问题,html代码类似于:
<link itemprop="availability" href="http://schema.org/InStock" />
是否可以返回{&#39;可用性&#39;:&#39; http://schema.org/InStock&#39;}或返回&#39; http://schema.org/InStock&#39;