Question

此代码：

    url="http://www.royalcanin.fr/nos-aliments/gammes-pour-chiens/tous-les-aliments-pour-chiens/les-aliments-chez-les-veterinaires/chiens-en-bonne-sante/small/chien-sterilise/neutered-adult-small-dog"## read URL from an array coming from an Url-CSV
#print(url)

page_0=urllib.request.urlopen(url)
soup_0 = BeautifulSoup(page_0.read(),"html.parser") 
restricted_webpage_title_indication= soup_0.find( "div", {"class":"bloc"} ) # to get title and indication
readable_restricted_title_indication=str(restricted_webpage_title_indication)
soup_title_indication=BeautifulSoup(readable_restricted_title_indication,"html.parser")

indication=[]


for li in soup_title_indication.find_all('li'):
    indication.append(li.get_text().strip())

Pair_indication=["Indications",indication]
print(Pair_indication)

给我以下印刷品：

['Indications', ['Risque de prise de poids', 'Sensibilité buccodentaire', "Risque de calculs d'oxalate et de struvite"]]

为什么最后一个元素用“”引用而不是像前两个一样？我在这里不明白的是，在网站上，三个“li”被标记并以相同的方式书写。像这样：

Risque de prize de poids

Sensibilitébuccodentaire

Risque de calculs d'oxalate et de struvite

为什么会这样？我错过了什么？谢谢你的帮助！

Answer 1

你没有遗漏任何东西，Python用双引号打印最后一个，因为该字符串的主体已经有单引号，所以不是用单引号打印并显示内部单引号转义，Python使用双引号打印该字符串。 / p>

无论列表中的所有元素都是字符串，区别仅在于打印时。

一个非常简单的例子来展示这个 -

>>> l = ['123','12\'3']
>>> l
['123', "12'3"]
>>> repr(l[1])
'"12\'3"'
>>> print(repr(l[1]))
"12'3"

请注意以上只是repr()处理它的方式，在写入csv时，csv模块将以不同方式正确处理它。示例 -

>>> l = ['123','12\'3',"222'333"]
>>> with open('b.csv','w') as f:
...     writer = csv.writer(f,quotechar="'")
...     writer.writerow(l)
...
24
>>> with open('b.csv','r') as f:
...     reader = csv.reader(f,quotechar="'")
...     for line in reader:
...             print(line)
...
['123', "12'3", "222'333"]
[]

csv文件看起来像 -

123,'12''3','222''333'

报废和字符串格式化问题：一次废话与''另一次与“”

1 个答案: