Question

我想从这个xml Feed导入{{3}} 我遇到了一些麻烦，因为我想提取的一些数据有法语重音符号。

url = 'http://www.lnv.fr/xml/ajaccio/calendrier.xml'
r = requests.get(url)
soup = BeautifulSoup(r.content)
matches = soup.findAll('match')

当我这样做时

for match in matches:
    print match.equipedomicile.string

它会打印出来，因为对于像Sète这样的重音标记的团队来说没有问题。

但是当我这样做时

def GetGames():
homeTeamList = []    
for match in matches:
    homeTeam = unicode(match.equipedomicile.text)        
    homeTeamList.append(homeTeam)
return homeTeamList

并且调用该功能列表团队的重音标记不正确。即Sète 现在变成你'\ xe8te'

Answer 1

您获得的是unicode字符串的repr版本，在列表的各个元素上使用print，您将获得正确的输出。

>>> a = [u'S\xe8te']
>>> a
[u'S\xe8te']
>>> print a[0]
Sète

将unicode对象附加到列表时出现问题

1 个答案: