Question

我尝试废弃此网址：http://www.timeanddate.com/holidays/us/2013#!hol=25

具体来说，我希望所有来自美国的特别日期 - 2013年的过滤器＆＃34;假期和一些纪念活动＆＃34;。当我去http://www.timeanddate.com/holidays/us/2013我可以看到很多假期后，我应用了＆＃34;假期和一些纪念活动＆＃34;过滤我看到一张表，其中包含我要废弃的所有假期。使用＆＃34;检查元素＆＃34; chrome的功能我发现假期表还包含所有假期，但隐藏的假期表有＃34; dn＆＃34;。因此，我使用BeautifulSoap来过滤掉所有节假日的课程＆＃39; dn＆＃39;并期望只有我点击http://www.timeanddate.com/holidays/us/2013#!hol=25

时才能看到的

不幸的是，到目前为止还没有成功。

r =  requests.get("http://www.timeanddate.com/holidays/us/2013#!hol=25")
print(r.url)
> http://www.timeanddate.com/holidays/us/2013#!hol=25   #URL seems to be right

soup = BeautifulSoup(r.content, "lxml")

n = 0
for link in soup.find_all('tr'):
   if link.get('class')[0] != 'head':  #removes the 2 head rows of the table
       if link.get('class')[0] != 'dn':
          n += 1
print(n)
> 197  # instead of 17 as in Chrome

由于请求似乎以正确的方式处理网址，我不知道为什么我没有得到与Chrome相同的结果？

请求（）是否可能无法处理＆＃34;＃！hol = 25＆＃34; URL的一部分？

Python请求模块 - 结果与chrome

0 个答案: