for i in range(1,1000000):
page = urllib2.urlopen("http://www.palgrave.com/products/title.aspx?pid="+str(i))
print "http://www.palgrave.com/products/title.aspx?pid="+str(i)
soup = BeautifulSoup(page) #retreive
books = soup.findAll("div",{"id":"Title"}) #process
我需要浏览发布商的整个目录。 我需要检索:
答案 0 :(得分:0)
使用XPath从这些位置提取内容