Question

我一直在寻找一段时间，我似乎无法找到使用urllib2的index命令的指南。基本上该函数找到第一个单词，该单词是/或包含给定索引函数的字符，然后返回它的位置。

page_content= urllib2.urlopen(URL).read() #Gets the website
i1 = page_content.index(keyword) #finds keyword in the website

从这里i1可以用于urllib2中的不同内容，但这与此无关。

我的问题是.index没有得到与关键字匹配的所有单词，但只有第一个。如果您知道网站上只有一种类型，这没有问题，但我希望将所有这些分配到一个列表，然后我可以从中获取并使用它们。

Answer 1

用户重新启动功能

import re 
page_content= urllib2.urlopen(URL).read() #Gets the website
idall = [m.start() for m in re.finditer(keyword,page_content)]

print idall