Question

arg = urllib2.urlopen(argv[1]).read()
soup = BeautifulSoup(arg)
a_tags = soup.find_all('a') #so this stores a list with all the <a href="" /a> tags

我只需要那些没有链接到同一页面的人（没有符号#in href）

任何人都应该......

Answer 1

您可以将href属性与函数匹配：

for a in soup.find_all('a', href=lambda value: value.startswith('#')):
    a.extract()