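I get the following error in the script below: "linkdetail = link.get('href') AttributeError: 'NoneType' object has no attribute 'get'".
If I run it with the commented-out lines instead of the last two lines, it picks up the correct element.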
from bs4 import BeautifulSoup
import sys
import os.path
for i in os.listdir(os.getcwd()):
    soup = BeautifulSoup(open(i))
    link_list = []
    soup.prettify().encode('UTF-8')
    link = soup.find(class_="attribute-url")
    ## link_list.append(link)
    ## print link_list
    linkdetail = link.get('href')
    print linkdetail
Any help is appreciated; I'm stuck.
Answer 0 (score: 1)
Try this:
links = soup.findAll('a')
for link in links:
    linkdetail = link.get('href')
    print linkdetail
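For context, soup.find returns None when nothing in the document matches, and calling .get on None raises exactly the AttributeError quoted in the question. One possible cause here is that some file in the directory (the script itself, for example) contains no element with that class. Below is a minimal sketch of guarding against that case. It uses inline sample HTML instead of files, the helper name extract_href and the sample strings are made up for illustration, and it is written with print() so that it runs on Python 3.

```python
from bs4 import BeautifulSoup


def extract_href(html):
    """Return the href of the first element with class "attribute-url", or None.

    Hypothetical helper, not part of the original script.
    """
    soup = BeautifulSoup(html, "html.parser")
    link = soup.find(class_="attribute-url")
    if link is None:
        # No matching element in this document: skip it instead of crashing.
        return None
    return link.get("href")


# Illustrative sample inputs (assumptions, not taken from the question's files).
with_link = '<a class="attribute-url" href="http://example.com/page">page</a>'
without_link = "<p>no links here</p>"

print(extract_href(with_link))
print(extract_href(without_link))
```

In the original loop, the same check would amount to testing `if link is not None:` before calling link.get('href').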
Answer 1 (score: 0)
Try this:
links = soup.findAll(class_="attribute-url")
for link in links:
    print link.text
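Because findAll (named find_all in current Beautiful Soup) returns an empty list when nothing matches, the loop is simply skipped for files without the element. If the goal is still the href rather than the text, the class may sit on a wrapper around the anchor and not on the anchor itself. That is an assumption about the page markup, which the question does not show. A rough sketch that handles both cases, with a hypothetical helper collect_hrefs and made-up sample HTML, written for Python 3:

```python
from bs4 import BeautifulSoup


def collect_hrefs(html, css_class="attribute-url"):
    """Collect href values from every element carrying css_class.

    Hypothetical helper: the class may be on the <a> tag itself
    or on a wrapper element that contains an <a> tag.
    """
    soup = BeautifulSoup(html, "html.parser")
    hrefs = []
    for element in soup.find_all(class_=css_class):
        # Use the element directly if it is an anchor, else look inside it.
        anchor = element if element.name == "a" else element.find("a")
        if anchor is not None and anchor.get("href"):
            hrefs.append(anchor["href"])
    return hrefs


# Illustrative markup (an assumption, not the asker's real pages).
sample = (
    '<div class="attribute-url"><a href="/first">First</a></div>'
    '<a class="attribute-url" href="/second">Second</a>'
    '<span class="attribute-url">no anchor</span>'
)

print(collect_hrefs(sample))
```

Elements that carry the class but contain no anchor, like the span in the sample, are skipped rather than raising an error.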