BeautifulSoup AttributeError when getting an attribute

Time: 2014-02-08 05:20:28

Tags: python beautifulsoup

I get the following error from the script below: "linkdetail = link.get('href') AttributeError: 'NoneType' object has no attribute 'get'"

If I run it with the commented-out lines instead of the last two lines, it grabs the correct element.

from bs4 import BeautifulSoup
import sys
import os.path

for i in os.listdir(os.getcwd()):
    soup = BeautifulSoup(open(i))
    link_list = []
    soup.prettify().encode('UTF-8')
    link = soup.find(class_="attribute-url")
    ## link_list.append(link)
    ## print link_list
    linkdetail = link.get('href')
    print linkdetail

Any help is appreciated; I'm stuck.

2 Answers:

Answer 0: (score: 1)

Try this:

    links = soup.findAll('a')
    for link in links:
        linkdetail = link.get('href')
        print linkdetail
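
For context on why the loop above does not raise: `soup.find(...)` returns `None` when nothing matches, and calling `.get` on `None` produces the AttributeError from the question, whereas `findAll` returns an empty list in that case. Note that `findAll('a')` matches every anchor, not only those with the `attribute-url` class. If only the first matching element is wanted, a `None` check is enough. Below is a minimal sketch of that idea; the helper name `first_href` and the two inline HTML strings are made up for illustration and are not from the question. The single-argument `print(...)` calls work on both Python 2 and Python 3.

```python
from bs4 import BeautifulSoup

# Two hypothetical documents: one contains the target element, one does not.
WITH_LINK = '<a class="attribute-url" href="http://example.com/item">item</a>'
WITHOUT_LINK = '<p>no matching element here</p>'


def first_href(html, css_class="attribute-url"):
    """Return the href of the first element with css_class, or None."""
    # "html.parser" is the parser bundled with Python; the question does
    # not name a parser, so this choice is an assumption.
    soup = BeautifulSoup(html, "html.parser")
    link = soup.find(class_=css_class)
    if link is None:
        # find() returns None when nothing matches; calling .get() on it
        # would raise the AttributeError from the question.
        return None
    return link.get("href")


print(first_href(WITH_LINK))
print(first_href(WITHOUT_LINK))
```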

Answer 1: (score: 0)

Try this:

    links = soup.findAll(class_="attribute-url")
    for link in links:
        print link.text
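
Keep in mind that `link.text` prints the element's text rather than its `href`. Also, one possible cause of the original error (an assumption, since the question does not show the directory contents) is that `os.listdir(os.getcwd())` returns every entry in the directory, including the script itself and any subdirectories, and those contain no `attribute-url` element. The sketch below filters the files before parsing and collects the `href` values per file. The function name `collect_hrefs`, the `.html` suffix, and the `html.parser` choice are all illustrative assumptions.

```python
import os

from bs4 import BeautifulSoup


def collect_hrefs(directory, css_class="attribute-url", suffix=".html"):
    """Map each matching file name to the list of hrefs found in it."""
    results = {}
    for name in sorted(os.listdir(directory)):
        path = os.path.join(directory, name)
        # Skip subdirectories and non-HTML files such as the script itself.
        if not os.path.isfile(path) or not name.endswith(suffix):
            continue
        with open(path, "rb") as handle:
            soup = BeautifulSoup(handle, "html.parser")
        # find_all returns an empty list when nothing matches, so a file
        # without the element yields no hrefs instead of raising.
        # href=True keeps only elements that actually have an href attribute.
        links = soup.find_all(class_=css_class, href=True)
        results[name] = [link["href"] for link in links]
    return results
```

Calling `collect_hrefs(os.getcwd())` would then return a dictionary keyed by file name, with an empty list for any HTML file that has no matching element.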