我是python js的新手我试图运行github上可用的项目
但是当我尝试运行时会出现以下错误
Traceback (most recent call last): File "main.py", line 81, in
<module>
crawler_machine() File "main.py", line 76, in crawler_machine
driver.close() AttributeError: 'NoneType' object has no attribute 'close'
Exception AttributeError: "'Service' object has no attribute
'process'" in <bound method Service.__del__ of
<selenium.webdriver.phantomjs.service.Service object at 0x107d05790>>
ignored
我遵循该项目中提供的所有指示
,代码是 main.py
from selenium import webdriver
from selenium.webdriver.common.keys import Keys
from config import URL_FORMAT, SCROLL_TIMES, PHANTOM_JS_PATH, WEBDRIVER
from parser import FParser
from optparse import OptionParser
import signal
import sys
import time
driver = None
def close(signal, frame):
'''
When you press Ctrl-C the browser closes
'''
global driver
print('You pressed Ctrl+C!')
driver.close()
signal.pause()
sys.exit(0)
def scroll_down(driver):
'''
This helps you to scroll the search results page to load more results
'''
for i in range(SCROLL_TIMES):
driver.execute_script("window.scrollTo(0, document.body.scrollHeight);")
time.sleep(0.3)
def get_page(driver, url):
driver.get(url)
return driver.page_source
def press_the_button_2_crawl(driver, keyword):
"""
Main function handles everything
"""
html_data = get_page(driver, URL_FORMAT % keyword)
scroll_down(driver)
parser = FParser(html_data)
parser.store() #store into db
time.sleep(2)
def crawler_machine(search_word=None):
global driver
optparser = OptionParser()
optparser.add_option("-s", "--search",
type="string", dest="search")
(options, args) = optparser.parse_args()
keyword = options.search or search_word
print "Keyword",keyword
try:
try:
#headless phantomjs for 32bit unix based machines
driver = webdriver.PhantomJS(executable_path=PHANTOM_JS_PATH)
except:
#firefox
driver = webdriver.Firefox()
signal.signal(signal.SIGINT, close)
press_the_button_2_crawl(driver, keyword)
finally:
driver.close() #line 76=================================
if __name__ == '__main__':
crawler_machine() #line 81=================================
答案 0 :(得分:2)
我的猜测是,两次尝试创建驱动程序 - PhantomJS和FireFox - 都会失败,这意味着当您到达finally块时driver
仍然是None
。您可以通过添加显式检查来确认:
finally:
if driver:
driver.close()
else:
print "Could not create driver"
至于为什么驱动程序创建失败,最可能的解释是安装问题。您可以通过在Python环境中运行一个简单的示例来测试它:
>>> from selenium.webdriver import Firefox
>>> d = Firefox()
>>> d.get('http://stackoverflow.com')
如果这不打开Firefox并导航到Stack Overflow的首页,请查看您的操作系统的Selenium文档。
答案 1 :(得分:1)
driver
和PhantomJS
的创建都失败,则 Firefox
尚未初始化。
在结束前添加支票:
if driver:
driver.close()
答案 2 :(得分:1)
如果crawler_machine中的try ... except子句都失败,则驱动程序的全局值将为None。