我正在学习Python来整理一个网络抓取技巧的项目。我无法下载足球统计数据表。我收到以下错误:
FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?
这是我的完整代码。任何帮助都非常感谢。
import pandas as pd
import requests
from bs4 import BeautifulSoup
res = requests.get("http://www.fftoday.com/stats/playerstats.php?Season=2002&GameWeek=1&PosID=10&LeagueID=26955")
soup = BeautifulSoup(res.content,'lxml')
table = soup.find_all('table')[1]
Traceback (most recent call last):
File "<ipython-input-20-e6d65d59d7e8>", line 6, in <module>
soup = BeautifulSoup(res.content,'lxml')
File "C:\Users\Unciv\Anaconda3\envs\ML27\lib\site-packages\bs4\__init__.py", line 165, in __init__
% ",".join(features))
FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?
答案 0 :(得分:3)
如果lxml
不存在,您可以使用
pip install lxml
您也可以使用不同的解析器来实现相同的效果。默认情况下, html.parser
和 html5lib
均可用。
soup = BeautifulSoup(res.content,'html.parser')
这应解决抓取网页的问题。一旦你抓了它,我认为你需要加载table[3]
,用于玩家统计表。