Question

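I'm learning Python and putting together a project on web scraping techniques. I can't download a table of football statistics. I get the following error: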

FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?

Here is my complete code. Any help is much appreciated.

import pandas as pd
import requests
from bs4 import BeautifulSoup

res = requests.get("http://www.fftoday.com/stats/playerstats.php?Season=2002&GameWeek=1&PosID=10&LeagueID=26955")
soup = BeautifulSoup(res.content,'lxml')
table = soup.find_all('table')[1]


  Traceback (most recent call last):

  File "<ipython-input-20-e6d65d59d7e8>", line 6, in <module>
    soup = BeautifulSoup(res.content,'lxml')

  File "C:\Users\Unciv\Anaconda3\envs\ML27\lib\site-packages\bs4\__init__.py", line 165, in __init__
    % ",".join(features))

FeatureNotFound: Couldn't find a tree builder with the features you requested: lxml. Do you need to install a parser library?

Answer 1

If lxml is not installed, you can install it with:

pip install lxml

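You can also use a different parser to get the same result. html.parser ships with Python's standard library, so it is always available; html5lib is another option, but like lxml it has to be installed separately.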

soup = BeautifulSoup(res.content,'html.parser')
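
If you would rather not hard-code the parser, one option is to choose it at runtime based on what is installed. The following is only a minimal sketch: pick_parser is a hypothetical helper name (it is not part of bs4), and the preference order is an assumption.

```python
import importlib.util


def pick_parser(preferred=("lxml", "html5lib")):
    """Return the first installed parser name, else the built-in 'html.parser'.

    Hypothetical helper, not part of bs4; the preference order is an assumption.
    """
    for name in preferred:
        # find_spec returns None when the top-level module is not installed
        if importlib.util.find_spec(name) is not None:
            return name
    return "html.parser"


parser = pick_parser()
print(parser)

# Usage, assuming requests and bs4 are installed as in the question:
# soup = BeautifulSoup(res.content, pick_parser())
```

This only checks whether the parser module can be found; it does not change how BeautifulSoup itself behaves.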

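This should solve the problem of scraping the page. Once you have scraped it, I think you will need to load table[3] to get the player stats table.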
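
To check which index holds the table you want, it can help to print the first row of every table that find_all returns. A rough sketch follows, using a small made-up HTML string instead of the real page (the markup and values below are invented for illustration, so the indexes will differ from the actual site); it assumes bs4 is installed.

```python
from bs4 import BeautifulSoup

# Made-up HTML standing in for the real page; the actual site will differ.
html = """
<table><tr><td>Navigation</td></tr></table>
<table>
  <tr><th>Player</th><th>Yards</th></tr>
  <tr><td>Player A</td><td>100</td></tr>
</table>
"""

soup = BeautifulSoup(html, "html.parser")
tables = soup.find_all("table")

# Print each table's index and its first row so the right one is easy to spot.
for i, t in enumerate(tables):
    first_row = t.find("tr")
    cells = [c.get_text(strip=True) for c in first_row.find_all(["th", "td"])]
    print(i, cells)

# Collect the header cells of the second table in this sample.
header = [c.get_text(strip=True) for c in tables[1].find("tr").find_all("th")]
```

On the real page, run the same loop against the soup built from res.content and use whichever index shows the player stats header.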
