在通过livy连接的另一个实例上,我有一个AWS EMR集群和一本Jupyert笔记本,我已经在EMR主节点和节点上安装了软件包,Jupyter似乎可以识别除熊猫以外的所有软件包。
我检查了sys.executable(/ usr / bin / python3),该文件在Jypter和终端上都相同。
遇到错误:
Pandas >= 0.19.2 must be installed; however, it was not found.
Traceback (most recent call last):
File "/usr/lib/spark/python/lib/pyspark.zip/pyspark/sql/dataframe.py", line 2085, in toPandas
require_minimum_pandas_version()
File "/usr/lib/spark/python/lib/pyspark.zip/pyspark/sql/utils.py", line 129, in require_minimum_pandas_version
"it was not found." % minimum_pandas_version)
ImportError: Pandas >= 0.19.2 must be installed; however, it was not found.
有人可以帮我吗?
答案 0 :(得分:0)
它抱怨找不到至少pandas
或更高的最低要求版本的0.19.2
。您是否已检查依赖项,是否已安装了较早的版本?