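We have installed Cloudera CDH 6.2 on a RHEL 7 machine that ships with Python 2.7 by default. We submit PySpark jobs with spark-submit, using Python 3.7 from a virtual environment. Local runs with --master local and --deploy-mode client work fine. However, --master yarn with --deploy-mode cluster fails.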
This command:
spark-submit --master yarn --deploy-mode cluster --conf spark.yarn.appMasterEnv.PYSPARK_PYTHON=/home/user/R1_I5/bin/python --conf spark.yarn.appMasterEnv.SPARK_HOME=/opt/cloudera/parcels/CDH/lib/spark --conf spark.executorEnv.SPARK_HOME=/opt/cloudera/parcels/CDH/lib/spark sample.py
fails with the following two errors.
Case 1 error log (--deploy-mode cluster):
Cannot run program "/home/user/R1_I5/bin/python": error=13, Permission denied
Detailed log: https://drive.google.com/file/d/1J7HLNGABnStJ91ISHFBMdNe5OLEUQZ6B/view
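To narrow down the error=13 above, it may help to check which part of the interpreter path is not accessible to other users. In cluster mode the driver runs inside a YARN container on a cluster node, usually under a different account than the one that owns /home/user, so a home directory without the "others" execute bit is a common cause of this error. The sketch below is only a rough diagnostic under that assumption: the function name find_permission_problems is hypothetical, it looks only at the o+x bit, and it ignores ACLs, group membership and SELinux. It would need to be run on the node where the container was launched.

```python
import stat
from pathlib import Path


def find_permission_problems(path):
    """Return (component, problem) pairs for every part of `path`
    that lacks the "others" execute bit (o+x).

    For directories o+x means "may traverse", for the final file it
    means "may execute". This is an approximation of what another
    user (for example the YARN container user) would be allowed to do.
    """
    target = Path(path)
    # Walk from the filesystem root down to the target itself.
    components = list(target.parents)[::-1] + [target]
    problems = []
    for component in components:
        try:
            mode = component.stat().st_mode
        except OSError as exc:
            # Missing path or no permission to stat: stop here,
            # deeper components cannot be inspected reliably.
            problems.append((str(component), "cannot stat: %s" % exc.strerror))
            break
        if not mode & stat.S_IXOTH:
            problems.append((str(component), "missing o+x"))
    return problems


if __name__ == "__main__":
    # Interpreter path taken from the spark-submit command in the question.
    for component, problem in find_permission_problems("/home/user/R1_I5/bin/python"):
        print(component, "->", problem)
```

If a component such as /home/user is reported, either the permissions need to be relaxed or the virtual environment needs to live somewhere every node and the container user can read, at the same path on all nodes.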
Case 2 error log (--master yarn):
The following two lines repeat indefinitely and the program never terminates:
INFO yarn.Client: Application report for application_1594339922772_0012 (state: ACCEPTED)
INFO yarn.SparkRackResolver: Got an error when resolving hostNames. Falling back to /default-rack for all
Answer 0 (score: 0)