我只是安装了Spark 2.2.1独立群集和zeppelin 0.7.3。我尝试了一些逻辑回归代码,如下所示:
lr = LogisticRegression(maxIter=10000, regParam=0.2)
model1 = lr.fit(training)
trainingSummary = model1.summary
objectiveHistory = trainingSummary.objectiveHistory
for objective in objectiveHistory:
print(objective)
trainingSummary.roc.show()
print("areaUnderROC(training): " + str(trainingSummary.areaUnderROC))
prediction = model1.transform(test)
prediction_train = model1.transform(training)
evaluator=BinaryClassificationEvaluator().setLabelCol("label").setRawPredictionCol("probability").setMetricName("areaUnderROC")
pred_test=prediction.select("label","probability","rawPrediction")
pred_train=prediction_train.select("label","probability","rawPrediction")
ROC_test=evaluator.evaluate(pred_test)
ROC_train=evaluator.evaluate(pred_train)
print("areaUnderROC(training): " + str(ROC_train))
print("areaUnderROC(testing): " + str(ROC_test))
并收到以下错误。谷歌搜索并发现此问题在阅读JSON时在0.7.1中得到修复。
Py4JJavaError: An error occurred while calling z:org.apache.spark.api.python.PythonRDD.runJob.
: org.apache.spark.SparkException: Job aborted due to stage failure: Task 0 in stage 1.0 failed 4 times, most recent failure: Lost task 0.3 in stage 1.0 (TID 4, 192.168.0.17, executor 0): java.io.InvalidClassException: org.apache.commons.lang3.time.FastDateParser; local class incompatible: stream classdesc serialVersionUID = 2, local class serialVersionUID = 3
at java.io.ObjectStreamClass.initNonProxy(ObjectStreamClass.java:687)
at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1885)
at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1751)
at
...
Caused by: java.io.InvalidClassException:
org.apache.commons.lang3.time.FastDateParser; local class incompatible:
stream classdesc serialVersionUID = 2, local class serialVersionUID = 3
at java.io.ObjectStreamClass.initNonProxy(ObjectStreamClass.java:687)
at java.io.ObjectInputStream.readNonProxyDesc(ObjectInputStream.java:1885)
at java.io.ObjectInputStream.readClassDesc(ObjectInputStream.java:1751)
at java.io.ObjectInputStream.readOrdinaryObject(ObjectInputStream.java:2042)
....