应用错误收集

已经创建了一个模型，并在给定Pyspark的情况下使用H2O XGBoost在测试数据上对后者进行了评估，是否有办法获得真正的阳性率，最好在[0.95，0.9，...]等分位数中给出？给定一个（py）spark Sparksession之上的H2o上下文h2oContext，到目前为止，以下是我的代码，我不知道计算量化的TP速率。感谢您的帮助！

estH2o = H2OXGBoostEstimator(max_depth=6, backend='cpu', eta=0.2, gamma=0.1, ntrees=60, stopping_metric='auc',
    categorical_encoding="LabelEncoder")
estH2o.train(training_frame=h2oTrain, validation_frame=h2oTest, x=featureCols, y="myTargetVariable")
testResult = estH2o.predict(h2oTest[featureColsTest])
testResultDF = testResult.as_data_frame()
testResultTarget = h2oTest['myTargetVariable'].as_data_frame()

带有H2O XGBoost和pyspark的真正分位数指标

0 个答案: