Question

我使用篮球参考数据创建了这个DataFrame，并获得了每个特征的均值。 DataFrame Data 无论我用来训练线性模型的哪一列，我的R2得分都接近于0，而且预测很糟糕。

std::vector<std::string> parse(const std::string& str) {
    std::vector<std::string> result;
    bool masked = false;
    std::vector<char> current;        // stores chars of the current item
    for (const char c : str) {
        if (masked || (c != ',')) {
            current.push_back(c);
            switch (c) {
            case '<': masked = true; break;
            case '>': masked = false;
            }
        }
        else {            // unmasked comma: store item and prepare next
            current.push_back('\0');  // a terminating null for the vector data
            result.push_back(std::string(&current[0]));
            current.clear();
        }
    }
    // do not forget the last item...
    current.push_back('\0');
    result.push_back(std::string(&current[0]));
    return result;
}

我认为这是我创建或使用DataFrame的方式，但我不知道如何解决该问题。希望您能帮助我，谢谢。

Answer 1

也许这不是机器学习的正确问题。您确定数据框中存在的一个或多个统计信息与BPM得分之间是否存在某种关系。也许尝试使用像决策树这样的多类分类算法，使用所有功能，将BPM更改为分类得分后，例如0到-2之间是替补球员，4到6之间是全明星考虑。我对篮球一无所知，我使用了this链接来了解事物。

线性回归模型的问题

1 个答案: