Question

我正在尝试使用机器学习来预测四个目标数值变量的值，我对机器学习概念非常陌生，请帮我为以下提到的数据集创建模型。请建议使用哪种方法来预测多个值。我严重不知道如何开始，从哪里开始以及使用哪种算法。

这是我的输入数据集和输出数据集。

输入数据集

    // Input dataset
        {
      "width":1000,
      "height":500
      "objects": [
        {"left": 27.76, "top": 27.5, "width":671, "height": 197},
        {"left": 312.2, "top": 154.27, "width":499, "height": 452},
        {"left": 707, "top":41.3, "width":1000, "height":714}
      ]
    },
    {
      "width":1000,
      "height":500
      "objects": [
        {"left": 30.12, "top": 37.5, "width":721, "height": 217},
        {"left": 360.2, "top": 160.27, "width":530, "height": 520},
        {"left": 720, "top":60, "width":1200, "height":814}
      ]
    },
    {
      "width":1000,
      "height":500
      "objects": [
        {"left": 35.12, "top": 40.2, "width":721, "height": 217},
        {"left": 370.2, "top": 170.27, "width":540, "height": 530},
        {"left": 800, "top":90, "width":1250, "height":910}
      ]
    }

输出数据集

{
  "width":1000,
  "height":500
  "objects": [
    {"left": 40.27, "top": 30, "width":671, "height": 197},
    {"left": 370, "top": 160, "width":499, "height": 452},
    {"left": 750, "top":50.13, "width":1000, "height":714}
  ]
},
{
  "width":1000,
  "height":500
  "objects": [
    {"left": 35.15, "top": 47.3, "width":721, "height": 217},
    {"left": 410, "top": 190, "width":530, "height": 520},
    {"left": 650, "top":90, "width":1200, "height":814}
  ]
},
{
  "width":1000,
  "height":500
  "objects": [
    {"left": 45.12, "top": 45, "width":721, "height": 217},
    {"left": 390, "top": 185, "width":540, "height": 530},
    {"left": 820, "top":100, "width":1250, "height":910}
  ]
}

Answer 1

您的问题太笼统了。因此，我将自由地从理论上回答它。

假设您是在谈论数据集中变量的预测。因此，您需要做的第一件事是准备一个包含所有变量的数据集（应完成类别变量的转换），然后可以将派生变量添加到数据集中。数据集准备好后，您需要创建训练数据集和测试数据集。在训练数据集上，您可以创建模型。一旦创建了模型，您就可以使用测试数据集评估模型，以预测您感兴趣的变量（例如，汽车数据集的情况下的汽车价格）。

现在，有一些理论上的东西：基本上，预测分析具有3种类型的ML算法，即回归，分类和聚类。根据需要，您需要选择其中之一。回归是为了预测连续变量。分类是按照标签对数据集进行分类。聚类是识别未知的聚类。

在回归问题中，将有多个自变量将用于预测因变量的值（例如，根据里程数，汽车重量，高度，长度，马力等来预测汽车价格）。在这里，汽车价格是因变量，而其他所有变量本质上都是自变量。

确定您想要做什么，然后应用这些概念。

哪种算法适用于使用“机器学习”预测多个输出值

1 个答案: