Question

我想在我的C ++源代码中直接使用LIBLINEAR（http://www.csie.ntu.edu.tw/~cjlin/liblinear）。虽然在MATLAB / JAVA这样的语言中使用它似乎很简单，但在C中看起来很难;例如，读取README文件，似乎我必须以特定的链表格式转换每个数据矩阵;来自README

`x' is an array 
of pointers, each of which points to a sparse representation (array 
of feature_node) of one training vector.

For example, if we have the following training data:

LABEL       ATTR1   ATTR2   ATTR3   ATTR4   ATTR5
-----       -----   -----   -----   -----   -----
1           0       0.1     0.2     0       0
2           0       0.1     0.3    -1.2     0
1           0.4     0       0       0       0
2           0       0.1     0       1.4     0.5
3          -0.1    -0.2     0.1     1.1     0.1

and bias = 1, then the components of problem are:

l = 5
n = 6

y -> 1 2 1 2 3

x -> [ ] -> (2,0.1) (3,0.2) (6,1) (-1,?)
     [ ] -> (2,0.1) (3,0.3) (4,-1.2) (6,1) (-1,?)
     [ ] -> (1,0.4) (6,1) (-1,?)
     [ ] -> (2,0.1) (4,1.4) (5,0.5) (6,1) (-1,?)
     [ ] -> (1,-0.1) (2,-0.2) (3,0.1) (4,1.1) (5,0.1) (6,1) (-1,?)

所以，似乎我不能直接使用矩阵，相反，我必须创建feature_node的这个大链表;不存在更简单的系统或任何示例s.t.我能以更简单的方式做到这一点吗？

Answer 1

我不确定自发布此文档后文档是否已更改，但我使用了另一个问题[here]上显示的方法，其中我将问题值加载到这样的矩阵中，而不是重写我的输入文件以匹配格式。（我正在逐行解析文件并在逗号上拆分，然后存储在名为myData的二维数组中）：

struct svm_problem prob;
struct svm_node *x_space;

prob.l = problemSize;

svm_node** x = Malloc(svm_node*, prob.l);

for (int row = 0; row < prob.l; row++)
{
    svm_node* x_space = Malloc(svm_node,4);
    for (int col = 0; col < 4; col++)
    {
        x_space[col].index = col;
        x_space[col].value = myData[row][col];
    }
    x_space[4].index = -1;
    x[row] = x_space;
}

prob.x = x;

它似乎运作良好，至少对我的需求而言。我不太清楚这是否能解决你的问题（也许是时间太晚了）。

希望也可能对其他人有用。

Answer 2

对于那些被困在这里的人来说，索引应该从1开始。所以它应该是：

x_space[col].index = col + 1;

这是针对liblinear的，我不确定libsvm是否具有相同的格式。

C \ C ++中的LIBLINEAR

2 个答案: