Polynominal error in Rapidminer when doing n-gram classification

时间:2015-06-06 18:24:27

标签: svm libsvm n-gram rapidminer concept

I am trying to classify different concepts in a text using n-gram. My data tyically exists of six columns:

  1. The word that needs classification
  2. The classification
  3. First word on the left of 1)
  4. Second word on the left of 1)
  5. First word on the right of 1)
  6. Second word on the right of 1)

When I try to use a SVM in Rapidminer, I get the error that it can not handle polynominal values. I know that this can be done because I have read it in different papers. I set the second column to 'label' and have tried to set the rest to 'text' or 'real', but it seems to have no effect. What am I doing wrong?

2 个答案:

答案 0 :(得分:0)

一种方法可以是创建名称等于单词和值的属性,该属性等于与感兴趣的单词的距离。当然,所有可能的单词都需要表示为属性,因此输入数据会很大。

答案 1 :(得分:0)

您必须使用Support Vector Machine (LibSVM)运营商。 与仅支持两个类问题的经典SVM相比,LibSVM实现(http://www.csie.ntu.edu.tw/~cjlin/papers/libsvm.pdf)支持多类分类和回归。