
How can I reduce the "zeroes" effect on a high dimensional sparse matrix?

Time: 2018-09-18 20:35:53

Tags: python data-science multilabel-classification

I'm new to Python and data science, and I'm trying to run a multilabel classification. However, I have over 2,000,000 observations and 230 categories to predict. The main problem is that my sparse matrix consists almost entirely of zeroes, so accuracy comes out extremely high even for a model that classifies everything as 0.

For example, the category "animals" appears 11,340 times, so that category alone has over 1.9 million zeroes.
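To make the size of the effect concrete, here is a minimal sketch of the arithmetic for the "animals" category. The counts come from the question (using 2,000,000 as a round figure for "over 2,000,000"); the always-zero predictor is a hypothetical baseline, not a model from the question.

```python
# Counts taken from the question; the all-zero predictor is a hypothetical baseline.
n_observations = 2_000_000
n_positive = 11_340  # rows labelled "animals"

# A predictor that always outputs 0 is correct on every negative row
# and wrong on every positive row.
true_positives = 0
true_negatives = n_observations - n_positive
false_negatives = n_positive

accuracy = (true_positives + true_negatives) / n_observations
recall = true_positives / (true_positives + false_negatives)

print(f"accuracy = {accuracy:.4%}")
print(f"recall   = {recall:.4%}")
```

Accuracy lands above 99% even though not a single "animals" row is found, which is why accuracy alone says little here.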

Is there a way to reduce this effect? I have tried binary relevance, naive Bayes, and a few other approaches, but I think the main issue is the data frame itself.
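One way to see whether the zeroes are inflating the score, rather than the models being at fault, is to compare element-wise accuracy with a per-label metric such as macro-averaged F1. The sketch below is an illustration on made-up toy data, not the question's data set; the helper names `label_f1`, `macro_f1`, and `elementwise_accuracy` are hypothetical. scikit-learn offers a comparable metric through `f1_score(..., average="macro")`.

```python
def label_f1(y_true, y_pred, label):
    """F1 for one label column; defined as 0.0 when there are no true positives."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t[label] == 1 and p[label] == 1)
    fp = sum(1 for t, p in pairs if t[label] == 0 and p[label] == 1)
    fn = sum(1 for t, p in pairs if t[label] == 1 and p[label] == 0)
    if tp == 0:
        return 0.0
    return 2 * tp / (2 * tp + fp + fn)


def macro_f1(y_true, y_pred):
    """Unweighted mean of the per-label F1 scores."""
    n_labels = len(y_true[0])
    return sum(label_f1(y_true, y_pred, j) for j in range(n_labels)) / n_labels


def elementwise_accuracy(y_true, y_pred):
    """Share of matrix cells predicted correctly, zeroes included."""
    cells = [t_j == p_j for t, p in zip(y_true, y_pred) for t_j, p_j in zip(t, p)]
    return sum(cells) / len(cells)


# Toy data (assumed for illustration): 10 rows, 3 labels, each label positive once.
y_true = [[0, 0, 0] for _ in range(10)]
y_true[0][0] = 1
y_true[1][1] = 1
y_true[2][2] = 1

# A predictor that classifies everything as 0.
all_zero = [[0, 0, 0] for _ in range(10)]

print("element-wise accuracy:", elementwise_accuracy(y_true, all_zero))
print("macro F1:", macro_f1(y_true, all_zero))
```

On this toy data the all-zero predictor gets most cells right but a macro F1 of zero, because F1 ignores true negatives and so does not reward predicting 0 everywhere.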

0 answers:

No answers


