Question

我试图了解VADER在句子分析中的作用。为什么此处将超参数Alpha设置为15？我知道当放开它时它是不稳定的，但是为什么15呢？

def normalize(score, alpha=15):
"""
Normalize the score to be between -1 and 1 using an alpha that
approximates the max expected value
"""
norm_score = score/math.sqrt((score*score) + alpha)
return norm_score

Answer 1

Vader的归一化公式为 $\frac{x}{\sqrt{x^2 + \alpha}}$ ，它是的公式

我已经从这里阅读了有关Vader的研究论文：http://comp.social.gatech.edu/papers/icwsm14.vader.hutto.pdf

不幸的是，我找不到任何理由选择这样的公式和15作为alpha值，但是实验和图形显示，随着x的增长（即情感分数的总和），该值变得越来越接近- 1或1表示随着单词数量的增加，得分趋向于-1或1。这意味着Vader与短文档或推文相比，在长文档上的效果更好。

为什么在NLTK-VADER中将alpha设置为15？

1 个答案: