从nltk或其他NLP库中的副词中获取形容词

时间:2013-06-21 22:15:52

标签: python nlp nltk

有没有办法在NLTK或其他python库中获得与给定副词相对应的形容词。 例如,对于副词“非常”,我需要“可怕”。 感谢。

2 个答案:

答案 0 :(得分:7)

wordnet中存在将adjectives连接到adverbs的关系,反之亦然。

>>> from itertools import chain
>>> from nltk.corpus import wordnet as wn
>>> from difflib import get_close_matches as gcm
>>> possible_adjectives = [k.name for k in chain(*[j.pertainyms() for j in chain(*[i.lemmas for i in wn.synsets('terribly')])])]
['terrible', 'atrocious', 'awful', 'rotten']
>>> gcm('terribly',possible_adjectives)
['terrible']

计算possible_adjective的更易读的方法如下:

possible_adj = []
for ss in wn.synsets('terribly'):
  for lemmas in ss.lemmas: # all possible lemmas.
    for lemma in lemmas: 
      for ps in lemma.pertainyms(): # all possible pertainyms.
        for p in ps:
          for ln in p.name: # all possible lemma names.
            possible_adj.append(ln)

编辑:在较新版本的NLTK中:

possible_adj = []
for ss in wn.synsets('terribly'):
  for lemmas in ss.lemmas(): # all possible lemmas
      for ps in lemmas.pertainyms(): # all possible pertainyms
          possible_adj.append(ps.name())

答案 1 :(得分:1)

正如MKoosej所提到的,nltk的引理不再是一种属性,而是一种方法。我也做了一点简化以获得最可能的单词。希望其他人也可以使用它:

wordtoinv = 'unduly'
s = []
winner = ""
for ss in wn.synsets(wordtoinv):
    for lemmas in ss.lemmas(): # all possible lemmas.
        s.append(lemmas)

for pers in s:
    posword = pers.pertainyms()[0].name()
    if posword[0:3] == wordtoinv[0:3]:
        winner = posword
        break

print winner # undue