Question

使用NLTK RegexpParser对已经标记化的块输入，转换为小写并使用词性标记。我想将chunker的输出写入文件以供以后使用。相关代码如下;

poscalltokens = nltk.pos_tag(lccalltokens)
pattern = "NP:{<DT>?<JJ>*<NN>}"
NPChunker = nltk.RegexpParser(pattern)
textchunks = NPChunker.parse(poscalltokens)
chunksfile = open ('chunksfile.txt', 'w')
chunksfile = ' '.join(textchunks)

当我从NPChunker获得输出时，代码行

chunksfile = ‘ ‘.join(textchunks)

会产生错误消息

TypeError: sequence item 0: expected string, tuple found

尝试使用‘/n’.join(textchunks)，但收到相同的错误消息

NPChunker的前10行打印输出是

Columbus/VBZ
Southern/JJ
2009/CD
(NP review/NN)
Acitivities/NNS
Feel/VBP
Comfortable/JJ
Impacted/VBN
(NP seet/NN)
Activities/NNS

感谢任何帮助，特别是有助于我理解为什么.join函数在这种情况下不起作用的见解。

如何将nltk.RegexpParser的输出写入文件

0 个答案: