我正在研究与doc2vec相关的问题,我需要找到与特定单词相关的标签。对于ex(csv文件):
Data Label / Tags
In a future world devastated by disease, a convict is sent back sci-fi in time to gather information about the man-made virus that wiped out most of the human population on the planet. You have slipped under my skin, invaded my blood and seized my action heart. That sounds more like a poison than a person,” was all I could say. His confession had both shocked and thrilled me.
可以使用大量这样的数据来训练模型。现在,我想要的结果就像,当我输入一个特定的单词,如病毒时,它会给我相应的标签( sci-fi ),无论使用哪个单词,还要给出那些标签(动作),其中病毒这个词本身不存在,但它存在语义相关的词(如毒药,有毒)。可以从模型中轻松获取语义相关的单词。我只想列出标签。
我想知道是否可以应用某些内容而非使用关键字搜索。任何可以帮我解决这个问题的特殊方法。
由于