Question

我正在尝试用StanfordPOSTagger标记法语句子。我尝试了一个在this web site

上看到的示例

我的问题是python在尝试标记句子时引发错误。

我一直在寻找解决类似问题的解决方案，但找不到解决我问题的解决方案。有些人或多或少有相同的错误，但没有相同的代码，因此我无法将解决方案应用于我的特定情况。

from nltk.stem.snowball import FrenchStemmer #import the French stemming library
from nltk.corpus import stopwords #import stopwords from nltk corpus
import nltk #import the Natural Language Processing Kit
import re #import the regular expressions library; will be used to strip punctuation
from collections import Counter #allows for counting the number of occurences in a list
import pandas as pd
import os #import os module
from french_lefff_lemmatizer.french_lefff_lemmatizer import FrenchLefffLemmatizer

from nltk.tag.stanford import StanfordPOSTagger
root_path="/YolannGauvin/stanford-postagger-full-2017-06-09/"
pos_tagger = StanfordPOSTagger(root_path + "models/french.tagger", root_path + "stanford-postagger.jar",encoding='utf8') #instance de la classe StanfordPOSTagger en UTF-8
def pos_tag(sentence):
    tokens = nltk.word_tokenize(sentence) #je transforme la phrase en tokens => si vous avez un texte avec plusieurs phrases, passez d'abord par nltk pour récupérer les phrases
    tags = pos_tagger.tag(tokens) #lance le tagging
    return tags

print(pos_tag("Bonjour j'aime les fruits"))

在这里，Python引发的错误：

OSError：Java命令失败： ['D：\ YolannGauvin \ jdk1.8.0_212 / jdk-8u212-windows-x64.exe'， '-mx1000m'，'-cp'， '/YolannGauvin/stanford-postagger-full-2017-06-09/stanford-postagger.jar'， 'edu.stanford.nlp.tagger.maxent.MaxentTagger'，'-model'， '/YolannGauvin/stanford-postagger-full-2017-06-09/models/french.tagger'， '-textFile'，'C：\ Users \ dsi \ AppData \ Local \ Temp \ tmphd6v_gqp'， '-tokenize'，'false'，'-outputFormatOptions'，'keepEmptySentences'， '-encoding'，'utf8']

如何正确使用stanfordPOSTagger的标签功能？

0 个答案: