Question

If you run:

java -mx3g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer -props StanfordCoreNLP-spanish.properties

java -mx3g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLP -props StanfordCoreNLP-spanish.properties

The second command opens an interactive prompt in the terminal and the Spanish parser works fine, but the Server version uses the English parser instead of the Spanish one.

~/CoreNLP/stanford-corenlp-full-2015-12-09# java -mx3g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer  -props StanfordCoreNLP-spanish.properties
-- listing properties --
pos.model=edu/stanford/nlp/models/pos-tagger/sp...
ner.model=edu/stanford/nlp/models/ner/spanish.a...
ner.useSUTime=false
parse.model=edu/stanford/nlp/models/lexparser/spa...
tokenize.language=es
annotators=tokenize, ssplit, pos, ner, parse
ner.applyNumericClassifiers=false
Starting server on port 9000 with timeout of 5000 milliseconds.
StanfordCoreNLPServer listening at /0:0:0:0:0:0:0:0:9000
[/0:0:0:0:0:0:0:1:49579] API call w/annotators tokenize,ssplit,parse
El presidente Julio Sanches formo ungrupo de ministros a quienes llamo los cinco economistas magnificos.
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator tokenize
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator ssplit
[pool-1-thread-1] INFO edu.stanford.nlp.pipeline.StanfordCoreNLP - Adding annotator parse
[pool-1-thread-1] INFO edu.stanford.nlp.parser.common.ParserGrammar - Loading parser from serialized file edu/stanford/nlp/models/lexparser/englishPCFG.ser.gz ...
done [0.4 sec].

For the client I use:

wget --post-data 'El presidente Julio Sanches formo ungrupo de ministros a quienes llamo los cinco economicistas magnificos.' 'localhost:9000/?properties={"tokenize.whitespace":"true","annotators":"parse","outputFormat":"text"}' -O -

I need to run StanfordCoreNLPServer with the Spanish model files. Do I need a special parameter?

Answer 1

The solution is in "Running Stanford corenlp server with French models".

But I simply ran it as follows:

Server:

java -mx4g -cp "*" edu.stanford.nlp.pipeline.StanfordCoreNLPServer

Client:

wget --post-data 'el perro corre detras del carro.' --header="Content-Type: text/plain; charset=UTF-8" 'localhost:9000/?properties={"annotators":"tokenize,ssplit,pos,parse","parse.model":"edu/stanford/nlp/models/lexparser/spanishPCFG.ser.gz","pos.model":"edu/stanford/nlp/models/pos-tagger/spanish/spanish.tagger","tokenize.language":"fr","outputFormat":"text"}' -O -

And it works.
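The one-line request is easy to mis-quote, so here is a minimal shell sketch that keeps the per-request properties in a variable. The names PROPS, URL, and parse_spanish are illustrative and not part of CoreNLP. It assumes a server is already listening on localhost:9000, and it sets tokenize.language to es to match the properties listing in the question, which is an assumption rather than something the answer confirms.

```shell
#!/bin/sh
# Sketch only: PROPS, URL and parse_spanish are hypothetical names.
# The model paths are the ones quoted in this answer; tokenize.language=es
# is an assumption taken from the properties listing in the question.
PROPS='{"annotators":"tokenize,ssplit,pos,parse","parse.model":"edu/stanford/nlp/models/lexparser/spanishPCFG.ser.gz","pos.model":"edu/stanford/nlp/models/pos-tagger/spanish/spanish.tagger","tokenize.language":"es","outputFormat":"text"}'

# Single quotes above keep the JSON double quotes intact inside the URL.
URL="localhost:9000/?properties=${PROPS}"

# Post one sentence (first argument) to the server and print the response.
parse_spanish() {
  wget --post-data "$1" \
       --header="Content-Type: text/plain; charset=UTF-8" \
       "$URL" -O -
}

# Example call (needs a running server):
# parse_spanish 'el perro corre detras del carro.'
```

If the request took effect, the server log should show the Spanish parser file being loaded instead of englishPCFG.ser.gz.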

Answer 2

In the stanford-corenlp-full-2016-10-31 release you can use the following configuration, which seems more convenient (and easier :)):

wget --post-data 'el perro corre detras del carro.' --header="Content-Type: text/plain; charset=UTF-8" 'localhost:9000/?properties={"annotators": "tokenize,ssplit,pos,parse", "pipelineLanguage": "es","outputFormat": "text"}' -O -
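If you send many requests, the properties string can be built from a language code. This is only a sketch: build_props and parse_text are hypothetical helper names, the server is assumed to be running on localhost:9000, and "es" is the only pipelineLanguage value taken from this answer. Any other code would need to be checked against the models available in your CoreNLP release.

```shell
#!/bin/sh
# Sketch only: build_props and parse_text are hypothetical helpers.

# Print the JSON properties for a given pipelineLanguage code.
build_props() {
  printf '{"annotators":"tokenize,ssplit,pos,parse","pipelineLanguage":"%s","outputFormat":"text"}' "$1"
}

# Post text ($2) to the server using the language code in $1.
parse_text() {
  wget --post-data "$2" \
       --header="Content-Type: text/plain; charset=UTF-8" \
       "localhost:9000/?properties=$(build_props "$1")" -O -
}

# Example call (needs a running server):
# parse_text es 'el perro corre detras del carro.'
```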

StanfordCoreNLP vs. StanfordCoreNLPServer
