应用错误收集

我正在尝试使用Stanford NLP Parser 将文档解析为一组句子。它读取包含多个句子的文件名作为类的参数，DocumentPreprocessor 然后输出解析的句子。但是，我想按字符串给出输入，其中包含一组句子，但整个文件包含一堆句子。

我能做到的任何方式吗？

当前程序如下所示（https://nlp.stanford.edu/software/tokenizer.shtml）

public class TokenizerDemo {

  public static void main(String[] args) throws IOException {
    for (String arg : args) {
      // option #1: By sentence.
      DocumentPreprocessor dp = new DocumentPreprocessor(arg);
      for (List<HasWord> sentence : dp) {
        System.out.println(sentence);
      }
    }
  }
}

有什么方法可以为Stanford-parser DocumentPreprocessor类提供字符串输入吗？

0 个答案: