使用java计算文本文件中的句子

时间:2013-05-31 08:33:52

标签: java opennlp

下面的源代码将使用openNLP检测文本文件中的句子。但是我不知道如何计算和打印文本文件中的句子数量?

    package com.mycompany.app;

    import java.io.BufferedReader;
    import java.io.File;
    import java.io.FileInputStream;
    import java.io.FileReader;
    import java.io.IOException;
    import java.io.InputStream;

    import opennlp.tools.sentdetect.SentenceDetectorME;
    import opennlp.tools.sentdetect.SentenceModel;
    import opennlp.tools.util.InvalidFormatException;

    public class SentenceDetector {

    public static void main(String[] args) throws InvalidFormatException,IOException {
    try
    {
    File file = new File("D:/NetBeansProjects/my-app/textfile.txt");
    BufferedReader br = new BufferedReader(new FileReader(file));
    String word2 = br.readLine();

InputStream is = new FileInputStream("D:/NetBeansProjects/my-app/src/main/resources
    /en-sent.zip");
SentenceModel model = new SentenceModel(is);
SentenceDetectorME sdetector = new SentenceDetectorME(model);
String sentences[] = sdetector.sentDetect(word2);

for (String str 
    :sentences){                                                                
    System.out.println(str);
    }

br.close();  
    is.close(); 
    } 
    catch (IOException e)
    {
    // File not found
    e.printStackTrace();
    }
    }
    }

1 个答案:

答案 0 :(得分:1)

我对你的问题感到困惑。句子数是

sentences.length

你打印出来就像这样:

System.out.println("the document contains", sentences.length, "sentences.");

我错过了什么吗?