Java将文本拆分为基于" ...",“。”,"?","!",

时间:2016-01-22 18:33:47

标签: java

如何使用正则表达式将文本拆分成句子数组?

示例文字:

This is test... Sentense number 2. Sentence number 3?

预期产出:

This is test... 

Sentense number 2.  

Sentence number 3?

1 个答案:

答案 0 :(得分:2)

您也可以使用a BreakIterator

  String s = "This is test... Sentense number 2. Sentence number 3?";
  BreakIterator bi = BreakIterator.getSentenceInstance(Locale.ENGLISH);
  bi.setText(s);
  int start = 0;
  int end = 0;
  while ((end = bi.next()) != BreakIterator.DONE) {
    System.out.println(s.substring(start, end));
    start = end;
  }