我有一个问题,我想得到这样的东西:
<text>
<sentence>
<word>a</word>
<word>had</word>
<word>lamb</word>
<word>little</word>
<word>Mary</word>
</sentence>
<sentence>
<word>Aesop</word>
<word>and</word>
<word>called</word>
<word>came</word>
<word>for</word>
<word>Peter</word>
<word>the</word>
<word>wolf</word>
</sentence>
<sentence>
<word>Cinderella</word>
<word>likes</word>
<word>shoes</word>
</sentence>
但我只得到这个:
<text>
<sentence>
<word>a</word>
<word>had</word>
<word>lamb</word>
<word>little</word>
<word>Mary</word>
</sentence></text>
示例文字
“玛丽有一只小羊羔。 彼得叫狼,而伊索来了。 灰姑娘喜欢鞋......“我的班级
public class StaxWriteXmlTest {
/**
* @param args
* @throws FileNotFoundException
* @throws XMLStreamException
*/
public static void main(String[] args) throws FileNotFoundException,
XMLStreamException {
String[] word = initItems();
String itemXmlFile = "D:\\items.xml";
// xml event writer with output stream
XMLOutputFactory xmlOutFactory = XMLOutputFactory.newInstance();
OutputStream outputStream = new FileOutputStream(itemXmlFile);
XMLEventWriter eventWriter = xmlOutFactory
.createXMLEventWriter(outputStream);
ByteArrayOutputStream outStream = new ByteArrayOutputStream();
XMLEventWriter xmlWrite = (XMLOutputFactory.newInstance())
.createXMLEventWriter(outStream);
XMLEventFactory eventFactory = XMLEventFactory.newInstance();
XMLEvent end = createNewLine(eventFactory);
XMLEvent tab = createTab(eventFactory);
// Create start tag
StartDocument startDocument = eventFactory.createStartDocument();
EndDocument endDocument = eventFactory.createEndDocument();
eventWriter.add(startDocument);
// create config open tag
eventWriter.add(end);
StartElement configStartElement = eventFactory.createStartElement("",
"", "text");
eventWriter.add(configStartElement);
eventWriter.add(end);
eventWriter.add(tab);
StartElement itemStartElement = eventFactory.createStartElement("", "",
"sentence");
eventWriter.add(itemStartElement);
eventWriter.add(end);
eventWriter.add(tab);
// add words
for (String words : word) {
eventWriter.add(tab);
createItemNode(eventFactory, eventWriter, "word", words);
eventWriter.add(tab);
}
// eventWriter.add(tab);
EndElement itemEndElement = eventFactory.createEndElement("", "",
"sentence");
eventWriter.add(itemEndElement);
eventWriter.add(end);
EndElement configEndElement = eventFactory.createEndElement("", "",
"text");
eventWriter.add(configEndElement);
eventWriter.add(end);
eventWriter.add(endDocument);
eventWriter.close();
}
public static void createItemNode(XMLEventFactory eventFactory,
XMLEventWriter eventWriter, String elementName, String value)
throws XMLStreamException {
XMLEvent end = eventFactory.createDTD("\n");
StartElement startElement = eventFactory.createStartElement("", "",
elementName);
eventWriter.add(startElement);
Characters characters = eventFactory.createCharacters(value);
eventWriter.add(characters);
EndElement endElement = eventFactory.createEndElement("", "",
elementName);
eventWriter.add(endElement);
eventWriter.add(end);
}
public static XMLEvent createTab(XMLEventFactory eventFactory) {
return eventFactory.createDTD("\t");
}
public static XMLEvent createNewLine(XMLEventFactory eventFactory) {
return eventFactory.createDTD("\n");
}
public static String[] initItems() {
FileReader fr = null;
try {
fr = new FileReader("text.txt");
} catch (FileNotFoundException e1) {
e1.printStackTrace();
}
BufferedReader inputText = new BufferedReader(fr);
String text = "", newText = "";
String allTogether = "";
try {
while ((text = inputText.readLine()) != null) {
newText += text.replaceAll("\\s+", " ").replaceAll(" ,", ",")
.replaceAll(" \\.", ".").replaceAll("\\..", ".");
allTogether = newText.replaceAll("\\s+", " ");
String[] splitText = allTogether.split(".");
}
} catch (IOException e) {
e.printStackTrace();
}
String[] splitText = allTogether.split("[.]");
for (int i = 0; i < splitText.length; i++) {
// System.out.println(splitText[i]);
String[] nexSplit = splitText[i].split("[ \t]");
for (int x = 0; x < nexSplit.length; x++) {
Arrays.sort(nexSplit, String.CASE_INSENSITIVE_ORDER);
// System.out.println(nexSplit[x]);
return nexSplit;
}
}
return splitText;
}}
答案 0 :(得分:0)
我还没有解析你的所有代码,但这部分看起来很可疑:
String[] splitText = allTogether.split("[.]");
for (int i = 0; i < splitText.length; i++) {
// System.out.println(splitText[i]);
String[] nexSplit = splitText[i].split("[ \t]");
for (int x = 0; x < nexSplit.length; x++) {
Arrays.sort(nexSplit, String.CASE_INSENSITIVE_ORDER);
// System.out.println(nexSplit[x]);
return nexSplit;
}
}
return splitText;
你真的想在一次迭代后返回那个内循环吗?我对此表示怀疑。
另外,请务必使用initItems()
方法关闭阅读器。