如何通过解析每个子节点都有父节点的xml来创建集合

时间:2017-05-08 20:11:52

标签: java xml scala javax

我使用javax xml来解析XML,我想要的是在所有孩子中包含父。

<a>
   <b value1 = "b1", value2 = "b2">
       <c value1 = "c1", value2 = "c2" />
       <c value1 = "c3", value2 = "c4" />
   </b>
   <b value1 = "b3">
       <c value1="c5" />
       <c value1 ="c6" />
   </b>
</a>

我希望结果形式如下:

Set((b1,c1,c2), (b1,c3,c4), (b3, c5, ""), (b3, c6, "")) 

1 个答案:

答案 0 :(得分:2)

以下是我在Scala中使用XML文字实现此目的的方法。它不是最有效的方法,但它实现了您指定的目标:

scala> val xml: scala.xml.Elem = <a>
  <b value1="b1" value2="b2">
    <c value1="c1" value2="c2"/>
    <c value1="c3" value2="c4"/>
  </b>
  <b value1="b3">
    <c value1="c5"/>
    <c value1="c6"/>
  </b>
</a>

scala> val bs = xml\ "b"
bs: scala.xml.NodeSeq = <b value1="b1" value2="b2">
    <c value1="c1" value2="c2"/>
    <c value1="c3" value2="c4"/>
  </b><b value1="b3">
    <c value1="c5"/>
    <c value1="c6"/>
  </b>

scala> val gatheringNodes = bs.map { b =>
  val cs = b\"c"
  b\"@value1" -> cs.map(c => (c\"@value1", c\"@value2"))
}
gatheringNodes: scala.collection.immutable.Seq[(scala.xml.NodeSeq, scala.collection.immutable.Seq[(scala.xml.NodeSeq, scala.xml.NodeSeq)])] = List((b1,List((c1,c2), (c3,c4))), (b3,List((c5,), (c6,))))

scala> val finalOutput = gatheringNodes.flatMap { case(b, cs) =>
    cs.map { case(c1, c2) => (b, c1, c2)}
}.toSet
finalOutput: scala.collection.immutable.Set[(scala.xml.NodeSeq, scala.xml.NodeSeq, scala.xml.NodeSeq)] = Set((b1,c1,c2), (b1,c3,c4), (b3,c5,), (b3,c6,))