Question

I am trying to extract data from several hundred XML files. The documents are structured like this:

<document>
<nodes>
<node id=123>pages of txt</node>
<node id-=124>more example pages of txt and sub elements</node>
</nodes></document>

I only want to extract all of the <node> elements. I have been trying to use xmlstarlet:

xmlstarlet sel -t -c "/document/nodes"

The problem is that it only returns </nodes>.

All I need to extract is the following, for example:

<node id=123>pages of txt</node>
<node id-=124>more example pages of txt and sub elements</node>

Can anyone recommend a better option, tool, or approach? Many thanks.

Answer 1

Your XPath is wrong: /document/nodes selects the <nodes> wrapper, not the <node> elements inside it. Use:

xmlstarlet sel -t -c '//node'

In addition, well-formed XML requires every attribute value to be quoted:

<document>
<nodes>
<node id="123">pages of txt</node>
<node id="124">more example pages of txt and sub elements</node>
</nodes></document>
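
If xmlstarlet is not a hard requirement, the same selection can be sketched in Python with the standard library's xml.etree.ElementTree. This is only a rough sketch, assuming the attribute values have been quoted as in the corrected sample above so that the files parse at all. The function names (serialize_nodes, extract_nodes, extract_from_directory) are made up for illustration and are not part of any library.

```python
import xml.etree.ElementTree as ET
from pathlib import Path


def serialize_nodes(root):
    """Return every <node> element under root as a serialized string."""
    results = []
    # root.iter("node") walks the whole tree, much like the XPath //node
    for node in root.iter("node"):
        node.tail = None  # drop the whitespace that follows the closing tag
        results.append(ET.tostring(node, encoding="unicode"))
    return results


def extract_nodes(xml_text):
    """Parse an XML string and return its serialized <node> elements."""
    return serialize_nodes(ET.fromstring(xml_text))


def extract_from_directory(directory):
    """Yield serialized <node> elements from every *.xml file in directory."""
    # Assumption: all files sit directly in one directory and end in .xml
    for path in sorted(Path(directory).glob("*.xml")):
        yield from serialize_nodes(ET.parse(path).getroot())
```

Calling extract_from_directory on the folder that holds the files gives the elements one at a time, so they can be printed or written out as needed. A file that still has unquoted attributes such as id=123 will raise xml.etree.ElementTree.ParseError, so those need fixing first.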