如何从XML获取子节点列表

时间:2012-06-17 19:55:18

标签: powershell

给出以下示例脚本(在我的硬盘上保存为 sample.txt ):

DECLARE @xVar XML
SET @xVar = 
  '<bookstore>
      <book>
        <title>Writing Secure Code</title>
        <author>
          <first-name>Michael</first-name>
          <last-name>Howard</last-name>
        </author>
        <author>
          <first-name>David</first-name>
          <last-name>LeBlanc</last-name>
        </author>
        <price>39.99</price>
      </book>
      <book>
        <title>Old Man and the sea</title>
        <author>
          <first-name>Earnest</first-name>
          <last-name>Hemmingway</last-name>
        </author>
        <price> 9.99</price>
      </book>
    </bookstore>
    '

SELECT nref.value('first-name[1]', 'nvarchar(50)') FirstName,
       nref.value('last-name[1]', 'nvarchar(50)') LastName
FROM   @xVar.nodes('//author') AS R(nref)
WHERE  nref.exist('.[first-name != "David"]') = 1

我想提取XML并确定是否已将任何子节点添加到节点 book 。对于此用例,假设 author 是新节点。

我编写了一个可行的脚本,但它看起来非常低效:

Set-StrictMode -Version Latest

cls
#The list of fields that I want to find
[array]$expected_fields = "title", "price"

#extract the xml from sample.txt and put it in an xml variable
[string]$file_script = Get-Content "C:\PowerShellScripts\sample.txt"
[int]$start_pos = $file_script.IndexOf("'") + 1
[int]$end_pos = $file_script.SubString($start_pos + 1).IndexOf("'") + 1
[xml]$xml_result = $file_script.SubString($start_pos,$end_pos)

#NOTE:  THIS IS THE PART THAT FEELS WRONG
#Convert the xml snipput into CSV file and then get the headers (which is the only thing I want)
$export_file_name = "C:\PowerShellScripts\test.csv"
Select-Xml 'child::bookstore/book' $xml_result  | Select-Object -expand Node | Export-Csv $export_file_name -NoTypeInformation -Delimiter:"`t" -Encoding:UTF8 
[string]$field_names = Get-Content $export_file_name | Select-Object -first 1
Remove-Item "C:\Users\Jennifer\Google Drive\PowerShellScripts\test.csv"
[array]$found_fields = $field_names.Replace("""","").Split("`t")

#report new fields
foreach ($specific_field in $found_fields) {
    if ($expected_fields -notcontains $specific_field)
    {
        Write-Host "New field found:" $specific_field
    }
}

是否有更好的方法来填充* $ found_fields *而不是创建CSV文件,将第一行存储在变量中然后删除CSV文件?

1 个答案:

答案 0 :(得分:3)

尝试将您的-Expand节点更改为名称(和Where-Object以排除标题

$xml_result.SelectNodes("bookstore/book/*") | Select-Object -Expand Name | Where-Object { ($_ -ne "title") -and ($_ -ne "price") }

这将为您提供book的任意意外子节点。