请你帮我找到一个处理多字段列的csv文件的解决方案
文件 1.csv
Teams,Category,Members
Team1,A,Smith;Johnson
Team1,C,Jones;Miller;Garcia
Team3,E,Wilson;Martinez
Team4,A,Martin;Jackson;White;Williams
File2.csv
Teams,Category,Members
Team1,A,Smith;Johnson
Team2,C,Jones;Miller;Garcia
Team3,E,Wilson;Martinez;Gonzalez;Hall
Team4,A,Martin;Jackson;Williams
差异:
$1 = Import-Csv -Path ".\File1.csv" -Delimiter ','
$2 = Import-Csv -Path ".\File2.csv" -Delimiter ','
Compare-Object $1 $2 -Property Members -PassThru
结果:
Teams Category Members SideIndicator
Team3 E Wilson;Martinez;Gonzalez;Hall =>
Team4 A Martin;Jackson;Williams =>
Team3 E Wilson;Martinez <=
Team4 A Martin;Jackson;White;Williams <=
预期:
Teams Category Members SideIndicator
Team3 E Gonzalez and Hall =>
Team4 A White <=
答案 0 :(得分:2)
我会首先比较对象以找出差异(请注意,我比较了两个属性:Teams 和 Members,以避免在不同团队的成员资格匹配的情况下丢失条目),然后比较从匹配对象创建的数组:
$1 = Import-Csv -Path ".\File1.csv" -Delimiter ','
$2 = Import-Csv -Path ".\File2.csv" -Delimiter ','
$comparisonRes = Compare-Object $1 $2 -Property Teams,Members -PassThru
foreach ($obj in $comparisonRes | Where-Object SideIndicator -eq "=>") {
# $obj = ($comparisonRes | Where-Object SideIndicator -eq "=>")[0]
$matchingEntry = $1 | Where-Object {$_.Teams -eq $obj.Teams}
$matchingEntryMembers = $matchingEntry.Members -split ";"
$currentEntryMembers = $obj.Members -split ";"
$diffMembers = Compare-Object $matchingEntryMembers $currentEntryMembers
# Uncomment to log
# $diffMembers
# Do something with $diffMembers here
}
答案 1 :(得分:0)
您可能想要使用 json 而不是支持数组和数字的 csv。否则团队看起来就像两个分号分隔的字符串。
file1.json
[
{"Teams":"Team1","Category":"A","Members":["Smith","Johnson"]},
{"Teams":"Team1","Category":"C","Members":["Jones","Miller","Garcia"]},
{"Teams":"Team3","Category":"E","Members":["Wilson","Martinez"]},
{"Teams":"Team4","Category":"A","Members":["Martin","Jackson","White","Williams"]}
]
file2.json
[
{"Teams":"Team1","Category":"A","Members":["Smith","Johnson"]},
{"Teams":"Team2","Category":"C","Members":["Jones","Miller","Garcia"]},
{"Teams":"Team3","Category":"E","Members":["Wilson","Martinez","Gonzalez","Hall"]},
{"Teams":"Team4","Category":"A","Members":["Martin","Jackson","Williams"]}
]
$1 = cat file1.json | convertfrom-json
$2 = cat file2.json | convertfrom-json
Compare-Object $1 $2 -Property Members -PassThru
Teams Category Members SideIndicator
----- -------- ------- -------------
Team3 E {Wilson, Martinez, Gonzalez, Hall} =>
Team4 A {Martin, Jackson, Williams} =>
Team3 E {Wilson, Martinez} <=
Team4 A {Martin, Jackson, White, Williams} <=
这是一个更接近的答案。一次只对成员运行一行比较对象,然后向其中添加团队和类别。
$1 = cat file1.json | convertfrom-json
$2 = cat file2.json | convertfrom-json
for($i = 0; $i -lt $1.length; $i++) {
compare-object $1[$i].members $2[$i].members |
select @{n='Teams'; e={$1[$i].teams}},
@{n='Category'; e={$1[$i].Category}},
@{n='Members'; e={$_.inputobject}},
sideindicator
}
Teams Category Members SideIndicator
----- -------- ------- -------------
Team3 E Gonzalez =>
Team3 E Hall =>
Team4 A White <=
这是在两个对象列表上使用 zip 函数 PowerShell/CLI: "Foreach" loop with multiple arrays 的另一种方法。
$1 = cat file1.json | convertfrom-json
$2 = cat file2.json | convertfrom-json
function Zip($a1, $a2) { # function allows it to stream
while ($a1) {
$x, $a1 = $a1 # $a1 gets the tail of the list
$y, $a2 = $a2
[tuple]::Create($x, $y)
}
}
zip $1 $2 | % {
$whole = $_ # will lose this $_ in the select
compare-object $whole.item1.members $whole.item2.members |
select @{n='Teams'; e={$whole.item1.teams}},
@{n='Category'; e={$whole.item1.Category}},
inputobject,sideindicator
}
Teams Category InputObject SideIndicator
----- -------- ----------- -------------
Team3 E Gonzalez =>
Team3 E Hall =>
Team4 A White <=