I am ingesting a CSV file:
"ID","LASTNAME","FIRSTNAME","PERM_ADDR1","PERM_ADDR2","PERM_CITY","PERM_ST","PERM_ZIP","DOB","LIB_TYPE","BARCODE","EMAIL","LOCAL_ADDR1","LOCAL_ADDR2","LOCAL_CITY","LOCAL_ST","LOCAL_ZIP","CAMPUS_ADDR1","CAMPUS_ADDR2","CAMPUS_CITY","CAMPUS_ST","CAMPUS_ZIP","DEPARTMENT","MAJOR"
"123","Lastname","Firstname","123 Home St","","Home City","HS","12345-6789","0101","S","1234567890","last.first@domain.local","123 Local St","","Local City","LS","98765-4321","123 Campus St","","Campus City","CS","54321-6789","IT",""
Using Text::CSV, I am trying to parse it into a hash:
my $csv = Text::CSV->new();
chomp(my $line = <READ>);
$csv->column_names(split(/,/, $line));
until (eof(READ)) {
    $line = $csv->getline_hr(*READ);
    my %linein = %$line;
    my %patron;
    $patron{'patronid'} = $linein{'ID'};
    $patron{'last'} = $linein{'LASTNAME'};
    $patron{'first'} = $linein{'FIRSTNAME'};
    print p(%linein)."\n";
    print p(%patron)."\n";
}
With this code, the final print statements (using Data::Printer) return:
{
"BARCODE" 1234567890,
"CAMPUS_ADDR1" "123 Campus St",
"CAMPUS_ADDR2" "",
"CAMPUS_CITY" "Campus City",
"CAMPUS_ST" "CS",
"CAMPUS_ZIP" "54321-6789",
"DEPARTMENT" "IT",
"DOB" 0101,
"EMAIL" "last.first@domain.local",
"FIRSTNAME" "Firstname",
"ID" 123,
"LASTNAME" "Lastname",
"LIB_TYPE" "S",
"LOCAL_ADDR1" "123 Local St",
"LOCAL_ADDR2" "",
"LOCAL_CITY" "Local City",
"LOCAL_ST" "LS",
"LOCAL_ZIP" "98765-4321",
"MAJOR" "",
"PERM_ADDR1" "123 Home St",
"PERM_ADDR2" "",
"PERM_CITY" "Home City",
"PERM_ST" "HS",
"PERM_ZIP" "12345-6789"
}
{
first undef,
last undef,
patronid undef
}
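I don't understand why %patron is not being populated with the values from %linein. I wonder whether this is somehow related to using Text::CSV, because I parse other files elsewhere in the script and they work fine. Those files are not CSV, though; they are fixed-width, so I parse them by hand.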
Answer 0 (score: 6)
Try
$csv->column_names(map {/"(.*)"/ and $1} split(/,/, $line))
instead of
$csv->column_names(split(/,/, $line));
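A plain split on commas leaves the surrounding double quotes in each header field, so your CSV keys are defined as the literal strings
'"LASTNAME"' , '"FIRSTNAME"'
rather than just
'LASTNAME' , 'FIRSTNAME'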
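To make the difference concrete, here is a minimal sketch that uses only core Perl. The shortened header line and the variable names `@raw` and `@clean` are made up for illustration; the `map` expression is the one from the suggested fix.

```perl
use strict;
use warnings;

# A shortened header line, quoted the same way as in the question's file.
my $line = '"ID","LASTNAME","FIRSTNAME"';

# A plain split keeps the double quotes as part of every field.
my @raw = split /,/, $line;

# Capturing the text between the quotes yields the bare column names.
my @clean = map { /"(.*)"/ and $1 } @raw;

print "raw:   @raw\n";
print "clean: @clean\n";
```

A lookup such as $linein{'ID'} only matches a key from the second list, which is why every field in %patron came back undef.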
Data::Printer actually does a decent job of showing you what is going on: every key in p(%linein) is displayed with the double quotes as part of the string, unlike the keys in p(%patron).
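Another option, not part of the answer above, is to let Text::CSV parse the header row itself, so that quoting is handled the same way as for the data rows. The following is only a sketch: the in-memory sample data, the lexical filehandle `$fh`, and the `@patrons` array are assumptions for illustration, and it relies on `column_names` accepting the array reference returned by `getline`, as described in the Text::CSV documentation.

```perl
use strict;
use warnings;
use Text::CSV;

# Sample data standing in for the real file (shortened to three columns).
my $data = <<'CSV';
"ID","LASTNAME","FIRSTNAME"
"123","Lastname","Firstname"
CSV

# Read from an in-memory filehandle so the example is self-contained.
open my $fh, '<', \$data or die "Cannot open in-memory file: $!";

my $csv = Text::CSV->new({ binary => 1 })
    or die "Cannot use Text::CSV: " . Text::CSV->error_diag;

# Parse the header with the CSV parser instead of split, so the
# surrounding quotes are removed before the names become hash keys.
$csv->column_names($csv->getline($fh));

my @patrons;
while (my $row = $csv->getline_hr($fh)) {
    push @patrons, {
        patronid => $row->{ID},
        last     => $row->{LASTNAME},
        first    => $row->{FIRSTNAME},
    };
}
close $fh;

print "$_->{patronid}: $_->{first} $_->{last}\n" for @patrons;
```

Looping on the return value of `getline_hr` also replaces the `until (eof(READ))` test from the question, since `getline_hr` returns undef once the input is exhausted.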