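I use the xls2csv binary to convert XLS documents to CSV on my Red Hat Linux machine.
In some cases, the XLS document has spaces before or after a word.
Example of the XLS file (word3 has a space on its right side):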
word1 word2 "word3 "
word1 word2 "word3 "
word1 word2 "word3 "
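After running xls2csv as:
xls2csv -x "1252spreadsheet.xls" -b WINDOWS-1252 -c "ut8csvfile.csv" -a UTF-8
I noticed that word3 in the CSV file still has the same trailing space as in the XLS ("word3 ").
How can I configure xls2csv to strip these spaces from the XLS document, so that I get a CSV without them?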
FROM XLS2CSV MAN PAGE:
man xls2csv
XLS2CSV(1) User Contributed Perl Documentation XLS2CSV(1)
NAME
xls2csv - A script that recodes a spreadsheet's charset and saves as CSV.
DESCRIPTION
This script will recode a spreadsheet into a different character set and output the recoded data as a csv file.
The script came about after many headaches from dealing with Excel spreadsheets from clients that were being
received in various character sets.
OPTIONS
-x : filename of the source spreadsheet
-b : the character set the source spreadsheet is in (before)
-c : the filename to save the generated csv file as
-a : the character set the csv file should be converted to (after)
-q : quiet mode
-s : print a list of supported character sets
-h : print help message
-v : get version information
-W : list worksheets in the spreadsheet specified by -x
-w : specify the worksheet name to convert (defaults to the first worksheet)
-n : specify the worksheet number to convert (you cannot use this option with -w)
-f : force the worksheet to be fully parsed. This disables the feature that skips rows when the first cell is blank
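Answer 0: (score: 1)
IMO, it sounds like you are using the wrong tool. Perhaps you should write a script that uses Spreadsheet::ParseExcel to read and parse the XLS file, then applies whatever field reformatting you need. Once that is done, use Text::CSV_XS to output the CSV data.
Edit: I was going to post this as a comment, but I suppose it works as an answer too.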
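The man page excerpt quoted in the question lists no option for trimming cell contents, so one alternative to rewriting the whole conversion is to clean the CSV after xls2csv has produced it. The following is only a rough sketch of that idea in Python rather than the Perl modules named above. The function name strip_csv_fields and the sample data are made up for illustration, and it assumes the CSV is already UTF-8 and that no field needs its surrounding whitespace preserved.

```python
import csv
import io


def strip_csv_fields(src, dst):
    """Copy CSV rows from src to dst, trimming whitespace around every field."""
    reader = csv.reader(src)
    writer = csv.writer(dst, lineterminator="\n")
    for row in reader:
        # str.strip() removes leading and trailing whitespace only;
        # spaces inside a field are left untouched.
        writer.writerow([field.strip() for field in row])


if __name__ == "__main__":
    # Hypothetical sample imitating the xls2csv output described in the question.
    sample = io.StringIO('word1,word2,"word3 "\nword1,word2,"word3 "\n')
    cleaned = io.StringIO()
    strip_csv_fields(sample, cleaned)
    print(cleaned.getvalue(), end="")
```

For a real file, the same function could be called with file objects opened via open("ut8csvfile.csv", newline="", encoding="utf-8") for reading and a second, differently named file for writing. Going through a CSV parser instead of a plain sed substitution means that quoted fields containing commas are handled correctly.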