我已经实现了Cuong在这里建议的解决方案: C# Processing Fixed Width Files
我还让它通过一个文件夹并将其应用于该文件夹中的所有.txt文件。
一切正常,但对于某些.txt文件,它在var csvLines上失败并出现以下错误:
{"Index and length must refer to a location within the string.\r\nParameter name: length"}
A first chance exception of type 'System.ArgumentOutOfRangeException' occurred in mscorlib.dll
System.ArgumentOutOfRangeException: Index and length must refer to a location within the string.
Parameter name: length
at System.String.InternalSubStringWithChecks(Int32 startIndex, Int32 length, Boolean fAlwaysCopy)
at System.String.Substring(Int32 startIndex, Int32 length)
at FixedWidthFiles.Main.<>c__DisplayClass11.<>c__DisplayClass13.<buttonProcessAllFiles_Click>b__d(KeyValuePair`2 pair) in \\GBMACCMPFS11\Shhk$\Visual Studio 2010\Projects\FixedWidthFiles\FixedWidthFiles\Main.cs:line 138
at System.Linq.Enumerable.WhereSelectListIterator`2.MoveNext()
at System.String.Join(String separator, IEnumerable`1 values)
at FixedWidthFiles.Main.<>c__DisplayClass11.<buttonProcessAllFiles_Click>b__c(String line) in \\GBMACCMPFS11\Shhk$\Visual Studio 2010\Projects\FixedWidthFiles\FixedWidthFiles\Main.cs:line 137
at System.Linq.Enumerable.WhereSelectArrayIterator`2.MoveNext()
at System.IO.File.InternalWriteAllLines(TextWriter writer, IEnumerable`1 contents)
at System.IO.File.WriteAllLines(String path, IEnumerable`1 contents)
at FixedWidthFiles.Main.buttonProcessAllFiles_Click(Object sender, EventArgs e) in \\GBMACCMPFS11\Shhk$\Visual Studio 2010\Projects\FixedWidthFiles\FixedWidthFiles\Main.cs:line 140
知道出了什么问题吗?它可能是文件,但我希望在代码中可以纠正/改进某些内容:)
代码就是这样:
private void buttonProcessAllFiles_Click(object sender, EventArgs e)
{
if (fileFolderPath == "")
{
MessageBox.Show("Load Folder First", "Error", MessageBoxButtons.OK, MessageBoxIcon.Error);
}
else
{
int count = 0;
//foreach (var file in Directory.GetFiles(fileFolderPath, "*.txt", SearchOption.AllDirectories))
foreach (var file in Directory.GetFiles(fileFolderPath, "*.txt"))
{
count++;
System.Diagnostics.Debug.WriteLine(count);
fileFolderFull = Path.GetFullPath(file);
System.Diagnostics.Debug.WriteLine(fileFolderFull);
fileFolderName = Path.GetFileNameWithoutExtension(file);
System.Diagnostics.Debug.WriteLine(fileFolderName);
//MessageBox.Show("Full Folder: " + fileFolderFull);
//MessageBox.Show("File Name: " + fileFolderName);
var lines = File.ReadAllLines(fileFolderFull);
var widthList = lines.First().GroupBy(c => c)
.Select(g => g.Count())
.ToList();
var list = new List<KeyValuePair<int, int>>();
int startIndex = 0;
for (int i = 0; i < widthList.Count(); i++)
{
var pair = new KeyValuePair<int, int>(startIndex, widthList[i]);
list.Add(pair);
startIndex += widthList[i];
}
try
{
var csvLines = lines.Select(line => string.Join(",",
list.Select(pair => line.Substring(pair.Key, pair.Value))));
File.WriteAllLines(fileFolderPath + "\\" + fileFolderName + ".csv", csvLines);
}
catch (Exception ex)
{
System.Diagnostics.Debug.WriteLine(ex);
}
}
MessageBox.Show("File Saved", "Completed", MessageBoxButtons.OK, MessageBoxIcon.Information);
}
}
错误的行:
var csvLines = lines.Select(line => string.Join(",",
list.Select(pair => line.Substring(pair.Key, pair.Value))));
答案 0 :(得分:3)
通读堆栈跟踪。第一个有趣的位置是:
Visual Studio 2010\Projects\FixedWidthFiles\FixedWidthFiles\Main.cs:line 138
这是您的文件,包含您的代码。例外情况是,您正在读取索引中的字符串,该索引大于字符串大小。这都表示代码中存在错误。转到该文件并分析该行。一旦你想到这一点并且不知道为什么索引可能超出那个地方的范围,复制那些代码以及一些代码,并在此处发布。仅仅通过堆栈跟踪很难说出更多信息..
编辑:你添加了代码,很酷!
由于它在子字符串中肯定失败,请查看子字符串作为索引的内容:它是来自pair的值。该值来自列宽的增量求和,因此很可能您的一个输入文件只有一行......太短。检查文件的开头或结尾有虚假的空行!
你必须剥离这些行的文件,或者修改你的代码:不是盲目地调用子串,而是用if或Math.min保护它:
str.Substring(
Math.Min(str.Length, pair.Key), // here it MAY be needed a str.Length-1 instead!
Math.Min(Math.Max(0,str.Length-pair.Key), pair.Value)
)
这样,对于所有太短的线,场切割器将返回空字符串。请注意,保护这两个参数是好的,并且还需要检查Length-StartIndex与负数,因为它们的StartIndex可能大于空行的长度:)
顺便说一句。通过StartIndex我当然意味着对。关键..
答案 1 :(得分:1)
我认为它在Substring方法上失败了。你可以添加一行检查line.Length&gt; (pair.Key + pair.Value)?