Question

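I have a function that performs a regex replace in a file. The problem is that it adds a character (0x00) to the beginning of every file it touches (even the ones where it finds no match!). Since I am editing csproj files, MSBuild gives me this error: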

error MSB4025: The project file could not be loaded. '.', hexadecimal value 0x00, is an invalid character. Line 2, position 1.

Here is my function:

function fileStringRegExReplace ([string] $fileToChange, [string] $oldString, [string] $newString) {
    echo "f" | xcopy "$fileToChange" "$fileToChange.og.cs" /Y /Q

    $file = Get-Content "$fileToChange.og.cs" | 
        Foreach-Object {
            $_ -replace $oldString, $newString
        } |
        Out-File "$fileToChange"

    Remove-Item "$fileToChange.og.cs"
}

How can I replace the lines I want without changing any other part of the file?

Answer 1

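It sounds like it is writing a BOM at the beginning of the file. You can set the encoding to ASCII (which has no BOM) by using the -Encoding ASCII parameter on Out-File.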

Answer 2

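I ran into the same problem after using ForEach to replace text.

For my solution, I just wanted to find the last </Target> and add another <Target></Target> after it.

For some reason, the approach I tried doubled the file size and also failed with the 0x00 error at Line: 2, Position: 1.

I have to credit @Matt for this solution, since I probably would not have come up with the regex myself: https://stackoverflow.com/a/28437855/740575

It let me cleanly avoid the ForEach approach. You should be able to find your answer somewhere in this solution.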

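$replaceVar = "<Target> ... </Target>"
# NOTE: -Raw reads the entire file in as a single string; without it,
#       everything gets read in as an array of lines
$file = Get-Content file.csproj -Raw
# (?s) lets . match newlines; the greedy (.*) makes the pattern hit the LAST </Target>.
# The backticks stop PowerShell from expanding $1 and $2 as (empty) variables,
# and the matched </Target> is written back before the new block is appended.
$newFile = $file -replace "(?s)(.*)</Target>(.*)", "`$1</Target>$replaceVar`$2"

# csproj is UTF8
$newFile | Out-File -Encoding UTF8 "new.csproj"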

This solution works with both Visual Studio and msbuild.exe.

Answer 3

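In Windows PowerShell, the default encoding of Out-File is Unicode, which is the Windows name for UTF-16. When only characters from the ASCII set are written, UTF-16 effectively pairs every character with a 0x00 byte (the zero byte follows the character in the little-endian form Windows uses). This explains why Visual Studio complains about 0x00 bytes.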
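
As a small illustration of the point above, the following sketch encodes a short ASCII-only string with .NET's UTF-16 (`Unicode`) and UTF-8 encoders and prints the resulting bytes. The sample string is made up for the demonstration, and `GetBytes` does not emit the BOM that Out-File would additionally write.

```powershell
# Sample ASCII-only text (hypothetical; any ASCII string behaves the same way)
$text = '<P>'

# [System.Text.Encoding]::Unicode is UTF-16 little-endian
$utf16 = [System.Text.Encoding]::Unicode.GetBytes($text)
$utf8  = [System.Text.Encoding]::UTF8.GetBytes($text)

# Format each byte as two hex digits
$utf16Hex = ($utf16 | ForEach-Object { $_.ToString('X2') }) -join ' '
$utf8Hex  = ($utf8  | ForEach-Object { $_.ToString('X2') }) -join ' '

"UTF-16: $utf16Hex"   # UTF-16: 3C 00 50 00 3E 00
"UTF-8 : $utf8Hex"    # UTF-8 : 3C 50 3E
```

Every second byte in the UTF-16 output is 0x00, which is what an XML parser expecting UTF-8 trips over.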

The csproj file you are trying to modify declares itself as UTF-8 in its XML declaration, so use the -Encoding UTF8 option on Out-File.
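
One way this advice might be applied to the function from the question is sketched below. It assumes the file really is UTF-8 and small enough to read into memory; dropping the xcopy temporary copy is my own simplification, not part of the original answer. In Windows PowerShell, `-Encoding UTF8` writes a UTF-8 BOM, which MSBuild accepts.

```powershell
# Sketch of the question's function with an explicit output encoding.
# Assumes the file is UTF-8 and fits comfortably in memory.
function fileStringRegExReplace ([string] $fileToChange, [string] $oldString, [string] $newString) {
    # The parentheses make Get-Content read the whole file before the
    # pipeline starts writing, so the same path can be read and overwritten.
    (Get-Content -Path $fileToChange -Encoding UTF8) |
        ForEach-Object { $_ -replace $oldString, $newString } |
        Out-File -FilePath $fileToChange -Encoding UTF8
}
```

A call could look like `fileStringRegExReplace 'MyProject.csproj' 'old' 'new'`, where the file name and patterns are placeholders.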

Do not use ASCII encoding: it will cause problems as soon as the csproj file contains any non-ASCII characters.
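
To show what that problem looks like, here is a minimal sketch that pushes a string through .NET's ASCII encoder and back, roughly what a round trip through an ASCII-encoded file does. The `<Company>` element and its value are invented for the example.

```powershell
# Hypothetical csproj fragment containing a non-ASCII character (é, U+00E9).
# [char]0xE9 is used so the sketch does not depend on the script file's own encoding.
$text = '<Company>Caf' + [char]0xE9 + '</Company>'

# Encode to ASCII bytes and decode again
$ascii     = [System.Text.Encoding]::ASCII
$roundTrip = $ascii.GetString($ascii.GetBytes($text))

$roundTrip   # <Company>Caf?</Company>
```

The character that ASCII cannot represent is silently replaced by `?`, so the original text is lost.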

Answer 4

Try replacing Out-File with Set-Content.
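
A minimal sketch of that suggestion follows. In Windows PowerShell, Set-Content defaults to the system's ANSI code page rather than UTF-16, which is why the 0x00 bytes go away; passing `-Encoding UTF8` explicitly, as assumed here, avoids relying on that default. The temporary file name and its contents are made up so the sketch runs on its own.

```powershell
# Create a small throwaway file so the sketch is self-contained (hypothetical content)
$path = Join-Path ([System.IO.Path]::GetTempPath()) 'set-content-demo.csproj'
Set-Content -Path $path -Value '<Project>', '  <Target Name="Old" />', '</Project>' -Encoding UTF8

# Read everything first (parentheses), replace line by line, then write back
(Get-Content -Path $path -Encoding UTF8) |
    ForEach-Object { $_ -replace 'Old', 'New' } |
    Set-Content -Path $path -Encoding UTF8

# Show the rewritten file
Get-Content -Path $path -Encoding UTF8
```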

PowerShell Get-Content -> Foreach-Object -> -replace -> Out-File adds a char (0x00) to the beginning of every file
