Question

我正在为一个简单的行解析一个html文件。所以我没有使用第三方库只是普通的字符串函数（s.subString()等）。

我的问题是我无法在HTML中正确找到特定元素，因为它们包含制表符，结束符，回车符。

如何打印以下字符串以将所有回车显示为\ r，所有回车显示为\ n＆amp;等等。那么我就可以看到HTML文件的布局＆amp;确保我的

.subString("<div class=\"x\">")

没有失败，因为文本真的是

("<div \t\r\nclass=\"x\">"

还是什么？

我的代码：

WebClient wc = new WebClient();
string html = wc.DownloadString(String.Format("http://www.ipchecking.com/?ip={0}&check=Lookup", ip));

Console.Write( html ); // I want to print in the raw form where \r characters are actually shown as \r characters

Answer 1

html = html.Replace("\r", "\\r").Replace("\n","\\n").Replace("\t","\\t");

Answer 2

我认为最好的方法是使用正则表达式REGEX：

Sample Regular Expressions

打印一个字符串并显示换行符＆amp;选项卡的字符格式

2 个答案: