从字符串格式化Html代码

时间:2013-08-06 10:01:26

标签: c# html asp.net

我基本上有一些来自某些HTML代码的数据库的字符串。现在我想在asp.net网页上的某种textarea中显示html代码,用户可以编辑html代码,其中包括缩进等...用户可以将html代码保存回到数据库中。

我读过有关使用非托管代码的TidyManaged,我不想使用非托管代码。

是否有一些示例代码或网站解释了这一点以格式化html代码,例如 - Html Formatter

任何方向或帮助将不胜感激。

编辑:

我只想让html标签缩进,记住解析的html字符串可能不符合,这对于这个应用程序来说很好。

使用的html字符串示例 -

string a = "<html><body><h1>hello</h1></body></html>

输出应如下所示

<html>
  <body>
    <h1>
      hello
    </h1>
  </body>
</html>

2 个答案:

答案 0 :(得分:2)

这很冗长,但我认为它可以满足基本需求。我决定自己推动使用库...也许不是最有效的代码:)

    /// <summary>
    /// This function creates an indented format of HTML with new lines for all elements and text.  Errors result in the original text being returned.
    /// </summary>
    private static string FormatHtml(string content)
    {
        string original = content; 
        string open = "<";
        string slash = "/";
        string close = ">";

        int depth = 0; // the indentation
        int adjustment = 0; //adjustment to depth, done after writing text

        int o = 0; // open      <   index of this character
        int c = 0; // close     >   index of this character
        int s = 0; // slash     /   index of this character
        int n = 0; // next      where to start looking for characters in the next iteration
        int b = 0; // begin     resolved start of usable text
        int e = 0; // end       resolved   end of usable test

        string snippet;

        try
        {
            using (StringWriter writer = new StringWriter())
            {
                while (b > -1 && n > -1)
                {
                    o = content.IndexOf(open, n);
                    s = content.IndexOf(slash, n);
                    c = content.IndexOf(close, n);
                    adjustment = 0;

                    b = n; // begin where we left off in the last iteration
                    if (o > -1 && o < c && o == n)
                    {
                        // starts with "<tag>text"
                        e = c; // end at the next closing tag
                        adjustment = 2; //for after this node
                    }
                    else
                    {
                        // starts with "text<tag>"
                        e = o - 1; // end at the next opening tag
                    }

                    if (b == o && b + 1 == s) // ?Is the 2nd character a slash, this the a closing tag: </div>
                    {
                        depth -= 2;//adjust immediately, not afterward ...for closing tag
                        adjustment = 0;
                    }

                    if ((s + 1) == c && c == e) // don't adjust depth for singletons:  <br/>
                    {
                        adjustment = 0;
                    }



                    //string traceStart = content.Substring(0, b);
                    int length = (e - b + 1);
                    if (length < 0)
                    {
                        snippet = content.Substring(b); // happens on the final iteration
                    }
                    else
                    {
                        snippet = content.Substring(b, (e - b + 1));
                    }
                    //string traceEnd = content.Substring(b);


                    if (snippet == "<br>" || snippet == "<hr>") // don't adjust depth for singletons which lack slashes: <br>
                    {
                        adjustment = 0; 
                    }

                    //Write the text
                    if (!string.IsNullOrEmpty(snippet.Trim()))
                    {
                        //Debug.WriteLine(snippet);
                        writer.Write(Environment.NewLine);
                        if (depth > 0) writer.Write(new String(' ', depth)); // add the indentation 
                        writer.Write(snippet); 
                    }

                    depth += adjustment; //adjust for the next line which is likely nested

                    n = e + 1; // the next iteration start at the end of this one.

                }

                return writer.ToString();
            }
        }
        catch (Exception ex)
        {
            Log("Unable to format html. " + ex.Message);
            return original;
        }
    }

答案 1 :(得分:0)

每个HTML标记都有一个'&gt;'符号。您可以使用子字符串并在'&gt;'后附加新的行语句。