Question

I am trying to decode some HTML entities, for example turning '&lt;' into '<'.

I found an old gem (html_helpers), but it seems to have been abandoned twice.

Any recommendations? I need to use it in a model.

Answer 1

To encode the characters, you can use CGI.escapeHTML:

string = CGI.escapeHTML('test "escaping" <characters>')

To decode them, there is CGI.unescapeHTML:

CGI.unescapeHTML("test &quot;unescaping&quot; &lt;characters&gt;")

Of course, you need to require the CGI library first:

require 'cgi'
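
Putting the two calls together, a minimal round-trip sketch using only the standard library might look like this. The helper names encode_entities and decode_entities are hypothetical and exist only for this illustration:

```ruby
require 'cgi'

# Hypothetical helper names, used only for this illustration.
def encode_entities(text)
  CGI.escapeHTML(text)
end

def decode_entities(text)
  CGI.unescapeHTML(text)
end

original = 'test "escaping" <characters>'
encoded  = encode_entities(original)
decoded  = decode_entities(encoded)

puts encoded  # test &quot;escaping&quot; &lt;characters&gt;
puts decoded  # test "escaping" <characters>
```

Decoding the encoded string gives back the original text, which is a quick way to check that both directions agree.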

If you are in Rails, you don't need CGI to encode a string in a view. There is the h method:

<%= h 'escaping <html>' %>
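
Since the question asks about a model, where the h view helper is not normally available, here is a rough sketch of the same calls in a plain Ruby class. The Comment class and its method names are hypothetical stand-ins for a model, and ERB::Util.html_escape is the standard-library method that h aliases:

```ruby
require 'cgi'
require 'erb'

# Hypothetical stand-in for a model; the class and method names are
# assumptions made for this example only.
class Comment
  attr_reader :body

  def initialize(body)
    @body = body
  end

  # Decoded form of the stored text, e.g. "&lt;" becomes "<".
  def decoded_body
    CGI.unescapeHTML(body)
  end

  # Encoded form; ERB::Util.html_escape is the method that h aliases.
  def encoded_body
    ERB::Util.html_escape(body)
  end
end

puts Comment.new('1 &lt; 2').decoded_body          # 1 < 2
puts Comment.new('escaping <html>').encoded_body  # escaping &lt;html&gt;
```

Neither call depends on a view context, so the same methods should work wherever the model is used.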

Answer 2

HTMLEntities can do it:

: jmglov@laurana; sudo gem install htmlentities
Successfully installed htmlentities-4.2.4
: jmglov@laurana;  irb
irb(main):001:0> require 'htmlentities'
=> []
irb(main):002:0> HTMLEntities.new.decode "&iexcl;I&#39;m highly&nbsp;annoyed with character references!"
=> "¡I'm highly annoyed with character references!"

Answer 3

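In a Rails view, raw outputs a string without escaping it again, so any entities it contains are decoded by the browser: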

<%= raw '<html>' %>

So,

<%= raw '&lt;br&gt;' %>

is displayed in the browser as

<br>

Answer 4

I think the Nokogiri gem is also a good choice. It is very stable and has a large community of contributors.

Examples:

require 'nokogiri'

a = Nokogiri::HTML.parse "foo&nbsp;b&auml;r"
a.text
=> "foo bär"

or

a = Nokogiri::HTML.parse "&iexcl;I&#39;m highly&nbsp;annoyed with character references!"
a.text
=> "¡I'm highly annoyed with character references!"

Answer 5

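If you don't want to add a new dependency (such as HTMLEntities) just to do this, and you are already using Hpricot, it can both escape and unescape for you. It handles far more than CGI: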

Hpricot.uxs "foo&nbsp;b&auml;r"
=> "foo bär"
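
The remark about CGI's limits can be checked with the standard library alone. As a small sketch: CGI.unescapeHTML decodes numeric references and the five basic named entities (&amp;, &lt;, &gt;, &quot;, &apos;), but leaves other named entities such as &nbsp; and &auml; as they are:

```ruby
require 'cgi'

# Numeric references and the five basic named entities are decoded.
puts CGI.unescapeHTML('I&#39;m &lt;here&gt; &amp; there')  # I'm <here> & there

# Other named entities are left untouched by CGI.unescapeHTML.
puts CGI.unescapeHTML('foo&nbsp;b&auml;r')  # foo&nbsp;b&auml;r
```

If the input may contain the wider set of named entities, one of the gem-based options in this thread is the better fit.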

Answer 6

You can use the htmlascii gem:

Htmlascii.convert string

Answer 7

<% str="<h1> Test </h1>" %>

result: &lt; h1 &gt; Test &lt; /h1 &gt;

<%= CGI.unescapeHTML(str).html_safe %>

How do I encode/decode HTML entities in Ruby?
