Question

我有许多带有html标记的字符串：

print "$hachage->{'id'}"

我，对于每个标记，我想要一个带有内容和标记的新字符串，在我的例子中它将是：

<font face='verdana'>A great project with many companies
A <b>plumber</b> company as external.
An <i>electricity</i> company.
And a security guard <u><i><b>during 2 weeks </b></i></u>.
</font>

可以删除第一个string1 = "A great project with many companies A" string2 = "plumber" string3 = " company as external. An" string4 = "electricity" string5 = " company. And a string6 = "security " string7 = "guard" string8 = "during 2 weeks " string9 = "." string10 = ""和最后一个。

我尝试过matcher方法，但结果并不是我想要的，或者我的正则表达式并不好。我还试图搜索第一个开始标记和第一个结束标记，但它没有用，因为所有字体标记（和`都有相同的结束标记）;

在字符串中解析HTML

0 个答案: