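I have a long string that contains some relevant text I want to extract, along with some unwanted HTML markup. The markup is not present in every string in the array.
For example: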
"description\":\"SOME RELEVANT TEXT...<img width='1' height='1' src='http:\/\/someurl.com.feedsportal.com\/c\/33818\/f\/608449\/s\/1c52b2b5\/mf.gif' border='0'\/><div class='mf-viral'><table border='0'><tr><td valign='middle'><a href=\\\"http:\/\/share.feedsportal.com\/viral\/sendEmail.cfm?lang=en&title=some_title&link=http%3A%2F%2Fwww.someurl.com%2Fworld-newssome_title%2FArticle1-805340.aspx\\\" target=\\\"_blank\\\"><img src=\\\"http:\/\/res3.feedsportal.com\/images\/emailthis2.gif\\\" border=\\\"0\\\" \/><\/a><\/td>"
From the string above, I only want to extract "SOME RELEVANT TEXT...".
Answer 0 (score: 0)
This removes all tags from the text, though I'm not sure that is what you want.
<script type="text/javascript">
var regex = /(<([^>]+)>)/ig;
var body = "Your text with tags in <br> it<br>";
var result = body.replace(regex, "");
</script>
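Or use jQuery. Put the string into an element first, so that it is parsed as HTML rather than treated as a selector:
$('<div>').html('Your text with tags in <br> it<br>').text()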
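Since the question says the markup is not in every string, here is a minimal sketch of the same regex approach applied to an array. The helper name stripTags and the sample data are made up for illustration, and the pattern is a slightly simplified form of the one above. Note that this keeps any text that sits between tags, so it only gives the desired result when the unwanted part is pure markup.

```javascript
// Hypothetical helper: remove anything that looks like an HTML tag.
function stripTags(text) {
  return text.replace(/<[^>]+>/g, "");
}

// Made-up sample data: one entry with markup, one without.
var descriptions = [
  "SOME RELEVANT TEXT...<img width='1' height='1' border='0'/><div class='mf-viral'>",
  "Plain text with no markup"
];

// Strings without tags pass through unchanged.
var cleaned = descriptions.map(stripTags);
console.log(cleaned);
// -> [ 'SOME RELEVANT TEXT...', 'Plain text with no markup' ]
```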
Answer 1 (score: 0)
PHP or JS?
JS: http://jsfiddle.net/mplungjan/fpzEn/
var str = "description\":\"SOME RELEVANT TEXT...<img width='1' height='1' src='http:\/\/someurl.com.feedsportal.com\/c\/33818\/f\/608449\/s\/1c52b2b5\/mf.gif' border='0'\/><div class='mf-viral'><table border='0'><tr><td valign='middle'><a href=\\\"http:\/\/share.feedsportal.com\/viral\/sendEmail.cfm?lang=en&title=some_title&link=http%3A%2F%2Fwww.someurl.com%2Fworld-newssome_title%2FArticle1-805340.aspx\\\" target=\\\"_blank\\\"><img src=\\\"http:\/\/res3.feedsportal.com\/images\/emailthis2.gif\\\" border=\\\"0\\\" \/><\/a><\/td>"
alert(str.split(/description":"/)[1].split(/</)[0])
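The one-liner above assumes that both the description":" marker and a "<" are present. As a rough sketch of how the same idea could tolerate strings without markup (or without the marker), something like the following might work. The function name extractDescription and the shortened sample strings are assumptions for illustration only.

```javascript
// Hypothetical helper: return the text after 'description":"' up to the first "<".
function extractDescription(raw) {
  var marker = 'description":"';
  var start = raw.indexOf(marker);
  // If the marker is missing, fall back to the whole string.
  var text = start === -1 ? raw : raw.slice(start + marker.length);
  var tagStart = text.indexOf("<");
  // If there is no markup, keep the whole remainder.
  return tagStart === -1 ? text : text.slice(0, tagStart);
}

// Shortened, made-up versions of the string from the question.
var withMarkup = "description\":\"SOME RELEVANT TEXT...<img width='1' height='1' border='0'/><div class='mf-viral'>";
var withoutMarkup = "description\":\"JUST TEXT";

console.log(extractDescription(withMarkup));    // -> SOME RELEVANT TEXT...
console.log(extractDescription(withoutMarkup)); // -> JUST TEXT
```

Using indexOf instead of split(...)[1] avoids a TypeError when the marker is absent, at the cost of a few more lines.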