Decode / unescape Unicode in Google Apps Script

Date: 2018-11-20 12:33:57

Tags: google-apps-script decode unescapestring

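I'm new to decoding, so I can't work out how to decode the following response when using UrlFetchApp.

Below is the function I use to fetch the data. The values it returns contain Unicode escape sequences (such as \u003c) that still need to be decoded. Is there a function I can use to decode them?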

function scrap() {

  var url = UrlFetchApp.fetch("https://immi.homeaffairs.gov.au/visas/working-in-australia/skillselect/invitation-rounds");
  var elements =/id="ctl00_PlaceHolderMain_PageSchemaHiddenField_Input" (.*?)\/>/gim;

  var x = url.getContentText().match(elements);

  Logger.log(x);

}

2 Answers:

Answer 0 (score: 1)

I'm not sure whether this is the best approach, but how about this modification?

Modified script:

function scrap() {
  var url = UrlFetchApp.fetch("https://immi.homeaffairs.gov.au/visas/working-in-australia/skillselect/invitation-rounds");
  var elements =/id="ctl00_PlaceHolderMain_PageSchemaHiddenField_Input" (.*?)\/>/gim;
  var x = url.getContentText().match(elements);

  // Added: rewrite each \uXXXX as %uXXXX so that unescape() can decode it
  var res = unescape(x[0].replace(/\\u/g, "%u"));

  Logger.log(res);
}

Result:

With the modified script above, the values are converted as in the following example.

From:
\u003cp\u003eThe table below shows the number of invitations issued in the SkillSelect invitation round on 11 September 2018.\u003c/p\u003e\n\n\u003ch3\u003eInvitations issued on 11 September 2018\u003c/h3\u003e\n\n
To:
<p>The table below shows the number of invitations issued in the SkillSelect invitation round on 11 September 2018.</p>\n\n<h3>Invitations issued on 11 September 2018</h3>\n\n
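The conversion above relies on unescape(), which the ECMAScript specification lists as a legacy function. As a minimal sketch of an alternative, the same \uXXXX sequences can be decoded with a regular expression and String.fromCharCode. The helper name decodeUnicodeEscapes is hypothetical and does not appear in the original answers; like the script above, it leaves the literal \n sequences untouched.

```javascript
// Hypothetical helper (not from the original answers): decodes \uXXXX
// escape sequences without calling unescape().
function decodeUnicodeEscapes(text) {
  // Match a literal backslash, "u", and exactly four hex digits.
  return text.replace(/\\u([0-9a-fA-F]{4})/g, function (match, hex) {
    // Convert the four hex digits to the corresponding UTF-16 code unit.
    return String.fromCharCode(parseInt(hex, 16));
  });
}

// Example with a fragment of the value shown above.
var sample = "\\u003ch3\\u003eInvitations issued on 11 September 2018\\u003c/h3\\u003e";
var decoded = decodeUnicodeEscapes(sample);
```

In scrap(), this would stand in for the unescape line, for example var res = decodeUnicodeEscapes(x[0]);.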

If I have misunderstood your question, I'm sorry.

Answer 1 (score: 0)

function scrap() {
  var url = UrlFetchApp.fetch("https://immi.homeaffairs.gov.au/visas/working-in-australia/skillselect/invitation-rounds");
  var elements =/id="ctl00_PlaceHolderMain_PageSchemaHiddenField_Input" (.*?)\/>/gim;
  var x = url.getContentText().match(elements);

  var res = unescape(x[0].replace(/\\u/g, "%u")); // Added

  Logger.log(res)
}
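The scripts above read x[0] directly, but match() returns null when nothing matches, for instance if the page markup changes, and x[0] would then throw. A minimal sketch of a guarded extraction follows; the helper name extractHiddenField and the sample markup are assumptions made for illustration, not taken from the real page. Dropping the g flag also makes the capture group available, so only the text after the id attribute is returned.

```javascript
// Hypothetical helper (not from the original answers): returns the text
// captured after the id attribute, or null when the pattern is not found.
function extractHiddenField(html) {
  // Same pattern as in scrap(), but without the g flag so that exec()
  // exposes the capture group.
  var pattern = /id="ctl00_PlaceHolderMain_PageSchemaHiddenField_Input" (.*?)\/>/i;
  var match = pattern.exec(html);
  return match ? match[1] : null;
}

// Made-up markup, used only to illustrate the call.
var sampleHtml = '<input id="ctl00_PlaceHolderMain_PageSchemaHiddenField_Input" value="abc" />';
var field = extractHiddenField(sampleHtml);
var missing = extractHiddenField("<p>no hidden field here</p>");
```

A caller can then check for null before decoding, rather than letting the script fail on a missing match.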