正则表达式替换json对象中的部分表达式

时间:2015-11-21 01:28:07

标签: regex r stringr rjson

我正在尝试编写一个正则表达式来替换json文件对象中的一些有问题的字符。这是简短的 - DEMO

我能够编写php代码,但我无法在str中使用str_replace实现它,因为它开始于{charcters。关于如何改进代码的任何建议?

library(stringr) 
json_file <- json_file<- '{"_id":{"$oid":"4f27779008d69a6dba0208f6"},"actor":{"gravatar_id":"92e5c51218f00220e0362c47b2a94b9a","id":NumberInt(228889),"login":"stefankendall","url":"https://api.github.com/users/stefankendall"},"created_at":"2012-01-31T05:09:37Z","id":"1515677813","org":{"url":"https://api.github.com/orgs/"},"payload":{"commits":[{"author":{"email":"skendall@skendalllaptop.(none)","name":"skendall"},"message":"Made test packages mimic app layout.","sha":"faf1b478f4d98202d4169b6d310812b14ad7f676","url":"https://api.github.com/repos/stefankendall/wendler531-webservices/commits/faf1b478f4d98202d4169b6d310812b14ad7f676"},{"author":{"email":"skendall@skendalllaptop.(none)","name":"skendall"},"message":"_id is now pulled out of \"get\" responses","sha":"d2087821e865ebebf9ff6e47cffb41dd16c6c871","url":"https://api.github.com/repos/stefankendall/wendler531-webservices/commits/d2087821e865ebebf9ff6e47cffb41dd16c6c871"}],"head":"d2087821e865ebebf9ff6e47cffb41dd16c6c871","push_id":NumberInt(59920001),"ref":"refs/heads/master","size":NumberInt(2)},"public":true,"repo":{"id":NumberInt(3186494),"name":"stefankendall/wendler531-webservices","url":"https://api.github.com/repos/stefankendall/wendler531-webservices"},"type":"PushEvent"}' 
str_replace_all(json_file, "\,{"author[^*]*],\\s*","")

1 个答案:

答案 0 :(得分:0)

  1. 你必须逃避"或使用'作为包装而不是"
  2. 您必须在R中两次转义控制字符,即\\{\\^,尤其是\\\\,因为\是R中的控制字符! (每\\替换\并传递给正则表达式解析器。
  3. 最终,\s必须替换为[:space:]
  4. 请参阅here

    你必须使用str_replace_all(json_file,',\\{\\"author[^\\*]*\\],\\s*',"")(即使对我来说没有多大意义......)