Extract text from HTML / JavaScript into a string using PHP

Time: 2013-06-05 08:04:54

Tags: php html regex parsing extract

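I have a problem: I need to extract the text between "body":{"text":" and the closing ". In this example it occurs twice; these are comments from Facebook. I want to extract the text between those two delimiters and put it into a single variable, with the entries separated by line breaks. I am using PHP and already have the whole HTML in a variable named $html.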

Here is the code:

[
   {
      "id":"537891442938686_1601777",
      "fbid":"537891612938669",
      "legacyid":"1601777",
      "body":{
         "text":"Jaaa:D",
         "ranges":[

         ],
         "aggregatedranges":[

         ]
      },
      "author":"1301211957",
      "ftentidentifier":"537891442938686",
      "likecount":0,
      "hasviewerliked":false,
      "canremove":false,
      "canreport":false,
      "canedit":false,
      "source":null,
      "istranslatable":false,
      "timestamp":{
         "time":1370411670,
         "text":"",
         "verbose":"4. junij 2013 ob 22:54"
      },
      "spamreplycount":0,
      "replyauthors":[

      ],
      "isfeatured":false
   },
   {
      "id":"537891442938686_1601790",
      "fbid":"537895309604966",
      "legacyid":"1601790",
      "body":{
         "text":"Takteri\u010d \u017ee sodelujem?? Hmm 5i\u010d?",
         "ranges":[

         ],
         "aggregatedranges":[

         ]
      },
      "author":"1087657810",
      "ftentidentifier":"537891442938686",
      "likecount":0,
      "hasviewerliked":false,
      "canremove":false,
      "canreport":false,
      "canedit":false,
      "source":null,
      "istranslatable":false,
      "timestamp":{
         "time":1370412832,
         "text":"",
         "verbose":"4. junij 2013 ob 23:13"
      },
      "spamreplycount":0,
      "replyauthors":[

      ],
      "isfeatured":false
   }
],
"profiles":[
   {
      "id":"1301211957",
      "name":"Maja Lamov\u0161ek",
      "firstName":"Maja",
      "vanity":"zmaychee",
      "thumbSrc":"https:\/\/fbcdn-profile-a.akamaihd.net\/hprofile-ak-ash4\/s32x32"
   }
]

Thank you!

1 answer:

Answer 0 (score: 4)

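The data you have is in JSON format. I strongly recommend decoding it into an object (or array). After that you can easily iterate over all the properties you need. Take a look at json_decode (http://php.net/manual/en/function.json-decode.php).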
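
A minimal sketch of that approach follows. It assumes the comments array has already been isolated from $html as a standalone, valid JSON string; the excerpt in the question is only a fragment of a larger structure, so decoding it as posted would fail. The variable names $json, $comments, $texts and $result are illustrative, and the sample data is trimmed to the fields that matter here.

```php
<?php
// Assumption: $json holds ONLY the comments array, already cut out of $html.
// A nowdoc is used so the \uXXXX escapes reach json_decode() untouched.
$json = <<<'JSON'
[
   {"id": "537891442938686_1601777", "body": {"text": "Jaaa:D"}},
   {"id": "537891442938686_1601790", "body": {"text": "Takteri\u010d \u017ee sodelujem?? Hmm 5i\u010d?"}}
]
JSON;

// Decode into nested associative arrays (second argument true).
$comments = json_decode($json, true);
if (!is_array($comments)) {
    throw new RuntimeException('Could not decode JSON: ' . json_last_error_msg());
}

// Collect every body.text value, skipping entries that lack one.
$texts = [];
foreach ($comments as $comment) {
    if (isset($comment['body']['text'])) {
        $texts[] = $comment['body']['text'];
    }
}

// Join the comment texts into a single string, one per line.
$result = implode("\n", $texts);
echo $result, "\n";
```

Decoding also converts escapes such as \u010d into the actual UTF-8 characters, which a plain regex match on the raw text would not do.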