具有复杂目标字符串的PHP Regex问题

时间:2009-07-16 13:58:57

标签: php regex

我需要用php regex解析dhcp leases表。 但问题是它包含正则表达式中使用的不同字符。

这里是示例输出

lease 172.17.2.3 {
  starts 4 2009/07/16 11:54:39;
  ends 4 2009/07/16 12:54:39;
  cltt 4 2009/07/16 11:54:39;
  binding state active;
  next binding state free;
  hardware ethernet 00:50:56:c0:00:01;
  uid "\001\000PV\300\000\001";
  client-hostname "Yasin-PC";
}
lease 172.17.2.3 {
  starts 4 2009/07/16 12:24:39;
  ends 4 2009/07/16 13:24:39;
  cltt 4 2009/07/16 12:24:39;
  binding state active;
  next binding state free;
  hardware ethernet 00:50:56:c0:00:01;
  uid "\001\000PV\300\000\001";
  client-hostname "Yasin-PC";
}
lease 172.17.2.3 {
  starts 4 2009/07/16 12:54:39;
  ends 4 2009/07/16 13:54:39;
  cltt 4 2009/07/16 12:54:39;
  binding state active;
  next binding state free;
  hardware ethernet 00:50:56:c0:00:01;
  uid "\001\000PV\300\000\001";
  client-hostname "Yasin-PC";
}

问题是我想将整个表分配到租用后用ip地址索引的数组中..XX.XX.XX.XX {...&会有重复的键,但值会有所不同,所以我需要解决这个问题......

你有什么建议我节省时间为此建立一个好的正则表达式? posix或pcre或逐行阅读?

&安培;我不能确定目标租约表将以相同的格式。 也许有些时候我会期待更多的线条。

2 个答案:

答案 0 :(得分:1)

我认为你可以这样做:

<?php
$order_fields = array('starts', 'ends', 'cltt', 'binding state', 'next binding state', 'hardware ethernet', 'uid', 'client-hostname');
$fields_regexp = '';
foreach ($order_fields as $field)
{
    $fields_regexp .= "\s*".$field." (.*)";
}
$regexp = '/lease (\b(?:[0-9]{1,3}\.){3}[0-9]{1,3}\b) \{'.$fields_regexp.'\s*\}/m';
preg_match_all($regexp, $string, $result, PREG_PATTERN_ORDER);

$arr = array();
foreach ($result[1] as $i => $match) {
    $cont = count($arr[$match]);
    $arr[$match][$cont]['raw'] = $result[0][$i];
    $arr[$match][$cont]['ip'] = $match;
    foreach ($order_fields as $pos => $field)
    {
        $arr[$match][$cont][$field] = $result[$pos + 2][$i];
    }
}
print_r($arr);
?>

示例输出:

Array
(
    [172.17.2.3] => Array
        (
            [0] => Array
                (
                    [raw] => lease 172.17.2.3 {
  starts 4 2009/07/16 11:54:39;
  ends 4 2009/07/16 12:54:39;
  cltt 4 2009/07/16 11:54:39;
  binding state active;
  next binding state free;
  hardware ethernet 00:50:56:c0:00:01;
  uid "�PVÀ�";
  client-hostname "Yasin-PC";
}
                    [ip] => 172.17.2.3
                    [starts] => 4 2009/07/16 11:54:39;
                    [ends] => 4 2009/07/16 12:54:39;
                    [cltt] => 4 2009/07/16 11:54:39;
                    [binding state] => active;
                    [next binding state] => free;
                    [hardware ethernet] => 00:50:56:c0:00:01;
                    [uid] => "�PVÀ�";
                    [client-hostname] => "Yasin-PC";
                )

            [1] => Array
                (
                    [raw] => lease 172.17.2.3 {
  starts 4 2009/07/16 12:24:39;
  ends 4 2009/07/16 13:24:39;
  cltt 4 2009/07/16 12:24:39;
  binding state active;
  next binding state free;
  hardware ethernet 00:50:56:c0:00:01;
  uid "�PVÀ�";
  client-hostname "Yasin-PC";
}
                    [ip] => 172.17.2.3
                    [starts] => 4 2009/07/16 12:24:39;
                    [ends] => 4 2009/07/16 13:24:39;
                    [cltt] => 4 2009/07/16 12:24:39;
                    [binding state] => active;
                    [next binding state] => free;
                    [hardware ethernet] => 00:50:56:c0:00:01;
                    [uid] => "�PVÀ�";
                    [client-hostname] => "Yasin-PC";
                )

            [2] => Array
                (
                    [raw] => lease 172.17.2.3 {
  starts 4 2009/07/16 12:54:39;
  ends 4 2009/07/16 13:54:39;
  cltt 4 2009/07/16 12:54:39;
  binding state active;
  next binding state free;
  hardware ethernet 00:50:56:c0:00:01;
  uid "�PVÀ�";
  client-hostname "Yasin-PC";
}
                    [ip] => 172.17.2.3
                    [starts] => 4 2009/07/16 12:54:39;
                    [ends] => 4 2009/07/16 13:54:39;
                    [cltt] => 4 2009/07/16 12:54:39;
                    [binding state] => active;
                    [next binding state] => free;
                    [hardware ethernet] => 00:50:56:c0:00:01;
                    [uid] => "�PVÀ�";
                    [client-hostname] => "Yasin-PC";
                )

        )

)

答案 1 :(得分:0)

你也可以解析这个问题。逐行读取并将当前状态保存在变量中。当您使用lease ...lease子句匹配时,请将$inLease设置为true,并将其余行作为当前{的参数处理{1}}直到你点击大括号lease,等等。

在这种情况下,正则表达式可以帮助您,但只是简单的逐行解析并不困难。考虑到数据格式的静态性,正则表达式对此只会过度。