我需要用php regex解析dhcp leases表。 但问题是它包含正则表达式中使用的不同字符。
这里是示例输出
lease 172.17.2.3 {
starts 4 2009/07/16 11:54:39;
ends 4 2009/07/16 12:54:39;
cltt 4 2009/07/16 11:54:39;
binding state active;
next binding state free;
hardware ethernet 00:50:56:c0:00:01;
uid "\001\000PV\300\000\001";
client-hostname "Yasin-PC";
}
lease 172.17.2.3 {
starts 4 2009/07/16 12:24:39;
ends 4 2009/07/16 13:24:39;
cltt 4 2009/07/16 12:24:39;
binding state active;
next binding state free;
hardware ethernet 00:50:56:c0:00:01;
uid "\001\000PV\300\000\001";
client-hostname "Yasin-PC";
}
lease 172.17.2.3 {
starts 4 2009/07/16 12:54:39;
ends 4 2009/07/16 13:54:39;
cltt 4 2009/07/16 12:54:39;
binding state active;
next binding state free;
hardware ethernet 00:50:56:c0:00:01;
uid "\001\000PV\300\000\001";
client-hostname "Yasin-PC";
}
问题是我想将整个表分配到租用后用ip地址索引的数组中..XX.XX.XX.XX {...&会有重复的键,但值会有所不同,所以我需要解决这个问题......
你有什么建议我节省时间为此建立一个好的正则表达式? posix或pcre或逐行阅读?
&安培;我不能确定目标租约表将以相同的格式。 也许有些时候我会期待更多的线条。
答案 0 :(得分:1)
我认为你可以这样做:
<?php
$order_fields = array('starts', 'ends', 'cltt', 'binding state', 'next binding state', 'hardware ethernet', 'uid', 'client-hostname');
$fields_regexp = '';
foreach ($order_fields as $field)
{
$fields_regexp .= "\s*".$field." (.*)";
}
$regexp = '/lease (\b(?:[0-9]{1,3}\.){3}[0-9]{1,3}\b) \{'.$fields_regexp.'\s*\}/m';
preg_match_all($regexp, $string, $result, PREG_PATTERN_ORDER);
$arr = array();
foreach ($result[1] as $i => $match) {
$cont = count($arr[$match]);
$arr[$match][$cont]['raw'] = $result[0][$i];
$arr[$match][$cont]['ip'] = $match;
foreach ($order_fields as $pos => $field)
{
$arr[$match][$cont][$field] = $result[$pos + 2][$i];
}
}
print_r($arr);
?>
示例输出:
Array
(
[172.17.2.3] => Array
(
[0] => Array
(
[raw] => lease 172.17.2.3 {
starts 4 2009/07/16 11:54:39;
ends 4 2009/07/16 12:54:39;
cltt 4 2009/07/16 11:54:39;
binding state active;
next binding state free;
hardware ethernet 00:50:56:c0:00:01;
uid "�PVÀ�";
client-hostname "Yasin-PC";
}
[ip] => 172.17.2.3
[starts] => 4 2009/07/16 11:54:39;
[ends] => 4 2009/07/16 12:54:39;
[cltt] => 4 2009/07/16 11:54:39;
[binding state] => active;
[next binding state] => free;
[hardware ethernet] => 00:50:56:c0:00:01;
[uid] => "�PVÀ�";
[client-hostname] => "Yasin-PC";
)
[1] => Array
(
[raw] => lease 172.17.2.3 {
starts 4 2009/07/16 12:24:39;
ends 4 2009/07/16 13:24:39;
cltt 4 2009/07/16 12:24:39;
binding state active;
next binding state free;
hardware ethernet 00:50:56:c0:00:01;
uid "�PVÀ�";
client-hostname "Yasin-PC";
}
[ip] => 172.17.2.3
[starts] => 4 2009/07/16 12:24:39;
[ends] => 4 2009/07/16 13:24:39;
[cltt] => 4 2009/07/16 12:24:39;
[binding state] => active;
[next binding state] => free;
[hardware ethernet] => 00:50:56:c0:00:01;
[uid] => "�PVÀ�";
[client-hostname] => "Yasin-PC";
)
[2] => Array
(
[raw] => lease 172.17.2.3 {
starts 4 2009/07/16 12:54:39;
ends 4 2009/07/16 13:54:39;
cltt 4 2009/07/16 12:54:39;
binding state active;
next binding state free;
hardware ethernet 00:50:56:c0:00:01;
uid "�PVÀ�";
client-hostname "Yasin-PC";
}
[ip] => 172.17.2.3
[starts] => 4 2009/07/16 12:54:39;
[ends] => 4 2009/07/16 13:54:39;
[cltt] => 4 2009/07/16 12:54:39;
[binding state] => active;
[next binding state] => free;
[hardware ethernet] => 00:50:56:c0:00:01;
[uid] => "�PVÀ�";
[client-hostname] => "Yasin-PC";
)
)
)
答案 1 :(得分:0)
你也可以解析这个问题。逐行读取并将当前状态保存在变量中。当您使用lease ...
与lease
子句匹配时,请将$inLease
设置为true
,并将其余行作为当前{的参数处理{1}}直到你点击大括号lease
,等等。
在这种情况下,正则表达式可以帮助您,但只是简单的逐行解析并不困难。考虑到数据格式的静态性,正则表达式对此只会过度。