在字符串中的下一个哈希标记之前的哈希标记之后的preg_match_all

时间:2012-06-21 12:30:02

标签: php regex preg-match-all hashtag

我想找到(如果它们存在的话)任何哈希标签,并获得每个标签的第一次共同加上后面的文本,但是如果另一个标签存在,则在下一个标签之前。

*并非所有消息字符串都带有哈希标记!

以下是我要做的事情:

发送到脚本的可能字符串示例列表:

1)$ message ='添加了一些新内容';

2)$ message =' #BALANCE X'的平衡移动;

3)$ message =' #CHANGE 一些日志文字 #FIX 其他一些日志文字';

$ num = prereg_match_all('@ ????? @',$ message,$ matches);

这是我从匹配中瞄准的数组结构结果:

示例1的结果

(
[0] => Array
    (
        [0] => Added some new stuff
    )
)

示例2的结果

(
[0] => Array
    (
        [0] => balanced movement of X
        [1] => #BALANCE
    )
)

示例3的结果

(
[0] => Array
    (
        [0] => some log text
        [1] => #CHANGE
    )
[1] => Array
    (
        [0] => some other log text
        [1] => #FIX
    )
)

让我疯狂的事情试图在REGEX上找到合适的文档

3 个答案:

答案 0 :(得分:0)

试试这段代码:

$msg = '#CHANGE some log text #FIX some other log text';
$msg = preg_replace('/(#[\w]+)(\s+)/', "\n\n$1\n", $msg);

foreach (explode("\n\n", trim($msg)) as $k => $v) {
    $res[$k] = array_reverse(explode("\n", $v));
}

print_r($res);
/*
Array
(
    [0] => Array
        (
            [0] => some log text 
            [1] => #CHANGE
        )

    [1] => Array
        (
            [0] => some other log text
            [1] => #FIX
        )

)
*/

答案 1 :(得分:0)

对所有测试用例尝试类似的方法:

$messages[] = 'Added some new stuff';
$messages[] = '#BALANCE balanced movement of X';
$messages[] = '#CHANGE some log text #FIX some other log text';

foreach( $messages as $message) {
    preg_match_all( '~(#\w+)?\s*([\w\s]+)~i', $message, $matches);
    // var_dump( $matches);
    echo "Message: " . $message . "\n";
    $count = strlen( $matches[1][0]);
    if( $count == 0) {
        // No hash tags
        echo "No hash tags, so the match string text is: " . $matches[2][0] . "\n";
    } else {
        for( $i = 0; $i < count( $matches[1]); $i++) {
            echo "\t Hash tag $i\n";
            echo "\t\t - Tag: ".$matches[1][$i]." Value: ".$matches[2][$i]."\n";
        }
    }
}

输出:

Message: Added some new stuff
No hash tags, so the match string text is: Added some new stuff

Message: #BALANCE balanced movement of X
     Hash tag 0
         - Tag: #BALANCE Value: balanced movement of X

Message: #CHANGE some log text #FIX some other log text
     Hash tag 0
         - Tag: #CHANGE Value: some log text 
     Hash tag 1
         - Tag: #FIX Value: some other log text

See it work

答案 2 :(得分:0)

我得到了它!经过3个小时的战斗,BSDNOOBZ得到了最接近的答案......

给它以下情况:

$messages[] = 'Blla vajfkj asfhkha asfha lskahfa';
$messages[] = '#CHANGE reduced sentry health/armor to 325/100 and made them easier to repair with welders #FIX reduced sentry build time to 6 seconds (down from 10)';
foreach($messages AS $message)
{
    $num = preg_match_all('@(\#\w+)([^#]+)@', $message, $matches, PREG_SET_ORDER);
    if($num > 0)
    {
        print_r($matches);
    } else {
        echo $message;
    }
}

结果:

string Blla vajfkj asfhkha asfha lskahfa

Array ( 
    [0] => Array 
    ( 
        [0] => #CHANGE reduced sentry health/armor to 325/100 and made them easier to repair with welders 
        [1] => #CHANGE 
        [2] => reduced sentry health/armor to 325/100 and made them easier to repair with welders 
    ) 
    [1] => Array 
    ( 
        [0] => #FIX reduced sentry build time to 6 seconds (down from 10) 
        [1] => #FIX 
        [2] => reduced sentry build time to 6 seconds (down from 10) 
    )
)