我设法实现了一个函数,通过在这个问题中使用@ridgerunner建议的解决方案将驼峰案例转换为单词:
Split camelCase word into words with php preg_match (Regular Expression)
但是,我还想处理这样的嵌入式缩写:
'hasABREVIATIONEmbedded'转换为'嵌入ABREVIATION'
我提出了这个解决方案:
<?php
function camelCaseToWords($camelCaseStr)
{
// Convert: "TestASAPTestMore" to "TestASAP TestMore"
$abreviationsPattern = '/' . // Match position between UPPERCASE "words"
'(?<=[A-Z])' . // Position is after group of uppercase,
'(?=[A-Z][a-z])' . // and before group of lowercase letters, except the last upper case letter in the group.
'/x';
$arr = preg_split($abreviationsPattern, $camelCaseStr);
$str = implode(' ', $arr);
// Convert "TestASAP TestMore" to "Test ASAP Test More"
$camelCasePattern = '/' . // Match position between camelCase "words".
'(?<=[a-z])' . // Position is after a lowercase,
'(?=[A-Z])' . // and before an uppercase letter.
'/x';
$arr = preg_split($camelCasePattern, $str);
$str = implode(' ', $arr);
$str = ucfirst(trim($str));
return $str;
}
$inputs = array(
'oneTwoThreeFour',
'StartsWithCap',
'hasConsecutiveCAPS',
'ALLCAPS',
'ALL_CAPS_AND_UNDERSCORES',
'hasABREVIATIONEmbedded',
);
echo "INPUT";
foreach($inputs as $val) {
echo "'" . $val . "' translates to '" . camelCaseToWords($val). "'\n";
}
输出结果为:
INPUT'oneTwoThreeFour' translates to 'One Two Three Four'
'StartsWithCap' translates to 'Starts With Cap'
'hasConsecutiveCAPS' translates to 'Has Consecutive CAPS'
'ALLCAPS' translates to 'ALLCAPS'
'ALL_CAPS_AND_UNDERSCORES' translates to 'ALL_CAPS_AND_UNDERSCORES'
'hasABREVIATIONEmbedded' translates to 'Has ABREVIATION Embedded'
它按预期工作。
我的问题是: 我可以组合2个正则表达式$ abreviationsPattern和camelCasePattern 所以我可以避免两次运行preg_split()函数?
答案 0 :(得分:1)
这些总是有趣的难题要解决;我把案件缩小到两个:
检测以大写字母开头,后跟小写字母(但不以字边界或主题开头)开头的单词 - (?<!\b)[A-Z][a-z]+
检测从小写到大写的转换 - (?<=[a-z])[A-Z]
function camelFix($str)
{
return preg_replace_callback('/(?<!\b)[A-Z][a-z]+|(?<=[a-z])[A-Z]/', function($match) {
return ' '. $match[0];
}, $str);
}
它适用于您提供的输入;它可能会失败我没有预料到的情况:)