我正在尝试匹配以"example.com/"
开头,后跟4-6位数字且下一个字符不是数字(如果有下一个字符)的URL。
例如,"example.com/12345"
应该匹配。
"example.com/1234567"
应该不匹配。
"example.com/123456g7"
应该匹配。
我尝试过"example.com/(\d{4,6}).*"
,但是当我给它"example.com/1234567"
时,这是不正确的。
如何解决此问题?
答案 0 :(得分:1)
另一种方法。
^example\.com/(\d{4,6})(?:\D.*)?$
答案 1 :(得分:1)
此表达式添加了其他边界,只是为了安全地传递所需的URL:
^(https?:\/\/(www.)?)(example\.com)\/(?:[0-9]{4,6})?([a-z].*)?$
如果愿意,可以缩小界限。在这里,我们可以添加几个捕获组以简化调用。
$
是使您不希望的URL输入失败的键。
如果这不是您想要的表达式,则可以在regex101.com中修改/更改表达式。
您还可以在jex.im中可视化您的表达式:
const regex = /^(https?:\/\/(www.)?)(example\.com)\/(?:[0-9]{4,6})?([a-z].*)?$/gm;
const str = `http://example.com/12345
https://example.com/123456g7
http://www.example.com/12345
https://www.example.com/123456g7
http://www.example.com/12345
https://www.example.com/123456g7
http://www.example.com/123456adfasdfasdf98989898
https://www.example.com/123456g7adfadfa0909009
http://example.com/1234567
https://example.com/1234567`;
let m;
while ((m = regex.exec(str)) !== null) {
// This is necessary to avoid infinite loops with zero-width matches
if (m.index === regex.lastIndex) {
regex.lastIndex++;
}
// The result can be accessed through the `m`-variable.
m.forEach((match, groupIndex) => {
console.log(`Found match, group ${groupIndex}: ${match}`);
});
}
# coding=utf8
# the above tag defines encoding for this document and is for Python 2.x compatibility
import re
regex = r"^(https?:\/\/(www.)?)(example\.com)\/(?:[0-9]{4,6})?([a-z].*)?$"
test_str = ("http://example.com/12345\n"
"https://example.com/123456g7\n"
"http://www.example.com/12345\n"
"https://www.example.com/123456g7\n"
"http://www.example.com/12345\n"
"https://www.example.com/123456g7\n"
"http://www.example.com/123456adfasdfasdf98989898\n"
"https://www.example.com/123456g7adfadfa0909009\n"
"http://example.com/1234567\n"
"https://example.com/1234567")
matches = re.finditer(regex, test_str, re.MULTILINE)
for matchNum, match in enumerate(matches, start=1):
print ("Match {matchNum} was found at {start}-{end}: {match}".format(matchNum = matchNum, start = match.start(), end = match.end(), match = match.group()))
for groupNum in range(0, len(match.groups())):
groupNum = groupNum + 1
print ("Group {groupNum} found at {start}-{end}: {group}".format(groupNum = groupNum, start = match.start(groupNum), end = match.end(groupNum), group = match.group(groupNum)))
# Note: for Python 2.7 compatibility, use ur"" to prefix the regex and u"" to prefix the test string and substitution.
答案 2 :(得分:0)