Question

我有一个html文件，我正在阅读以下行。我想只抓住＆＃39;之后出现的数字：＆＃39;在＆＃39;之前，＆＃39;使用REGEX ...感谢提前

"totalPages":15,"bloodhoundHtml"

Answer 1

"totalPages":([0-9]*),

您可以看到演示here

然后python代码是

import re

p = re.compile('"totalPages":([0-9]*),')
print p.findall('"totalPages":15,"bloodhoundHtml"')

Answer 2

您可以尝试:\d+,获取'：15，' 然后你可以先修剪'：'并修剪结束'，'以获得纯数字，我不知道python是否可以在正则表达式中使用变量，我是#programe，在c＃中，我可以使用:(?<id>\d+),匹配此字符串，并直接通过result.group["id"]

Answer 3

:\d{1,},

也适用于解析您提供的行。根据{{3}}帖子，您可能会在解析HTML

时遇到一些麻烦