我有一个读取文件行的脚本..而且有些行包含错误消息..所以我做了一个循环(这里只是一行)找到这些行并提取消息:
import re
data = "15:31:17 TPP E Line 'MESSAGE': There is a technical problem in the server."
if (re.findall(".*E Line.*",data)):
err = re.match(r'\'MESSAGE\':\s(*$)',data)
print err
我执行此脚本时遇到错误:/我希望它返回:
There is a technical problem in the server
答案 0 :(得分:4)
如果它们都遵循相同的格式,则不需要正则表达式:
>>> data = "15:31:17 TPP E Line 'MESSAGE': There is a technical problem in the server."
>>> data.rsplit(':', 1)[1]
' There is a technical problem in the server.'
但如果你必须使用它们......
>>> data = "15:31:17 TPP E Line 'MESSAGE': There is a technical problem in the server."
>>> ms = re.search(r"'MESSAGE': (.*)$", data)
>>> ms.group(1)
'There is a technical problem in the server.'
如果您愿意,也可以提取其他信息:
>>> ms = re.match(r"(\d\d:\d\d:\d\d)\s+(\S+)\s+(\S+)\s+Line\s+'MESSAGE':\s+(.*)", data)
>>> ms.groups()
('15:31:17', 'TPP', 'E', 'There is a technical problem in the server.')
答案 1 :(得分:1)
试试这个:
import re
data = "15:31:17 TPP E Line 'MESSAGE': There is a technical problem in the server."
r = re.compile("^.*E Line.*'MESSAGE':[ ]*([^ ].*)$")
m = r.match(data)
if m:
err = m.group(1)
print(err)
当然你应该在循环之外编译正则表达式。