我在 python 中使用 datefinder 模块,我需要从字符串中提取 DateTime,但我从只包含一个日期和时间的字符串中获取多个日期。
代码:
hi
(61) [{…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}, {…}]
输入:
import datefinder
def date_using_datefinder(date_string):
matches = datefinder.find_dates(date_string)
for match in matches:
print(match)
输出:
test3='''
[26/08/2018 06:58:29.126900]
[26/03/2004 06:58:29.126985][SDAP_CODEC][JET_AT_JUMP][43][HTXXHTXX] DUST_RPOP:QFI:9
[02/06/2003 06:58:29.254621][SDAP_CODEC][JET_AT_JUMP][43][HTXXHTXX] DUST_RPOP:QFI:9
[20/05/2022 06:58:29.124898][SDAP_CODEC][JET_AT_JUMP][43][HTXX] DUST_RPOP:QFI:9
[26/08/2020 06:58:29.136579][ALST][stx][29][ggg] JET_AT_JUMP:TRUX_MSGD_HTXX:13265261686865256:QWERT_DUMPING_TDD:45:DUST_RPOP_CVX:32:AXTP_DI:65576
'''
为什么会出现最后两个日期,我想这在字符串中没有任何地方。
PS:我也尝试过 DateUtil 模块,但它显示 ParseError。
仅供参考,代码为:
2018-08-26 06:58:29.126900
2004-03-26 06:58:29.126985
2003-02-06 06:58:29.254621
2022-05-20 06:58:29.124898
2020-08-26 06:58:29.136579
2045-08-02 00:00:00
2032-08-02 00:00:00
注意:在我的情况下使用正则表达式不起作用,因为我的日志行可以有随机模式和任何日期时间格式,或者我可以说不想使用正则表达式。
答案 0 :(得分:0)
让我帮自己
我创建了一个 python 库来完成我的任务,如果需要其他人也可以使用这个库。 试图涵盖大部分日期时间格式,并将更新更多。
是时候使用我们自己的库了
Installation -> pip install MyDateTimeLib==0.1.2
Importing as -> from MyDateTimeLib import myfunction
How to use? --> myfunction.date_find("passing date string")
Returns --> it return the dictionary containing all the dates from the string else null dict.
check on --> https://pypi.org/project/MyDateTimeLib/0.1.2/
演示:
data_for_date = '''
[26/08/2018 06:58:29.126900]
[26/03/2004 06:58:29.126985][SDAP_CODEC][JET_AT_JUMP][43][HTXXHTXX] DUST_RPOP:QFI:9
[02/06/2003 06:58:29.254621][SDAP_CODEC][JET_AT_JUMP][43][HTXXHTXX] DUST_RPOP:QFI:9[26/03/2036 06:58:29.126985]
[20/05/2022 06:58:29.124898][SDAP_CODEC][JET_AT_JUMP][43][HTXX] DUST_RPOP:QFI:9
[26/08/2020 06:58:29.136579][ALST][stx][29][ggg] JET_AT_JUMP:TRUX_MSGD_HTXX:13265261686865256:QWERT_DUMPING_TDD:45:DUST_RPOP_CVX:32:AXTP_DI:65576
'''
代码:
from MyDateTimeLib import myfunction
for x in data_for_date.splitlines():
if len(x)>1:
dic = myfunction.date_find(x)
print()
for k,v in dic.items():
print(k,v)
输出:
Date:0 2018-08-26 06:58:29.126900
Date:0 2004-03-26 06:58:29.126985
Date:0 2003-02-06 06:58:29.254621
Date:1 2036-03-26 06:58:29.126985
Date:0 2022-05-20 06:58:29.124898
Date:0 2020-08-26 06:58:29.136579