我正在尝试使用split函数在python中拆分记录,但无法实现实际结果。
以下是我的.txt
文件的内容:
10000 {(10000,200,300,A),(10000,200,300,B)},{(10000,200,300,C),(10000,200,300,D)}
10001 {(10001,200,300,E),(10001,200,300,F)},{(10001,200,300,G),(10001,200,300,H)}
这是所需的输出:
10000 10000,200,300,A
10000 10000,200,300,B
10000 10000,200,300,C
10000 10000,200,300,D
10001 10001,200,300,E
10001 10001,200,300,F
10001 10001,200,300,G
10001 10001,200,300,H
谢谢您的任何帮助。
答案 0 :(得分:2)
这里是获得所需结果的最简单方法,它只需要sub
包中的findall
和re
方法即可。
from re import sub, findall
string = """
10000 {(10000,200,300,A),(10000,200,300,B)},{(10000,200,300,C),(10000,200,300,D)}
10001 {(10001,200,300,E),(10001,200,300,F)},{(10001,200,300,G),(10001,200,300,H)}
"""
# our results go here
results = []
# loop through each line in the string
for line in string.split("\n"):
# get rid of leading and trailing whitespace
line = line.strip()
# ignore empty lines
if len(line) > 0:
# get the line's id
id = line.split("{")[0].strip()
# get all values wrapped in parenthesis
for match in findall("(\(.*?\))", string):
# add the string to the results list
results.append("{} {}".format(id, sub(r"\{|\}", "", match)))
# display the results
print(results)
以下是函数形式中的相同代码:
from re import sub, findall
def get_records(string):
# our results go here
results = []
# loop through each line in the string
for line in string.split("\n"):
# get rid of leading and trailing whitespace
line = line.strip()
# ignore empty lines
if len(line) > 0:
# get the line's id
id = line.split("{")[0].strip()
# get all values wrapped in parenthesis
for match in findall("(\(.*?\))", string):
# add the string to the results list
results.append("{} {}".format(id, sub(r"\{|\}", "", match)))
# return the results list
return results
然后您将使用该功能,如下所示:
# print the results
print(get_records("""
10000 {(10000,200,300,A),(10000,200,300,B)},{(10000,200,300,C),(10000,200,300,D)}
10001 {(10001,200,300,E),(10001,200,300,F)},{(10001,200,300,G),(10001,200,300,H)}
"""))
祝你好运。