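Here is an image of the xyz.txt file. I am trying to extract only skills such as JAVA and Python from an entire resume and list them as "skills":["JAVA","Python"]. But when I extract the text, I get every word in the array, as in ["skills","JAVA","and","Python"]. How can I extract only the programming languages and drop all the other text?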
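import re

txt = 'xyz.txt'
# Read the whole resume file into one string
with open(txt) as f:
    text = f.read()
# Split on whitespace; this keeps every word, not just the skills
content = re.split(r"\s", text)
text_file_doc = {"file_name": txt, "contents": content}
print(text_file_doc)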
I want the result to be {"content":["skills":["JAVA","Python"]]}
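One possible direction, sketched here under assumptions rather than as a definitive answer: splitting on whitespace cannot tell a skill from any other word, so the tokens have to be filtered against a list of known skill names. In the sketch below, `KNOWN_SKILLS` and `extract_skills` are hypothetical names introduced for illustration, and the vocabulary is a made-up sample. The desired result as written is not a valid Python literal (a list cannot hold a key/value pair), so the sketch assumes the intended shape is {"content": {"skills": [...]}}.

```python
import re

# Hypothetical vocabulary (an assumption): lowercase token -> display name.
# Extend it with whatever skills you want to recognise.
KNOWN_SKILLS = {"java": "JAVA", "python": "Python", "c++": "C++"}

def extract_skills(text):
    """Return the known skills found in text, in order of first appearance."""
    found = []
    # Tokens are runs of letters/digits plus '+' and '#', so "C++" stays whole.
    for token in re.findall(r"[A-Za-z0-9+#]+", text):
        skill = KNOWN_SKILLS.get(token.lower())
        if skill and skill not in found:
            found.append(skill)
    return found

sample = "skills JAVA and Python"
result = {"content": {"skills": extract_skills(sample)}}
print(result)  # {'content': {'skills': ['JAVA', 'Python']}}
```

To use it on the file, pass the `text` read from xyz.txt in place of `sample`. A fixed vocabulary only finds the skills it lists; words like "and" or "skills" are dropped simply because they are not in it.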