by asynchronously what I mean to say is as you can see in the second screenshot, the address and phone details are getting mixed我有一个任务是使用带有一些特定属性的python脚本来解析pdf文件。我必须抓取first name
,last name
,address
和email
。我做了以下。
from PyPDF2 import PdfFileReader
f = open('CV_Smith.pdf', 'rb')
reader = PdfFileReader(f)
contents = reader.getPage(0).extractText()
f.close()
print(contents)
但是因为文本异步而且难以处理而出现问题。
给定pdf的截图。
提前谢谢。