我在使用pdftotext Python 3.x读取pdf时遇到问题

时间:2019-06-28 18:56:11

标签: ubuntu pdf python

代码:

import pdftotext

pdf_file ='/home/bortolossohurst/Documents/ambv_boot/selenium_spider.py/temp/pdf/arelpesquisainternetprecatorio.pdf' 

with open(pdf_file, 'rb') as f:
    pdf = pdftotext.PDF(f)
    text = "\n\n".join(pdf)

print(text)

错误:

Traceback (most recent call last):
  File "/home/bortolossohurst/Documents/ambv_boot/selenium_spider.py/src/teste.py", line 7, in <module>
    pdf = pdftotext.PDF(f)
pdftotext.Error: poppler error creating document

0 个答案:

没有答案