I am reading the attached PDF with pdfminer, but I cannot read it page by page.

for a in range(0, no_of_pages-1):
    for page in PDFPage.get_pages(fp, pagenos=[a], maxpages=maxpages, password=password, caching=caching, check_extractable=True):
        interpreter.process_page(page)
        text = retstr.getvalue()
        print "txt", text
        extract_pdf_data(text, a+1, path)
        retstr.flush()

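It works fine for page 1, but for page 2 it returns the text of pages 1 and 2 together. Page 6 is also not read correctly: text from column 1 appears in column 3.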

Please help me read the content one page at a time, without the PDF content being shuffled between pages or columns.
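
A possible cause of the repeated page 1 text, assuming `retstr` is a `StringIO` buffer (the question does not show how it is created): `flush()` does not empty a `StringIO`, so each `getvalue()` call returns everything written so far. The sketch below shows one way to empty the buffer after every page with `seek(0)` and `truncate(0)`. It uses plain strings as a stand-in for what `interpreter.process_page(page)` would write, and `take_page_text` is a hypothetical helper name, not part of pdfminer.

```python
import io


def take_page_text(buffer):
    """Return the text currently in the buffer, then empty it for the next page."""
    text = buffer.getvalue()
    buffer.seek(0)       # move the write position back to the start
    buffer.truncate(0)   # discard the old contents (flush() alone would keep them)
    return text


# Stand-in for the converter output: in the real loop, pdfminer writes into
# retstr when interpreter.process_page(page) is called.
# The u"" prefix keeps this runnable on Python 2.7 as well as Python 3.
simulated_pages = [u"page one text", u"page two text", u"page three text"]

retstr = io.StringIO()
per_page = []
for number, simulated_output in enumerate(simulated_pages, start=1):
    retstr.write(simulated_output)  # assumption: mimics process_page output
    per_page.append((number, take_page_text(retstr)))

for number, text in per_page:
    print(u"page %d: %s" % (number, text))
```

In the loop from the question, the same two calls would replace `retstr.flush()`. Separately, `range(0, no_of_pages-1)` stops one page early, so `range(no_of_pages)` may be what is intended. The column order on page 6 is a different issue that depends on pdfminer's layout analysis; adjusting the `LAParams` settings (for example `boxes_flow`) might help, but that is untested here.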

Reading a PDF page by page with pdfminer in Python 2.7

0 answers: