Question

我已经发现如何使用tesseract输出它用于OCR的tif。但是，此tif始终称为tessinput.tif，如果我在文件夹中有多个文档，那么只显示最后创建的tessinput.tif。多层tif也是如此。

有没有办法让tesseract输出文件夹中每个文档的输入tif文件，例如： G。将_1附加到其名称或其他内容？

修改

好吧，到目前为止，我添加了tessedit_write_images=1配置参数。哪个，看哪，输出tessinput.tif ...但我不知道，如果你能以某种方式改变该文件的名称

Answer 1

输出tessinput.tif后，等待文件写入，然后重命名并处理下一个图像。

Answer 2

我使用Tesseract（4.0）识别单个图像中的多行字符。在这里，我建议一种简化的方法，以适当的格式保存所有tessinput.tif文件，然后再仔细检查输出：

import os
import pytesseract

config = '-l eng --oem 3 --psm 7 --dpi 600 -c tessedit_write_images=true'

'''
in my use case, I extracted lines contours from the image, stored coordinates for
each line before reading the line with tesseract. Here I provide a simplified
solution with a list of 3 images, just for the example
'''

img = ['img1.png', 'img2.png', 'img3.png']     

for i in range(len(image)):
    pytesseract.image_to_string(img[i], config=config)
    os.system('mv tessinput.tif tessinput_{:03d}.tif'.format(i))

如何在tesseract-ocr中为每个文档生成一个input.tif？

2 个答案: