
I'm using Pytesseract to try to detect the names of the countries on this map:

import pytesseract
from pytesseract import Output

# So far psm=11 gives the best results
# A grayscale image gives approximately the same results
data = pytesseract.image_to_data('tmp/tesseract_input.png', config="--oem 3 --psm 11", lang='eng', output_type=Output.DICT)

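The results are not satisfactory. Tesseract seems to be designed for recognizing long passages of text, whereas here I need it to recognize short labels.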

How can I improve the OCR results?

Ideas for improvement:

Provide the font size to the algorithm
Provide the font family, since it does not change
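
As far as I know, Tesseract has no option for passing in a font size directly. A rough stand-in for the first idea is to upscale the image so the labels are rendered larger, then discard low-confidence detections. The sketch below is only an assumption about what might help: the helper names, the scale factor of 2 and the confidence threshold of 60 are made up for illustration and would need tuning on the actual map.

```python
def keep_confident_words(data, min_conf=60.0):
    """Filter an image_to_data dict down to non-empty, confident words.

    Returns a list of (text, left, top, width, height) tuples.
    """
    words = []
    for i, raw in enumerate(data["text"]):
        text = raw.strip()
        # conf can be a string or a number depending on the pytesseract
        # version; structural (non-word) rows carry a confidence of -1.
        conf = float(data["conf"][i])
        if text and conf >= min_conf:
            words.append((text, data["left"][i], data["top"][i],
                          data["width"][i], data["height"][i]))
    return words


def ocr_map_labels(path, scale=2, min_conf=60.0):
    """Hypothetical helper: upscale the map, run OCR, keep confident words.

    The returned boxes are in the coordinates of the upscaled image.
    """
    # Imported here so keep_confident_words works without the OCR stack.
    from PIL import Image
    import pytesseract
    from pytesseract import Output

    img = Image.open(path).convert("L")  # grayscale
    img = img.resize((img.width * scale, img.height * scale), Image.LANCZOS)
    data = pytesseract.image_to_data(img, config="--oem 3 --psm 11",
                                     lang="eng", output_type=Output.DICT)
    return keep_confident_words(data, min_conf)
```

Calling ocr_map_labels('tmp/tesseract_input.png') would return only the words Tesseract is reasonably sure about. Dividing each box value by scale maps it back onto the original image.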

Pytesseract short text

0 answers: