How can I extract image "features" (not text) from Tesseract / PyTesseract?

Time: 2021-01-15 07:25:41

Tags: machine-learning deep-learning ocr tesseract python-tesseract

I can extract text from an image with PyTesseract, the Python binding for Tesseract, like this:

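import cv2
import pytesseract

# OpenCV loads images in BGR order, while pytesseract assumes RGB.
img = cv2.imread('image.jpg')
img = cv2.cvtColor(img, cv2.COLOR_BGR2RGB)

# --oem 3: default engine mode; --psm 6: assume a single uniform block of text.
custom_config = r'--oem 3 --psm 6'
text = pytesseract.image_to_string(img, config=custom_config)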

But I want to use Tesseract to extract features rather than text. Is that possible?

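Basically, my idea is to extract image features from an OCR model. There are many pretrained CNN models that work on images, but none of them are suited to images WITH text. The only model I know of that is available is Tesseract. If there are any pretrained deep learning models for OCR, please suggest them.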
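As far as I can tell, pytesseract does not expose the recognizer's internal activations. The closest thing it offers is image_to_data, which returns per-word bounding boxes and confidences. The sketch below turns that output into a small numeric vector. These are layout statistics, not learned CNN features, and the names layout_features, extract and min_conf are hypothetical helpers made up for illustration.

```python
from statistics import mean


def layout_features(data, min_conf=0.0):
    """Summarise a pytesseract image_to_data dict as a small numeric vector.

    `data` is the dict returned with output_type=Output.DICT; only the
    'text', 'conf', 'width' and 'height' columns are used here.
    Returns [word_count, mean_confidence, mean_width, mean_height].
    """
    words = []
    for text, conf, w, h in zip(data["text"], data["conf"],
                                data["width"], data["height"]):
        conf = float(conf)  # conf may be str, int or float depending on version
        # Non-word rows (page, block, line) have conf == -1 and empty text.
        if conf >= min_conf and text.strip():
            words.append((conf, w, h))
    if not words:
        return [0.0, 0.0, 0.0, 0.0]
    return [
        float(len(words)),
        float(mean(c for c, _, _ in words)),
        float(mean(w for _, w, _ in words)),
        float(mean(h for _, _, h in words)),
    ]


def extract(image_path, config=r"--oem 3 --psm 6"):
    # Imported lazily so layout_features works without Tesseract installed.
    import cv2
    import pytesseract
    from pytesseract import Output

    img = cv2.cvtColor(cv2.imread(image_path), cv2.COLOR_BGR2RGB)
    data = pytesseract.image_to_data(img, config=config,
                                     output_type=Output.DICT)
    return layout_features(data)
```

Calling extract('image.jpg') would give one four-element vector per image. Whether statistics this coarse are useful depends on the downstream task, so treat this as a starting point only.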

0 answers:

No answers