Question

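I am trying to extract the characters printed on an X-ray image. I tried pytesseract, but it did not work. I then used Canny edge detection to remove the noise and tried the extraction again, but I still cannot get the text/characters out. Could you please help or guide me in extracting them?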

Answer 1

If it is a DICOM file, you can use gdcm to read the attributes. gdcm can also be used from Python.
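As a rough sketch of what reading attributes with gdcm might look like from Python: the file name `xray.dcm`, the helper names, and the particular tags chosen are assumptions for illustration, not details from the question. The gdcm import sits inside the function so that the tag table can be inspected even without the bindings installed.

```python
# Standard DICOM tags (group, element) that often carry the same information
# that is burned into an X-ray as text. Which of them are actually populated
# depends on the file; this selection is only an illustrative assumption.
TAGS = {
    "PatientName": (0x0010, 0x0010),
    "PatientID": (0x0010, 0x0020),
    "StudyDate": (0x0008, 0x0020),
    "Modality": (0x0008, 0x0060),
    "BodyPartExamined": (0x0018, 0x0015),
    "ViewPosition": (0x0018, 0x5101),
}


def format_tag(group, element):
    """Render a tag in the usual (gggg,eeee) notation."""
    return "({:04x},{:04x})".format(group, element)


def read_attributes(path, tags=TAGS):
    """Return {name: value as string} for the requested tags found in the file."""
    import gdcm  # imported lazily; requires the GDCM Python bindings

    reader = gdcm.Reader()
    reader.SetFileName(path)
    if not reader.Read():
        raise IOError("could not read DICOM file: " + path)

    dicom_file = reader.GetFile()
    dataset = dicom_file.GetDataSet()
    string_filter = gdcm.StringFilter()
    string_filter.SetFile(dicom_file)

    values = {}
    for name, (group, element) in tags.items():
        tag = gdcm.Tag(group, element)
        # Skip tags that are absent from this particular file.
        if dataset.FindDataElement(tag):
            values[name] = string_filter.ToString(tag)
    return values


# Example call ("xray.dcm" is a placeholder file name):
#     for name, value in read_attributes("xray.dcm").items():
#         print(format_tag(*TAGS[name]), name, "=", value)
```

If the attributes are present, this avoids OCR entirely. Text that exists only as pixels in the image would still need one of the approaches in the other answers.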

Answer 2

Try the following approach to locate the text: https://www.pyimagesearch.com/2018/08/20/opencv-text-detection-east-text-detector/

Then, once you have found the text regions, you can isolate them and use tesseract to recognize them.
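The isolate-then-recognize step could be sketched roughly as follows. It assumes the detector (EAST or anything else) has already produced boxes as `(left, top, right, bottom)` pixel tuples. The helper names, the 8-pixel padding, and the `--psm 7` single-line setting are illustrative choices, not part of the linked tutorial.

```python
def pad_and_clip_box(box, image_width, image_height, pad=8):
    """Grow a (left, top, right, bottom) box by `pad` pixels, staying inside the image."""
    left, top, right, bottom = box
    return (
        max(0, left - pad),
        max(0, top - pad),
        min(image_width, right + pad),
        min(image_height, bottom + pad),
    )


def recognize_regions(image_path, boxes, pad=8):
    """Crop each detected box out of the image and run Tesseract on the crop."""
    # Imported lazily; requires Pillow, pytesseract, and a Tesseract install.
    from PIL import Image
    import pytesseract

    image = Image.open(image_path).convert("L")  # grayscale
    width, height = image.size

    texts = []
    for box in boxes:
        crop = image.crop(pad_and_clip_box(box, width, height, pad))
        # --psm 7 asks Tesseract to treat the crop as a single line of text.
        text = pytesseract.image_to_string(crop, lang="eng", config="--psm 7")
        texts.append(text.strip())
    return texts
```

A little padding around each box tends to help because tight detector boxes can cut off the edges of characters. The right amount depends on the image resolution.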

Answer 3

If the file is in 'png' or 'jpg' format, pytesseract is sufficient. Suppose image is the name of your image; then write the following code.

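from PIL import Image
import pytesseract

# Point pytesseract at the Tesseract executable (Windows install path; adjust for your system)
pytesseract.pytesseract.tesseract_cmd = r'C:/Program Files (x86)/Tesseract-OCR/tesseract.exe'

# Open the image and run OCR with the English language model
im = Image.open('F:/kush/invert.jpg')
text = pytesseract.image_to_string(im, lang='eng')
print(text)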

Extracting text/characters from an X-ray image using Python

3 answers: