Question

我有一些场景文本图像，必须在其上应用二进制掩码以将输出图像设置为文本像素为1 和非文本像素为0 场景文本检测中的预处理阶段。

如何实现相同？

如果需要，我可以提供示例图像。

此处提供了示例图像：

Answer 1

这应该可以解决问题：

from PIL import Image

image_original = Image.open('kSZzc.png')
image_gray = image_original.convert("L") # Convert to grayscale image
image_gray_bw = image_gray.point(lambda pixel: 0 if pixel<128 else 255) # convert to binary image

如果图像的文本部分具有相对较亮的颜色，则将代码中的0和255的位置取反，以使文本颜色为黑色，而不是白色，就像第三张图像一样。

如何在文本图像上应用二进制掩码以在python中分离文本区域？

1 个答案: