Question

我正在尝试使用leptonica处理以下图像，以使用tesseract提取文本。

原始图片： original image

原始图像上的Tesseract产生了这个：

i s l
D2J1FiiE-l191x1iitmwii9 uhiaiislz-2 Q ~37
Bottom linez
With a little time!
you can learn social media technology
using free online resources-
And if you donity
youlll be at a significant disadvantage
to
other HOn-pFOiiTS-

不太好，特别是顶级背景。所以使用leptionica我使用背景去除算法（模糊，差异，阈值，反转）来获得以下图像： processed image

但是tesseract并没有做得很好：

@@r-mair lkrm@W lh@w ilr@ mJs@ iklh@ ii@c2lhm1@ll
mm Mime
VWU1 a Mitt-Jle time-
@1m ll@@Wn Om @@@lh1
using free onhne resources-
Andifyoudoni
9110 ate a $0 D
to other non-profrts
I

似乎主要的问题是，现在所有的文字都是概述而不是固体。如何调整我的算法或我可以添加什么以使文本变为实体？

Answer 1

本文似乎提出了一种解决您问题的二值化方法：

T Kasar，J Kumar和A G Ramakrishnan。 Font and Background Color Independent Text Binarization。（2007）

Kasar etal method performance

使用leptonica进行OCR的图像处理（反色文本）

1 个答案: