Question

我有这张图片。我试图OCR这张图片中的字母。我没有得到期望的结果＃9; 9＆＃39;和＆＃39; R＆＃39;首先，我裁剪了这些字母＆amp; 并执行以下命令。

tesseract 9.png stdout -psm 8
.

它刚刚回归＆＃34;。＆＃34;

所有其他字母的OCR工作正常但不是这两个字母（但我认为他们的图像质量并不差）。任何建议/帮助表示赞赏。

Answer 1

我自己没有使用tesseract的经验，但根据谷歌的结果，复制角色并在https://www.newocr.com/内部使用tesseract添加一些背景作品。

所以我用它作为输入：

在该网络应用上提供了正确的结果：99999999，而单个字符不起作用。也许您可以使用tesseract实现验证这一点，也许它可以帮助您调整孤立的提取字符以使用tesseract。例如尝试将提取的轮廓的多个副本彼此相邻拼接以提高tesseract输出 - 因为您知道将轮廓彼此相邻拼接的频率，如果它识别出相同的字符，则您知道输出可能是正确的经常... ..

同样适用于

边框看起来很重要，没有足够的边框就会识别P。一般来说，你应该尝试用纯黑色和纯白色代替背景和前景！不确定网络应用程序使用什么样的预处理...

此代码可用于使用C ++和OpenCV重复图像，但它不会添加边框。要做到这一点，你会非常相似，但有一些额外的步骤，你必须为边框指定一些颜色。

编辑：我已更新代码，在每个方向上使用4像素的边框（您可以调整变量）和黑色背景颜色。

这段代码非常简单，对于java，python等中的opencv应该非常相似。

int main(int argc, char * argv[])
{
    //cv::Mat input = cv::imread("../inputData/ocrR.png");

    if(argc != 3)
    {
        std::cout << "usage: .exe filename #Repetitions" << std::endl;
        return 0;
    }

    std::string filename = argv[1];
    int nRepetitions = atoi(argv[2]);

    cv::Mat inputImage = cv::imread(filename);
    if(inputImage.empty())
    {
        std::cout << "image file " << filename << " could not be loaded" << std::endl;
        return 0;
    }

    // you instead should try to extract the background color from the image (e.g. from the image border)
    cv::Scalar backgroundColor(0,0,0);

    // size of the border in each direction
    int border = 4;

    cv::Mat repeatedImage = cv::Mat(inputImage.rows + 2*border, nRepetitions*inputImage.cols + 2*border, inputImage.type() , backgroundColor);

    cv::Rect roi = cv::Rect(border,border,inputImage.cols, inputImage.rows);

    for(int i=0; i<nRepetitions; ++i)
    {
        // copy original image to subimage of repeated image
        inputImage.copyTo(repeatedImage(roi));

        // update roi position
        roi.x += roi.width;
    }

    // now here you could send your repeated image to tesseract library and test whether nRepetitions times a letter was found.

    cv::imwrite("repeatedImage.png", repeatedImage);
    cv::imshow("repeated image" , repeatedImage);
    cv::waitKey(0);
    return 0;
}

给出这个结果：

Answer 2

我比你更成功......我做了一个“连接组件分析”来提取单个字母，然后在每个提取的字母周围放一个边框并将它们全部加在一起一条水平线给了我这个：

如果我然后运行tesseract我得到：

VQQTRF

OCR：没有得到理想的结果

2 个答案: