Question

我有许多字母的图案图像字典，我还有一个必须被识别的位图！ “两个”图像的高度是一样的！某些图案图像的宽度不同。

如何迭代X轴并识别模式中的字母？

现在我正在使用此功能检查X位图列中是否包含黑色像素：

static Boolean GetColumnState(Bitmap bmp, int x)
{
    BitmapData pixelData = bmp.LockBits(
      new Rectangle(0, 0, bmp.Width, bmp.Height),
      ImageLockMode.ReadOnly,
      PixelFormat.Format32bppArgb);
    Boolean state = false;
    unsafe
    {
        int* pData = (int*)pixelData.Scan0.ToPointer();
        pData += x;
        for (int i = 0; i < bmp.Height; ++i)
        {
            pData += bmp.Width;
            if (Color.FromArgb(*pData) == Color.FromArgb(255, 0, 0, 0))
            {
                state = true;
                break;
                //pixelColumn[i] = Color.FromArgb(*pData);
            }
        }
    }
    bmp.UnlockBits(pixelData);

    return state;
}

如果GetColumnState()返回True，则裁剪出与图案图像大小相同的图像并进行比较。

        int y = target.Count;

        for (int i = 0; i < b1.Width; i++)
        {
            if (GetColumnState(b1, i + count) == true)
            {
                        int trWidth = target[5].Value.Width;
                        int trHeight = target[5].Value.Height;
                        Bitmap bitm = new Bitmap(trWidth, trHeight);
                        Rectangle section = new Rectangle(new Point(0, b1.Height - trHeight-1), new Size(trWidth, trHeight));
                        Bitmap cropped = CropImage(b1, section);
                        cropped.Save(@"C:\111.png");
                        target[5].Value.Save(@"C:\000.png");

                        if (CompareMemCmp(cropped, target[5].Value) == true)
                        {
                            //count = target[5].Value.Width;
                            textBox2.AppendText(target[5].Key);
                            break;
                        }
                        else { textBox2.AppendText("noo"); }

                //textBox1.Text = "yes!";
            }
            else
            {
                //textBox1.Text = "noo";
            }
            break;
        }

不幸的是，即使裁剪后的图像在视觉上看起来一样 - 它有不同的大小，所以memcmp（我的比较方法基于此）返回 false ..

必须识别的位图和图案图像都是BlackAndWhite颜色..我想知道是否有更可靠的方法来比较另一个Image中的一个图像并通过字典（OCR）返回其值..

Answer 1

正如您所发现的，基于像素值的模式匹配并不健全。如果字体是完全可预测且一致地渲染的，您可以通过标准化图像（对齐，缩放，旋转以匹配），计算图像之间的均方差，然后接受它是否足够小来完成此工作。

面对未知字体或不同的图像源（屏幕截图，相机，扫描等），您需要基于某种机器学习（ML）的更强大的方案。识别位图中的数字是神经网络的经典入门示例，但许多其他ML方案也可以工作 - 请参阅Supervised learning#Approaches_and_algorithms on Wikipedia。请注意，这将涉及培训，这意味着您需要一组好的多样化数据来进行培训。

在另一个图像中检测一个图像并返回它的值

1 个答案: