Question

对于手写数字识别，我已经使用MNIST数据库培训了我的CNN。

现在，我必须将输入图像处理为MNIST格式以检测数字。

对于以下图片：

我正在使用轮廓来分离3和8，以便检测和写入＆＃34; 38＆＃34;我需要先发送3然后再发送8到CNN。

但是来自cv2.findContours（）的轮廓顺序令人困惑，有时会先检测到8，然后是3，然后才会检测到＃34; 83＆＃34;。

如何从左边开始写入轮廓，同样从上到下？

Answer 1

你可以做的是围绕你的两个黑色数字创建边界矩形，然后从最左边的矩形迭代，并将该矩形中的所有轮廓发送到你的CNN。如果您以this为例并从上到下进行操作，则会有两个单独的矩形：

该链接的示例代码：

contours, hier = cv2.findContours(gray,cv2.RETR_LIST,cv2.CHAIN_APPROX_SIMPLE)
for cnt in contours:
    if 200<cv2.contourArea(cnt)<5000:
        cv2.drawContours(img,[cnt],0,(0,255,0),2)
        cv2.drawContours(mask,[cnt],0,255,-1)

然后，当您有一组轮廓时，您可以按顺序对它们进行排序：

import numpy as np

c = np.load(r"rect.npy")
contours = list(c)

# Example - contours = [(287, 117, 13, 46), (102, 117, 34, 47), (513, 116, 36, 49), (454, 116, 32, 49), (395, 116, 28, 48), (334, 116, 31, 49), (168, 116, 26, 49), (43, 116, 30, 48), (224, 115, 33, 50), (211, 33, 34, 47), ( 45, 33, 13, 46), (514, 32, 32, 49), (455, 32, 31, 49), (396, 32, 29, 48), (275, 32, 28, 48), (156, 32, 26, 49), (91, 32, 30, 48), (333, 31, 33, 50)] 

max_width = np.sum(c[::, (0, 2)], axis=1).max()
max_height = np.max(c[::, 3])
nearest = max_height * 1.4

contours.sort(key=lambda r: (int(nearest * round(float(r[1])/nearest)) * max_width + r[0]))

for x, y, w, h in contours:
    print "{:4} {:4} {:4} {:4}".format(x, y, w, h)

取自 - here

在订单中获取轮廓

1 个答案: