Question

我正在阅读有关布局分析的文章，我想使用Python和Opencv对该方法进行编码。

在本文中，他使用了一种连接组件方法来检测文本字母，然后做一些特征提取所有布局。

因此，我正在尝试在python上执行此操作，并且我使用了Opencv的connectedComponentsWithStats函数来获取已连接的组件。

image_gray = cv2.cvtColor(image, cv2.COLOR_BGR2GRAY)
image_threshold = cv2.threshold(image_gray, 0, 255, cv2.THRESH_BINARY + cv2.THRESH_OTSU)[1]

nb_components, output, stats, centroids = cv2.connectedComponentsWithStats(image_threshold, connectivity=8)

for i in range(0, nb_components):
    x_min = stats[i, 0]
    y_min = stats[i, 1]
    x_max = stats[i, 0] + stats[i, 2]
    y_max = stats[i, 1] + stats[i, 3]

    cv2.rectangle(self.image, (x_min, y_min), (x_max, y_max), (0, 255, 0), 3)

所以我想得到这样的结果：

all the connected components are detected

但是我得到这样的信息，其中仅检测到圆形分量：

just the round components are detected

原始图片

连接的组件以检测文本

0 个答案: