Question

我收藏了大约250张图像。它们都是从书本上扫描的，因此它们彼此之间都发生了一些移动或旋转。现在，我想对这些图像进行一些数据提取，但是为了自动执行此操作，所有图像中的所有位置都应该相同。这就是为什么我需要以某种方式对齐这些图像，以便所有图像中的所有位置相互对应。实现这一目标的最佳方法是什么？我认为openCV是执行此操作的最佳方法，但是我不确定应该如何开始。

以下是扫描图像的示例：

Answer 1

虚线可能是一个很好的锚点。

使用Morphological Transformation将虚线变为实线。
使用HoughLines
在蒙版上绘制线条

您可以使用遮罩对齐单独的图像。扫描蒙版的边缘以获取坐标，然后使用它们旋转和移动图像。我的意思是循环遍历蒙版第一行的值。第一个白色像素给出顶部中心坐标。面罩的其他面类似。您可以在不同图像上比较这些值，以确定移位和旋转。要应用这些转换，请阅读here。不过，这将需要做大量工作。也许有一个更简单的选择：

我可能是错的，但是似乎您想对齐页面，因此可以使用硬编码值提取图形。另一种更简单的方法是使用findContours创建“平铺”的子图像。然后可以对它们进行进一步处理。这在下面的代码中实现。

分离的子图像：

代码：

    import cv2
    import numpy as np  
    # load image
    img_large=cv2.imread("BAgla.jpg")
    # resize for ease of use
    img_ori = cv2.resize(img_large, None, fx=0.2, fy=0.2, interpolation= cv2.INTER_CUBIC)
    # create grayscale
    img = cv2.cvtColor(img_ori, cv2.COLOR_BGR2GRAY)
    # create mask for image size
    mask = np.zeros((img.shape[:2]),dtype=np.uint8)
    # do a morphologic close to merge dotted line
    kernel = np.ones((8,8))
    res = cv2.morphologyEx(img,cv2.MORPH_OPEN, kernel)
    # detect edges for houglines
    edges = cv2.Canny(res, 50,50)
    # detect lines
    lines = cv2.HoughLines(edges,1,np.pi/180,200)
    # draw detected lines
    for line in lines:
            rho,theta = line[0]
            a = np.cos(theta)
            b = np.sin(theta)
            x0 = a*rho
            y0 = b*rho
            x1 = int(x0 + 1000*(-b))
            y1 = int(y0 + 1000*(a))
            x2 = int(x0 - 1000*(-b))
            y2 = int(y0 - 1000*(a))

            cv2.line(mask,(x1,y1),(x2,y2),(255),2)
            cv2.line(img,(x1,y1),(x2,y2),(127),2)

    # invert the mask for use with findcontours
    mask_inv = cv2.bitwise_not(mask)
    # use findcontours to get the boundingboxes of the tiles
    contours, hier = cv2.findContours(mask_inv,cv2.RETR_EXTERNAL,cv2.CHAIN_APPROX_SIMPLE)
    boundingrects = []
    for cnt in contours:
            boundingrects.append(cv2.boundingRect(cnt))

    # findContours has no garanteed order, so sort array
    boundingrects.sort()

    # titles for window names / save file names
    title = ['Kaart', None, 'Smaakprofiel', 'Basiswaarden','Gelijkaardige bieren','Chemisch Profiel']

    # create images for top and bottom tiles
    for index in [0,2,3,5]:
            x,y,w,h = boundingrects[index]
            subimg = img_ori[y:y+h,x:x+w]
            cv2.imshow(title[index], subimg  )

    # combine middle tiles
    x1,y1,w1,h1 = boundingrects[1]
    x2,y2,w2,h2 = boundingrects[4]
    subimg = img_ori[y1:y2+h2,x1:x2+w2]
    cv2.imshow(title[4], subimg  )

    # display result
    cv2.imshow("Result", img  )
    cv2.imshow("Mask", mask  )
    cv2.waitKey(0)
    cv2.destroyAllWindows()

请注意，我使用的是缩小版的图片，因此在处理图片时要对此加以考虑。

使用openCV对齐图像

1 个答案: