Question

我有一个大的二进制图像（4k x 7k像素），我想从中提取整个黄色部分作为一个矩形。我尝试了二元侵蚀来均衡黄色区域内的特征。然后我使用bbox的{{1}}方法，但对于具有一个大型bbox的大型图像，它似乎不够快。你有什么建议吗？

Answer 1

由于您提供的图像包含分散注意力的轴，并且颜色错误且太小，因此我在终端中使用 ImageMagick 尽可能创建了逼真的版本：

convert bbox.png -alpha off -crop 120x215+40+13 -colorspace gray -normalize -threshold 50% -scale 4200x7200\! bbox.png

完整尺寸为4200x7200。

然后，我编写了numpy的基于bbox的版本，如下所示

#!/usr/local/bin/python3
import numpy as np
from PIL import Image

def bbox(image):
    """Find bounding box of image"""
    # Project all columns into row same width as image
    proj=np.any(image,axis=0)
    # Find first non-zero value from Left
    L=np.argmax(proj)
    # And right
    R=image.shape[1]-np.argmax(np.flipud(proj))-1
    # Project all rows into column same height as image
    proj=np.any(image,axis=1)
    # Find first non-zero value from Top
    T=np.argmax(proj)
    # And Bottom
    B=image.shape[0]-np.argmax(np.flipud(proj))-1
    return T,L,B,R

image=np.array(Image.open("a.png").convert("L"))
print(bbox(image))

在Mac上运行时间为5.3毫秒。只是为了好玩，我将其穿线并在单独的平行线程上运行了水平投影和垂直投影，结果下降到3.6ms，结果相同。

#!/usr/local/bin/python3
import numpy as np
from PIL import Image

import threading
import queue

def DoOneDim(image,axis,q):
    """Find bounding box of image"""
    proj=np.any(image,axis=axis)
    # Find first non-zero value
    A=np.argmax(proj)
    # And and last
    B=image.shape[1-axis]-np.argmax(np.flipud(proj))-1
    q.put({axis:(A,B)})


def bboxTh(image):
    """Threaded version of bbox() that does vertical and horizontal on their own theads"""
    q = queue.Queue()
    Hthread=threading.Thread(target=DoOneDim, args=(image,0,q))
    Vthread=threading.Thread(target=DoOneDim, args=(image,1,q))
    Hthread.start()
    Vthread.start()
    Hthread.join()
    Vthread.join()
    results=dict()
    while not q.empty():
       results.update(q.get())
    return results

image=np.array(Image.open("a.png").convert("L"))
print(bboxTh(image))

已识别的框如下所示：

Answer 2

由于要查找单个边界框，请勿使用regionprops或任何每个对象的函数。这也使得您无需尝试从所有黄点中制作单个对象。

这里最简单的解决方案是遍历图像，并为每个像素确定其是否“足够黄”（无论对您的应用程序意味着什么）。如果是这样，请将像素的坐标添加到正在运行的边界框计算中。

边界框的计算非常简单：

top_left = [1e9, 1e9]
bottom_right = [0, 0]
for ...:
   # within your loop over the pixels, [x, y] are the current coordinates
   top_left = [min(top_left[0], x), min(top_left[1], y)];
   bottom_right = [max(bottom_right[0], x), max(bottom_right[1], y)];

使用skimage可能有一种无需循环的方法，但我根本不知道该程序包。

在大图像中绘制边界框

2 个答案: