Question

我使用此处提供的PixelAnnotationTool：https://github.com/abreheret/PixelAnnotationTool并使用提供的字典对图像进行了注释：

{
    "labels": {
        "unlabeled": {
            "categorie": "void",
            "color": [
                0,
                0,
                0
            ],
            "id": 0,
            "id_categorie": 0,
            "name": "unlabeled"
        },
        "bicycle_motorcycle": {
            "categorie": "bicycle_motorcycle",
            "color": [
                119,
                11,
                32
            ],
            "id": 1,
            "id_categorie": 1,
            "name": "bicycle_motorcycle"
        },
        "bus": {
            "categorie": "bus",
            "color": [
                102,
                51,
                0
            ],
            "id": 2,
            "id_categorie": 2,
            "name": "bus"
        },

... }

我想将这些RGB蒙版转换为json多边形格式，以便可以在Mask R-CNN中使用它们。这该怎么做？

Answer 1

这是一个python函数，它将接收蒙版Image对象，并返回以RGB颜色为键的子蒙版字典。

from PIL import Image # (pip install Pillow)

def create_sub_masks(mask_image):
    width, height = mask_image.size

    # Initialize a dictionary of sub-masks indexed by RGB colors
    sub_masks = {}
    for x in range(width):
        for y in range(height):
            # Get the RGB values of the pixel
            pixel = mask_image.getpixel((x,y))[:3]

            # If the pixel is not black...
            if pixel != (0, 0, 0):
                # Check to see if we've created a sub-mask...
                pixel_str = str(pixel)
                sub_mask = sub_masks.get(pixel_str)
                if sub_mask is None:
                   # Create a sub-mask (one bit per pixel) and add to the dictionary
                    # Note: we add 1 pixel of padding in each direction
                    # because the contours module doesn't handle cases
                    # where pixels bleed to the edge of the image
                    sub_masks[pixel_str] = Image.new('1', (width+2, height+2))

                # Set the pixel value to 1 (default is 0), accounting for padding
                sub_masks[pixel_str].putpixel((x+1, y+1), 1)

    return sub_masks

有了口罩后，您可以使用imantics将其转换为COCO

将RGB蒙版图像转换为可可JSON多边形格式

1 个答案: