Question

我刚开始使用mask_rcnn进行“深度学习”以进行实时视频屏蔽。经过一些讨论，使其工作仅用于捕获和掩盖背景。但这非常滞后，几乎可以在20秒内处理一帧。

如何提高性能？

广义模型可以提供帮助吗？

...

model = modellib.MaskRCNN(
mode="inference", model_dir=MODEL_DIR, config=config
)
model.load_weights(COCO_MODEL_PATH, by_name=True)

...

 for i in range(n_instances):
        if not np.any(boxes[i]):
            continue

        label = names[ids[i]]

        if label != "person":
            continue


        y1, x1, y2, x2 = boxes[i]
        color = class_dict[label]
        score = scores[i] if scores is not None else None
        caption = '{} {:.2f}'.format(label, score) if score else label
        mask = masks[:, :, i]
        print(boxes[i],label)

        image = apply_mask(image, mask, color)
        image = cv2.rectangle(image, (x1, y1), (x2, y2), color, 2)
        image = cv2.putText(
            image, caption, (x1, y1), cv2.FONT_HERSHEY_COMPLEX, 0.7, color, 2
        )
    return image

我需要一些建议...

使用Anaconda的Mask_RCNN，如何使用“ mask_rcnn_coco.h5”提高性能以检测实时摄像机供稿中的对象？

0 个答案: