Question

我有一组训练和测试图像，正在使用Mask RCNN模型进行训练。输入注释数据采用Pascal VOC格式，如下所示：

<object>
        <name>person</name>
        <pose>Unspecified</pose>
        <truncated>1</truncated>
        <difficult>0</difficult>
        <bndbox>
            <xmin>1</xmin>
            <ymin>99</ymin>
            <xmax>304</xmax>
            <ymax>500</ymax>
        </bndbox>
    </object>

图像中每个人的坐标

现在，我的模型输出中包含图像中每个人的类ID 我用下面的代码得到总数：print（np.sum（r ['class_ids']））

现在我想比较20张图像的实际值和上述每张图像的classid之和所以我的代码结构会像这样

For imageid in testdataset
    a= count (occurence of person in annotation file for that image)
    b= np.sum(r['class_ids']
    print imageid (or filename),a,b

尽管我已经编写了结构，但是我无法用python编写正确的代码。有人可以帮忙吗？

Python中图像识别的测试准确性

0 个答案: