Question

我有一个包含大量图片的文件夹。我正在尝试编写一个python脚本，它将遍历每个图像并返回宽度/高度，然后将其附加到我的字典中。我的字典的简化版本如下所示：

in_metadata = {123: {labels:[1,2]}, 234: {labels:[2,3]}}

我想要的是这个：

in_metadata = {123: {'labels':[1,2], 'bbox':[320,240,640,480]}, 234: {'labels':[2,3], 'bbox':[320,206,640,412]}, ...}

其中bbox = [center x，center y，w，h]

当代码进入循环的第一次迭代时，我得到：

stdout = '640,480'

这是我的期望。但是，第二次通过循环得到：

stdout = '640,480640,412'

第一个宽度和高度值未被刷新。这是我的代码：

command = ['identify', '-format', '%[fx:w],%[fx:h]']
for img_id, img_dict in in_metadata.iteritems():
    if 'bbox' in img_dict:
        continue
    command.append(srcdir + 'images/' + str(img_id) + '.jpg')
    p = Popen(command, stdout=PIPE, stderr=PIPE)
    stdout, stderr = p.communicate()
    if len(stderr) != 0:
        continue
    w, h = map(int, stdout.split(','))
    img_dict['bbox'] = [int(w / 2), int(h / 2), w, h]
    stdout.flush()

我一直都在努力尝试让它工作（p.wait，stdout.flush等），但缓冲区似乎不想空。我知道这很简单，我错过了什么？

感谢。

我在Ubuntu 16.04上使用python 2.7.12

Answer 1

每次迭代，都会附加command。我怀疑你并不是真的想要那个。考虑一下代码的简化版本：

labels = 'LABELS'
srcdir = 'SRCDIR/'
in_metadata = {123: {labels:[1,2]}, 234: {labels:[2,3]}}
command = ['identify', '-format', '%[fx:w],%[fx:h]']

for img_id, img_dict in in_metadata.iteritems():
    command.append(srcdir + 'images/' + str(img_id) + '.jpg')
    print command

输出：

['identify', '-format', '%[fx:w],%[fx:h]', 'SRCDIR/images/234.jpg']
['identify', '-format', '%[fx:w],%[fx:h]', 'SRCDIR/images/234.jpg', 'SRCDIR/images/123.jpg']

你可能想要更像这样的东西：

base_command = ['identify', '-format', '%[fx:w],%[fx:h]']

for img_id, img_dict in in_metadata.iteritems():
    command = base_command + [srcdir + 'images/' + str(img_id) + '.jpg']
    ...

Answer 2

如果您是从bash运行此功能，可以在运行脚本之前尝试设置PYTHONUNBUFFERED吗？

export PYTHONUNBUFFERED=true

使用子进程Popen和PIPE循环遍历python中的图像

2 个答案: