Question

我得到了一个TFRecord数据文件filename = train-00000-of-00001，其中包含未知大小的图像以及其他信息。我知道我可以使用dataset = tf.data.TFRecordDataset(filename)打开数据集。

如何从该文件中提取图像以将其另存为numpy数组？

我也不知道TFRecord文件中是否还有其他信息，例如标签或分辨率。我如何获得这些信息？如何将它们另存为numpy数组？

我通常只使用numpy数组，并且不熟悉TFRecord数据文件。

Answer 1

1。）如何从该文件中提取图像以将其另存为numpy数组？

您正在寻找的是这个

record_iterator = tf.python_io.tf_record_iterator(path=filename)

for string_record in record_iterator:
  example = tf.train.Example()
  example.ParseFromString(string_record)

  print(example)

  # Exit after 1 iteration as this is purely demonstrative.
  break

2。）如何获取这些信息？

这里是官方的documentation。我强烈建议您阅读本文档，因为它逐步介绍了如何提取所需的值。

基本上，您必须将example转换为字典。因此，如果我想找出tfrecord文件中包含哪种信息，我会做类似的事情（在第一个问题中所述的代码中）：dict(example.features.feature).keys()

3。）如何将它们另存为numpy数组？

我将基于上述的for循环。因此，对于每个循环，它都会提取您感兴趣的值，并将它们附加到numpy数组中。如果需要，可以从这些数组创建一个熊猫数据框，并将其另存为csv文件。

但是...

您似乎有多个tfrecord文件... tf.data.TFRecordDataset(filename)返回用于训练模型的数据集。

因此，如果有多个tfrecords，则需要一个double for循环。外循环将遍历每个文件。对于该特定文件，内部循环将遍历所有tf.examples。

编辑：

转换为np.array（）

import tensorflow as tf
from PIL import Image
import io

for string_record in record_iterator:
  example = tf.train.Example()
  example.ParseFromString(string_record)

  print(example)

  # Get the values in a dictionary
  example_bytes = dict(example.features.feature)['image_raw'].bytes_list.value[0]
  image_array = np.array(Image.open(io.BytesIO(example_bytes)))
  print(image_array)
  break

以上代码的来源：

Base code
转换bytes to PIL.JpegImagePlugin.JpegImageFile
从PIL.JpegImagePlugin.JpegImageFile to np.array转换

PIL的官方文档

编辑2：

import tensorflow as tf
from PIL import Image
import io
import numpy as np

# Load image
cat_in_snow  = tf.keras.utils.get_file(path, 'https://storage.googleapis.com/download.tensorflow.org/example_images/320px-Felis_catus-cat_on_snow.jpg')

#------------------------------------------------------Convert to tfrecords
def _bytes_feature(value):
  """Returns a bytes_list from a string / byte."""
  return tf.train.Feature(bytes_list=tf.train.BytesList(value=[value]))

def image_example(image_string):
  feature = {
      'image_raw': _bytes_feature(image_string),
  }
  return tf.train.Example(features=tf.train.Features(feature=feature))

with tf.python_io.TFRecordWriter('images.tfrecords') as writer:
    image_string = open(cat_in_snow, 'rb').read()
    tf_example = image_example(image_string)
    writer.write(tf_example.SerializeToString())
#------------------------------------------------------


#------------------------------------------------------Begin Operation
record_iterator = tf.python_io.tf_record_iterator(path to tfrecord file)

for string_record in record_iterator:
  example = tf.train.Example()
  example.ParseFromString(string_record)

  print(example)

  # OPTION 1: convert bytes to arrays using PIL and IO
  example_bytes = dict(example.features.feature)['image_raw'].bytes_list.value[0]
  PIL_array = np.array(Image.open(io.BytesIO(example_bytes)))

  # OPTION 2: convert bytes to arrays using Tensorflow
  with tf.Session() as sess:
      TF_array = sess.run(tf.image.decode_jpeg(example_bytes, channels=3))

  break
#------------------------------------------------------


#------------------------------------------------------Compare results
(PIL_array.flatten() != TF_array.flatten()).sum()
PIL_array == TF_array

PIL_img = Image.fromarray(PIL_array, 'RGB')
PIL_img.save('PIL_IMAGE.jpg')

TF_img = Image.fromarray(TF_array, 'RGB')
TF_img.save('TF_IMAGE.jpg')
#------------------------------------------------------

请记住，tfrecords只是存储信息以供张量流模型以有效方式读取的一种方式。
我使用PIL和IO从本质上将字节转换为图像。 IO会提取字节并将其转换为PIL.Image可以读取的file like object
是的，有一种纯粹的张量流方法：tf.image.decode_jpeg
是的，当您比较两个数组时，两种方法之间存在差异
您应该选择哪一个？如果您担心Tensorflow's github中所述的准确性，那么Tensorflow并非可行之道：“ jpeg解码的TensorFlow选择默认值为IFAST，这会牺牲图像质量以提高速度”。此信息的信誉属于此post

Tensorflow：加载未知的TFRecord数据集

1 个答案: