官方TensorFlow Transfer Learning Tutorial提供了this source code,我正尝试理解以针对我的特定用例进行修改。
现在,我正在使用create_image_lists
方法。我在源代码中发现以下注释颇为神秘:
# We want to ignore anything after '_nohash_' in the file name when
# deciding which set to put an image in, the data set creator has a way of
# grouping photos that are close variations of each other. For example
# this is used in the plant disease data set to group multiple pictures of
# the same leaf.
hash_name = re.sub(r'_nohash_.*$', '', file_name)
此特定评论出现在line 201 in the source code上。我是TensorFlow的新手,希望有人可以回答以下问题:
data set creator
”吗?我正在寻找类似tf.data.Dataset
的答案。 data set creator
”到底在幕后做什么?data set creator
”的混淆操作如何影响从文件名中排除不可散列字符的决定?我非常感谢您对源代码中的上述注释的引用有任何新的见解。