Question

我正在关注Udacity的深度学习课程。我对pixel depth以及如何在下面的代码中使用它感到有些困惑：

image_size = 28  # Pixel width and height.
pixel_depth = 255.0  # Number of levels per pixel.

image_data = (ndimage.imread(image_file).astype(float) -
                      pixel_depth / 2) / pixel_depth

有人可以解释为什么我们在将图像读入N-d数组时正在进行pixel_depth / 2) / pixel_depth吗？

Answer 1

CV中的深度仅指数据类型。深度为255.0意味着每个像素为8位，依此类推。

pixel_depth / 2) / pixel_depth

这段代码起初看起来有点奇怪，但其目的是将图像标准化为-0.5到0.5的范围，这是简化图像处理的常用策略。

Answer 2

像素深度是像素可以采用的值的数量。对于8位图像，这是256（但它们在这里使用255）。

此处的代码用于将像素值标准化并居中到区间[-0.5,0.5]。

图像分类 - 像素深度是多少？

2 个答案: