我为n = 5计算了高斯滤波器掩码nxn。看起来像这样:
int gauss5[] = {
1, 4, 7, 4, 1,
4, 20, 33, 20, 4,
7, 33, 55, 33, 7,
4, 20, 33, 20, 4,
1, 4, 7, 4, 1
};
int gauss5_summ = 331;
在opencl图像中计算后变暗。是什么原因?
size_t global_wblur[2];
size_t local_wblur[2];
global_wblur[0] = h;
global_wblur[1] = w;
local_wblur[0] = local_wblur[1] = 32;
err = clEnqueueNDRangeKernel(queue, cl_img_gaussian_blur, 2, NULL, global_wblur, local_wblur, 0, NULL, NULL);
if (err != CL_SUCCESS) {
fprintf(stderr, "error: clEnqueueNDRangeKernel() blur %d %s\n", err, cl_strerror(err));
exit(EXIT_FAILURE);
}
内核来源:
__kernel void cl_img_gaussian_blur(__global const uchar *gray, __global uchar *out, __global const uchar *gbox, uint n, uint sum, uint w, uint h)
{
int i, j, offset;
uint x, y, summ;
y = get_global_id(0);
x = get_global_id(1);
offset = n/2;
/* ignore border pixels
*/
if (y - offset < 0 || y + offset > h || x - offset < 0 || x + offset > w) {
out[y*w + x] = gray[y*w + x];
return;
}
summ = 0;
for (j = -offset; j <= offset; j++) {
for (i = -offset; i <= offset; i++) {
summ += gray[(y + j)*w + x + i]*gbox[(j + offset)*n + i + offset];
}
}
out[y*w + x] = summ/sum;
}
模糊图片:
黑暗(opencl) - https://github.com/apetrunev/imgalg-opencl/blob/master/dark.png
正常 - https://github.com/apetrunev/imgalg-opencl/blob/master/test-blur/out.png
答案 0 :(得分:2)
你说gbox数据初始化如下:
clEnqueueWriteBuffer(queue, gauss_buf, CL_FALSE, 0, 5*5, gauss5, 0, NULL, NULL);
这是错误的,因为你正在复制实际内存量的1/4。正确的方法是:
clEnqueueWriteBuffer(queue, gauss_buf, CL_FALSE, 0, 5*5*sizeof(cl_int), gauss5, 0, NULL, NULL);
否则,其余为0,导致输出值较低。