参数列表的长度(3)和CL生成的参数的数量(9)不一致

时间:2017-02-26 07:26:54

标签: python c kernel opencl pyopencl

xx_np = np.random.rand(16500).astype(np.float32)
u_np = np.random.rand(16500).astype(np.float32)
vector_np = np.random.rand(16500).astype(np.float32)
temp_np = np.random.rand(16500).astype(np.float32)
ss_np = np.random.rand(16500).astype(np.float32)
ctx= cl.Context([device])  #context
queue = cl.CommandQueue(ctx) #commandqueue
mf = cl.mem_flags #memoryflags
xx_g = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=xx_np)
u_g = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=u_np)
vector_g = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=vector_np)
temp_g = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=temp_np)
ss_g = cl.Buffer(ctx, mf.READ_ONLY | mf.COPY_HOST_PTR, hostbuf=ss_np)

q00=q0
prg = cl.Program(ctx, """
__kernel void dota( __global float *xx, __global float *u, __global float *vector, __global float *temp,__global float *ss, __global float *res, float q00, float s,float i)
{
    int index = get_global_id(0);
    int qindex= get_global_id(0);
    res[index] = xx[index]*u[index-1];
    res[index] = res[index]*u[index-1];
    xx[index]= xx[index]-res[index];
    float a=   pow(pow(xx[index],2)+pow(xx[index+1],2),0.5);
    float b=   pow(pow(vector[index],2)+pow(vector[index+1],2),0.5);
    if(b==0)
    {   i=vector[qindex]+q00;
        b=pow(pow(i,2)+pow(i,2),0.5);
    }       
    s=100*(a/b);
    ss[index]=s;
    temp[index]=ss[index];
}
""").build()
res_g = cl.Buffer(ctx, mf.WRITE_ONLY, temp_np.nbytes)
prg.dota(queue, temp_np.shape, None, temp_g, xx_g,res_g)
res_np = np.empty_like(temp_np)
cl.enqueue_copy(queue, res_np, res_g)
print res_np

我无法正确设置参数。因为我是内核代码的新手。不能让它自己工作。"参数列表长度(3)和CL生成的参数数量(9)不同意"。这是我在执行代码时得到的错误消息。

1 个答案:

答案 0 :(得分:1)

即使我从未使用过Python的OpenCL,我也可以告诉你在调用dota时没有提供足够的参数......

prg.dota(queue, temp_np.shape, None, temp_g, xx_g,res_g)

  1. temp_g
  2. xx_g
  3. res_g
  4. 虽然dota有这么多参数。