Question

在源代码的JCublas2.cublasSdot评论中，它评论说'result'参数可以是'主机或设备指针'。

 public static int cublasSdot(
    cublasHandle handle, 
    int n, 
    Pointer x, 
    int incx, 
    Pointer y, 
    int incy, 
    Pointer result)/** host or device pointer */
{
    return checkResult(cublasSdotNative(handle, n, x, incx, y, incy, result));
}

但是，我只能使用像Spinter.to（fs）这样的主机指针，浮点数[] fs = {0}。如果我使用像'CUdeviceptr devicePtr = new CUdeviceptr（）; JCudaDriver.cuMemAlloc（devicePtr，100 * Sizeof.FLOAT）;'，程序崩溃，如：

#
# A fatal error has been detected by the Java Runtime Environment:
#
#  EXCEPTION_ACCESS_VIOLATION (0xc0000005) at pc=0x000007fed93af2a3, pid=9376, tid=0x0000000000003a7c
# .....

最小化主机和设备之间的数据传输可节省时间。如何使用设备指针作为此方法的“结果”参数，以及其他带有结果指针的JCuda方法/ **主机或设备指针** /？

Answer 1

CUBLAS可以将某些计算的结果（如点积）写入主机或设备内存。必须使用cublasSetPointerMode显式设置目标内存类型。

JCublas2PointerModes示例中显示了如何使用此示例。

它曾将点积计算的结果写入主机内存（当没有明确设置指针模式时，这也是默认值）：

// Set the pointer mode to HOST
cublasSetPointerMode(handle, CUBLAS_POINTER_MODE_HOST);

// Prepare the pointer for the result in HOST memory
float hostResult[] = { -1.0f };
Pointer hostResultPointer = Pointer.to(hostResult);

// Execute the 'dot' function
cublasSdot(handle, n, deviceData, 1, deviceData, 1, hostResultPointer);

然后更改指针模式并再次调用该函数，这次将结果写入 device 内存：

cublasSetPointerMode(handle, CUBLAS_POINTER_MODE_DEVICE);

// Prepare the pointer for the result in DEVICE memory
Pointer deviceResultPointer = new Pointer();
cudaMalloc(deviceResultPointer, Sizeof.FLOAT);

// Execute the 'dot' function
cublasSdot(handle, n, deviceData, 1, deviceData, 1, deviceResultPointer);

JCuda的JCublas2.cublasSdot：未能将设备指针用于结果指针参数

1 个答案: