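I run the same program on both a GTX 1080 Ti and an RTX 2080 Ti. When I check the program with the cuda-memcheck tool, the following errors always appear on the RTX 2080 Ti, and only on that device.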
========= CUDA-MEMCHECK
========= Program hit cudaErrorInvalidValue (error 11) due to "invalid argument" on CUDA API call to cudaFuncSetAttribute.
========= Saved host backtrace up to driver entry point at error
========= Host Frame:/usr/lib/x86_64-linux-gnu/libcuda.so.1 [0x359363]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 [0x79a03c]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 [0x72c2ab]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 [0x72c610]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 (cublasCreate_v2 + 0x1ce7) [0x14b337]
========= Host Frame:./GPU_LMM (main + 0x43) [0xb633]
========= Host Frame:/lib/x86_64-linux-gnu/libc.so.6 (__libc_start_main + 0xe7) [0x21b97]
========= Host Frame:./GPU_LMM (_start + 0x2a) [0xb77a]
=========
========= Program hit cudaErrorInvalidValue (error 11) due to "invalid argument" on CUDA API call to cudaGetLastError.
========= Saved host backtrace up to driver entry point at error
========= Host Frame:/usr/lib/x86_64-linux-gnu/libcuda.so.1 [0x359363]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 [0x79deb3]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 [0x72c2b8]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 [0x72c610]
========= Host Frame:/usr/local/cuda/lib64/libcublas.so.10.0 (cublasCreate_v2 + 0x1ce7) [0x14b337]
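I have made sure that at this point the program does nothing except create the cuBLAS context; no other operation is executed. I am not sure what the problem is. Is it caused by a version mismatch between CUDA 10.0 and the RTX 2080 Ti?
My server's configuration is as follows. NVIDIA-SMI 410.93, Driver Version: 410.93, CUDA Version: 10.0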
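Answer 0 (score: 1)
The latest CUDA release, 10.0.130, should support the RTX 2080 Ti. Make sure your driver is up to date as well.
On Linux that means driver version >= 410.48; on Windows, >= 411.31.
cuBLAS also gained Turing support in version 10.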
Answer 1 (score: 0)
The real problem is that the cuBLAS library is not compatible with CUDA 10 and RTX GPU cards.