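I have an Ubuntu 16.04 system with both a GT 610 and a GTX TITAN X installed. Until now I was running NVIDIA driver 390 on it. I recently upgraded to NVIDIA 396 (for Anaconda tensorflow-gpu), and now my display no longer works. I can still SSH into the system from outside, and both GPUs show up as hardware: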
user@supermicro:~$ lspci | grep NVIDIA
02:00.0 VGA compatible controller: NVIDIA Corporation GF119 [GeForce GT 610] (rev a1)
02:00.1 Audio device: NVIDIA Corporation GF119 HDMI Audio Controller (rev a1)
03:00.0 VGA compatible controller: NVIDIA Corporation GM200 [GeForce GTX TITAN X] (rev a1)
03:00.1 Audio device: NVIDIA Corporation Device 0fb0 (rev a1)
However, nvidia-smi lists only the TITAN X:
user@supermicro:~$ nvidia-smi
Mon Dec 17 10:11:03 2018
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 396.54                 Driver Version: 396.54                    |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX TIT...  Off  | 00000000:03:00.0 Off |                  N/A |
|  0%   61C    P0    63W / 250W |      0MiB / 12212MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+
+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|  No running processes found                                                 |
+-----------------------------------------------------------------------------+
My filtered dmesg output is:
user@supermicro:~$ dmesg | grep -i nvidia
[ 18.417165] nvidia: loading out-of-tree module taints kernel.
[ 18.417175] nvidia: module license 'NVIDIA' taints kernel.
[ 18.427423] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[ 18.457750] nvidia-nvlink: Nvlink Core is being initialized, major device number 239
[ 18.457949] NVRM: The NVIDIA GeForce GT 610 GPU installed in this system is
NVRM: supported through the NVIDIA 390.xx Legacy drivers. Please
NVRM: visit http://www.nvidia.com/object/unix.html for more
NVRM: information. The 396.54 NVIDIA driver will ignore
[ 18.458076] nvidia: probe of 0000:02:00.0 failed with error -1
[ 18.458099] nvidia 0000:03:00.0: enabling device (0000 -> 0003)
[ 18.458150] nvidia 0000:03:00.0: vgaarb: changed VGA decodes: olddecodes=io+mem,decodes=none:owns=none
[ 18.458265] NVRM: loading NVIDIA UNIX x86_64 Kernel Module 396.54 Tue Aug 14 19:02:34 PDT 2018 (using threaded interrupts)
[ 18.501306] nvidia-modeset: Loading NVIDIA Kernel Mode Setting Driver for UNIX platforms 396.54 Tue Aug 14 23:08:44 PDT 2018
[ 18.502031] [drm] [nvidia-drm] [GPU ID 0x00000300] Loading driver
[ 18.502032] [drm] Initialized nvidia-drm 0.0.0 20160202 for 0000:03:00.0 on minor 0
[ 31.494731] nvidia-uvm: Loaded the UVM driver in 8 mode, major device number 237
[ 33.904821] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:02.0/0000:02:00.1/sound/card1/input12
[ 33.904905] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:02.0/0000:02:00.1/sound/card1/input13
[ 33.905109] input: HDA NVidia HDMI/DP,pcm=3 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card2/input14
[ 33.905162] input: HDA NVidia HDMI/DP,pcm=7 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card2/input15
[ 33.905207] input: HDA NVidia HDMI/DP,pcm=8 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card2/input16
[ 33.905249] input: HDA NVidia HDMI/DP,pcm=9 as /devices/pci0000:00/0000:00:03.0/0000:03:00.1/sound/card2/input17
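To cross-check which card the driver skipped without reading dmesg by hand, the two listings can be compared by PCI address. This is only a rough sketch: `unclaimed_gpus` is a hypothetical helper I made up, not part of any NVIDIA tool, and it assumes that `nvidia-smi --query-gpu=pci.bus_id --format=csv,noheader` prints one bus ID per line in the same `00000000:03:00.0` form shown in the table above.

```shell
#!/bin/sh
# Hypothetical helper (not an NVIDIA tool): print the NVIDIA VGA devices that
# lspci reports but that the loaded driver does not list.
#   $1: output of `lspci | grep NVIDIA`
#   $2: output of `nvidia-smi --query-gpu=pci.bus_id --format=csv,noheader`
unclaimed_gpus() {
    # Reduce the driver's bus IDs (e.g. 00000000:03:00.0) to lspci's short
    # bus:device.function form (03:00.0), lowercased, with spaces removed.
    claimed=$(printf '%s\n' "$2" | tr -d ' ' | tr 'A-F' 'a-f' \
        | sed 's/^.*\(..:..\..\)$/\1/')

    # Keep only the VGA functions and report those missing from the driver's list.
    printf '%s\n' "$1" | grep 'VGA compatible controller' \
        | while read -r addr rest; do
            if ! printf '%s\n' "$claimed" | grep -qxF "$addr"; then
                printf '%s\n' "$addr $rest"
            fi
        done
}

# Intended use on the affected machine (left commented out here):
# unclaimed_gpus "$(lspci | grep NVIDIA)" \
#     "$(nvidia-smi --query-gpu=pci.bus_id --format=csv,noheader)"
```

Fed the lspci listing and the single bus ID from the nvidia-smi table above, this should report only the 02:00.0 entry, the GT 610.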
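Judging by the NVRM message above, the new 396.54 driver does not support the GT 610, which is only covered by the 390.xx legacy drivers. As I understand it, I had been using the GT 610 for the display and the TITAN for everything else.
How can I fix this display problem? Is it possible to drive the display from the TITAN X? I use this machine to train deep learning models, and the TITAN was previously dedicated to that. Can the TITAN handle display and training at the same time?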