Running simpleP2P, p2pBandwidthLatencyTest, deviceQuery, and bandwidthTest on an AMD 2990WX with Ubuntu 18.04.2, two RTX 2080 Ti cards joined by a 2-slot NVLink bridge, and CUDA 10.1

chibi@1804:~$ cat /etc/os-release
NAME="Ubuntu"
VERSION="18.04.2 LTS (Bionic Beaver)"
ID=ubuntu
ID_LIKE=debian
PRETTY_NAME="Ubuntu 18.04.2 LTS"
VERSION_ID="18.04"
HOME_URL="https://www.ubuntu.com/"
SUPPORT_URL="https://help.ubuntu.com/"
BUG_REPORT_URL="https://bugs.launchpad.net/ubuntu/"
PRIVACY_POLICY_URL="https://www.ubuntu.com/legal/terms-and-policies/privacy-policy"
VERSION_CODENAME=bionic
UBUNTU_CODENAME=bionic
chibi@1804:~$ nvcc -V
nvcc: NVIDIA (R) Cuda compiler driver
Copyright (c) 2005-2019 NVIDIA Corporation
Built on Fri_Feb__8_19:08:17_PST_2019
Cuda compilation tools, release 10.1, V10.1.105
chibi@1804:~/NVIDIA_CUDA-10.1_Samples/1_Utilities/p2pBandwidthLatencyTest$ ./p2pBandwidthLatencyTest
[P2P (Peer-to-Peer) GPU Bandwidth Latency Test]
Device: 0, GeForce RTX 2080 Ti, pciBusID: 42, pciDeviceID: 0, pciDomainID:0
Device: 1, GeForce RTX 2080 Ti, pciBusID: 43, pciDeviceID: 0, pciDomainID:0
Device=0 CAN Access Peer Device=1
Device=1 CAN Access Peer Device=0

***NOTE: In case a device doesn't have P2P access to other one, it falls back to normal memcopy procedure.
So you can see lesser Bandwidth (GB/s) and unstable Latency (us) in those cases.

P2P Connectivity Matrix
D\D 0 1
0 1 1
1 1 1
Unidirectional P2P=Disabled Bandwidth Matrix (GB/s)
D\D 0 1
0 528.52 4.18
1 4.17 527.72
Unidirectional P2P=Enabled Bandwidth (P2P Writes) Matrix (GB/s)
D\D 0 1
0 530.67 47.00
1 46.94 507.31
Bidirectional P2P=Disabled Bandwidth Matrix (GB/s)
D\D 0 1
0 534.45 7.72
1 7.78 532.24
Bidirectional P2P=Enabled Bandwidth Matrix (GB/s)
D\D 0 1
0 524.11 93.71
1 93.79 528.95
P2P=Disabled Latency Matrix (us)
GPU 0 1
0 1.31 16.16
1 21.55 1.66

CPU 0 1
0 3.77 9.87
1 9.87 3.73
P2P=Enabled Latency (P2P Writes) Matrix (us)
GPU 0 1
0 1.32 1.59
1 1.87 1.66

CPU 0 1
0 3.72 2.89
1 2.89 3.70

NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
chibi@1804:~/NVIDIA_CUDA-10.1_Samples/1_Utilities/p2pBandwidthLatencyTest$
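
To put the numbers above in perspective, here is a small sketch that computes how much the GPU 0 to GPU 1 path improves once P2P is enabled. The values are copied by hand from the log above; the variable names are my own, and since the sample itself warns that it is not meant for performance measurements, treat the ratios as rough indications only.

```python
# Values copied from the p2pBandwidthLatencyTest log above (row 0, column 1,
# i.e. GPU 0 -> GPU 1). Variable names are illustrative, not from the sample.
uni_disabled_gbs = 4.18    # Unidirectional bandwidth, P2P disabled (GB/s)
uni_enabled_gbs = 47.00    # Unidirectional bandwidth, P2P enabled (GB/s)
bi_disabled_gbs = 7.72     # Bidirectional bandwidth, P2P disabled (GB/s)
bi_enabled_gbs = 93.71     # Bidirectional bandwidth, P2P enabled (GB/s)
lat_disabled_us = 16.16    # GPU-side latency, P2P disabled (us)
lat_enabled_us = 1.59      # GPU-side latency, P2P enabled (us)

# Ratio of the P2P-enabled figure to the P2P-disabled (memcpy fallback) figure.
uni_speedup = uni_enabled_gbs / uni_disabled_gbs
bi_speedup = bi_enabled_gbs / bi_disabled_gbs
# For latency, lower is better, so the ratio is disabled / enabled.
lat_reduction = lat_disabled_us / lat_enabled_us

print(f"Unidirectional bandwidth: {uni_speedup:.1f}x higher with P2P")
print(f"Bidirectional bandwidth:  {bi_speedup:.1f}x higher with P2P")
print(f"GPU latency:              {lat_reduction:.1f}x lower with P2P")
```

On this run, enabling P2P raises cross-GPU bandwidth by roughly 11x to 12x and cuts GPU-side latency by about 10x, while the on-device figures on the diagonal stay essentially the same.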
