{"id":6527,"date":"2021-08-10T03:01:01","date_gmt":"2021-08-09T18:01:01","guid":{"rendered":"https:\/\/wp.study3.biz\/?p=6527"},"modified":"2021-08-10T03:02:17","modified_gmt":"2021-08-09T18:02:17","slug":"amd-ryzen-threadripper-pro-3995wx-64-core-windows-10-pro-titan-rtx-x2-cuda-11-3-samples-p2pbandwidthlatencytest-devicequery-%e3%82%92%e5%8b%95%e4%bd%9c%e3%81%95%e3%81%9b%e3%81%a6%e3%81%bf%e3%81%9f","status":"publish","type":"post","link":"https:\/\/wp.study3.biz\/?p=6527","title":{"rendered":"AMD Ryzen Threadripper PRO 3995WX 64-Core Windows 10 Pro TITAN RTX x2 CUDA 11.3 Samples  p2pBandwidthLatencyTest deviceQuery \u3092\u52d5\u4f5c\u3055\u305b\u3066\u307f\u305f"},"content":{"rendered":"<p><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2021\/04\/3995wx-TITAN-RTX-x2-SLI.jpg\" alt=\"\" width=\"3840\" height=\"2160\" class=\"alignnone size-full wp-image-6534\" \/><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2021\/04\/3995wx-cuda-11.3-p2pBandwidthLatencyTest.jpg\" alt=\"\" width=\"3840\" height=\"2160\" class=\"alignnone size-full wp-image-6537\" \/><img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2021\/04\/3995wx-cuda-11.3-deviceQuery.jpg\" alt=\"\" width=\"3840\" height=\"2160\" class=\"alignnone size-full wp-image-6538\" \/><br \/>\nC:\\Windows\\system32>cd C:\\ProgramData\\NVIDIA Corporation\\CUDA Samples\\v11.3\\bin\\win64\\Debug<\/p>\n<p>C:\\ProgramData\\NVIDIA Corporation\\CUDA Samples\\v11.3\\bin\\win64\\Debug>nvidia-smi<\/p>\n<p>C:\\ProgramData\\NVIDIA Corporation\\CUDA Samples\\v11.3\\bin\\win64\\Debug>deviceQuery<br \/>\ndeviceQuery Starting&#8230;<\/p>\n<p> CUDA Device Query (Runtime API) version (CUDART static linking)<\/p>\n<p>Detected 2 CUDA Capable device(s)<\/p>\n<p>Device 0: &#8220;NVIDIA TITAN RTX&#8221;<br \/>\n  CUDA Driver Version \/ Runtime Version          11.3 \/ 11.3<br \/>\n  CUDA Capability Major\/Minor version number:    7.5<br \/>\n  Total amount of global memory:                 24576 MBytes (25769803776 bytes)<br \/>\n  (072) Multiprocessors, (064) CUDA Cores\/MP:    4608 CUDA Cores<br \/>\n  GPU Max Clock rate:                            1770 MHz (1.77 GHz)<br \/>\n  Memory Clock rate:                             7001 Mhz<br \/>\n  Memory Bus Width:                              384-bit<br \/>\n  L2 Cache Size:                                 6291456 bytes<br \/>\n  Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)<br \/>\n  Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers<br \/>\n  Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers<br \/>\n  Total amount of constant memory:               65536 bytes<br \/>\n  Total amount of shared memory per block:       49152 bytes<br \/>\n  Total shared memory per multiprocessor:        65536 bytes<br \/>\n  Total number of registers available per block: 65536<br \/>\n  Warp size:                                     32<br \/>\n  Maximum number of threads per multiprocessor:  1024<br \/>\n  Maximum number of threads per block:           1024<br \/>\n  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)<br \/>\n  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)<br \/>\n  Maximum memory pitch:                          2147483647 bytes<br \/>\n  Texture alignment:                             512 bytes<br \/>\n  Concurrent copy and kernel execution:          Yes with 6 copy engine(s)<br \/>\n  Run time limit on kernels:                     Yes<br \/>\n  Integrated GPU sharing Host Memory:            No<br \/>\n  Support host page-locked memory mapping:       Yes<br \/>\n  Alignment requirement for Surfaces:            Yes<br \/>\n  Device has ECC support:                        Disabled<br \/>\n  CUDA Device Driver Mode (TCC or WDDM):         WDDM (Windows Display Driver Model)<br \/>\n  Device supports Unified Addressing (UVA):      Yes<br \/>\n  Device supports Managed Memory:                Yes<br \/>\n  Device supports Compute Preemption:            Yes<br \/>\n  Supports Cooperative Kernel Launch:            Yes<br \/>\n  Supports MultiDevice Co-op Kernel Launch:      No<br \/>\n  Device PCI Domain ID \/ Bus ID \/ location ID:   0 \/ 65 \/ 0<br \/>\n  Compute Mode:<br \/>\n     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) ><\/p>\n<p>Device 1: &#8220;NVIDIA TITAN RTX&#8221;<br \/>\n  CUDA Driver Version \/ Runtime Version          11.3 \/ 11.3<br \/>\n  CUDA Capability Major\/Minor version number:    7.5<br \/>\n  Total amount of global memory:                 24576 MBytes (25769803776 bytes)<br \/>\n  (072) Multiprocessors, (064) CUDA Cores\/MP:    4608 CUDA Cores<br \/>\n  GPU Max Clock rate:                            1770 MHz (1.77 GHz)<br \/>\n  Memory Clock rate:                             7001 Mhz<br \/>\n  Memory Bus Width:                              384-bit<br \/>\n  L2 Cache Size:                                 6291456 bytes<br \/>\n  Maximum Texture Dimension Size (x,y,z)         1D=(131072), 2D=(131072, 65536), 3D=(16384, 16384, 16384)<br \/>\n  Maximum Layered 1D Texture Size, (num) layers  1D=(32768), 2048 layers<br \/>\n  Maximum Layered 2D Texture Size, (num) layers  2D=(32768, 32768), 2048 layers<br \/>\n  Total amount of constant memory:               65536 bytes<br \/>\n  Total amount of shared memory per block:       49152 bytes<br \/>\n  Total shared memory per multiprocessor:        65536 bytes<br \/>\n  Total number of registers available per block: 65536<br \/>\n  Warp size:                                     32<br \/>\n  Maximum number of threads per multiprocessor:  1024<br \/>\n  Maximum number of threads per block:           1024<br \/>\n  Max dimension size of a thread block (x,y,z): (1024, 1024, 64)<br \/>\n  Max dimension size of a grid size    (x,y,z): (2147483647, 65535, 65535)<br \/>\n  Maximum memory pitch:                          2147483647 bytes<br \/>\n  Texture alignment:                             512 bytes<br \/>\n  Concurrent copy and kernel execution:          Yes with 6 copy engine(s)<br \/>\n  Run time limit on kernels:                     Yes<br \/>\n  Integrated GPU sharing Host Memory:            No<br \/>\n  Support host page-locked memory mapping:       Yes<br \/>\n  Alignment requirement for Surfaces:            Yes<br \/>\n  Device has ECC support:                        Disabled<br \/>\n  CUDA Device Driver Mode (TCC or WDDM):         WDDM (Windows Display Driver Model)<br \/>\n  Device supports Unified Addressing (UVA):      Yes<br \/>\n  Device supports Managed Memory:                Yes<br \/>\n  Device supports Compute Preemption:            Yes<br \/>\n  Supports Cooperative Kernel Launch:            Yes<br \/>\n  Supports MultiDevice Co-op Kernel Launch:      No<br \/>\n  Device PCI Domain ID \/ Bus ID \/ location ID:   0 \/ 97 \/ 0<br \/>\n  Compute Mode:<br \/>\n     < Default (multiple host threads can use ::cudaSetDevice() with device simultaneously) ><\/p>\n<p>deviceQuery, CUDA Driver = CUDART, CUDA Driver Version = 11.3, CUDA Runtime Version = 11.3, NumDevs = 2<br \/>\nResult = PASS<\/p>\n<p>C:\\ProgramData\\NVIDIA Corporation\\CUDA Samples\\v11.3\\bin\\win64\\Debug><br \/>\n<a href=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2021\/04\/15slot-Windows10-pro-titan-rtx-x2-3995wx-cuda11.3-p2pBandwidthLatencyTest.txt\">1,5slot Windows10 pro titan rtx x2 3995wx cuda11.3 p2pBandwidthLatencyTest<\/a><br \/>\n<a href=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2021\/04\/15slot-Windows10-pro-titan-rtx-x2-3995wx-cuda11.3-deviceQuery.txt\">1,5slot Windows10 pro titan rtx x2 3995wx cuda11.3 deviceQuery<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>C:\\Windows\\system32>cd C:\\ProgramData\\NVIDIA Corporation\\CUDA Samples\\v11.3\\bin\\win64\\Debug C:\\ProgramData\\NVI &hellip; <a href=\"https:\/\/wp.study3.biz\/?p=6527\">\u7d9a\u304d\u3092\u8aad\u3080 <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[18,10],"tags":[],"class_list":["post-6527","post","type-post","status-publish","format-standard","hentry","category-nvidia","category-windows"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/6527","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=6527"}],"version-history":[{"count":3,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/6527\/revisions"}],"predecessor-version":[{"id":6542,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/6527\/revisions\/6542"}],"wp:attachment":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=6527"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=6527"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=6527"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}