{"id":965,"date":"2020-04-05T00:31:25","date_gmt":"2020-04-04T15:31:25","guid":{"rendered":"https:\/\/wp.study3.biz\/?p=965"},"modified":"2020-04-05T00:32:28","modified_gmt":"2020-04-04T15:32:28","slug":"ubuntu16-04-6-titan-v-x2-cuda-10-2-samples-nbodybenchmark-%e3%82%92%e5%8b%95%e4%bd%9c%e3%81%95%e3%81%9b%e3%81%a6%e3%81%bf%e3%81%9ftitan-v-x2-%e5%8d%98%e7%b2%be%e5%ba%a612572-154-gflop-s-titan-v-x2","status":"publish","type":"post","link":"https:\/\/wp.study3.biz\/?p=965","title":{"rendered":"Ubuntu16.04.6 TITAN V x2 CUDA 10.2 Samples nbodybenchmark \u3092\u52d5\u4f5c\u3055\u305b\u3066\u307f\u305fTITaN V x2 \u5358\u7cbe\u5ea6=12572.154 GFLOP\/s TITAN V x2 \u500d\u7cbe\u5ea6=5477.201 GFLOP\/s"},"content":{"rendered":"<p>chibi@1604:~\/NVIDIA_CUDA-10.2_Samples\/5_Simulations\/nbody$ cat \/etc\/os-release<br \/>\nNAME=&#8221;Ubuntu&#8221;<br \/>\nVERSION=&#8221;16.04.6 LTS (Xenial Xerus)&#8221;<br \/>\nID=ubuntu<br \/>\nID_LIKE=debian<br \/>\nPRETTY_NAME=&#8221;Ubuntu 16.04.6 LTS&#8221;<br \/>\nVERSION_ID=&#8221;16.04&#8243;<br \/>\nHOME_URL=&#8221;http:\/\/www.ubuntu.com\/&#8221;<br \/>\nSUPPORT_URL=&#8221;http:\/\/help.ubuntu.com\/&#8221;<br \/>\nBUG_REPORT_URL=&#8221;http:\/\/bugs.launchpad.net\/ubuntu\/&#8221;<br \/>\nVERSION_CODENAME=xenial<br \/>\nUBUNTU_CODENAME=xenial<br \/>\nchibi@1604:~\/NVIDIA_CUDA-10.2_Samples\/5_Simulations\/nbody$ nvcc -V<br \/>\nnvcc: NVIDIA (R) Cuda compiler driver<br \/>\nCopyright (c) 2005-2019 NVIDIA Corporation<br \/>\nBuilt on Wed_Oct_23_19:24:38_PDT_2019<br \/>\nCuda compilation tools, release 10.2, V10.2.89<br \/>\nchibi@1604:~\/NVIDIA_CUDA-10.2_Samples\/5_Simulations\/nbody$ .\/nbody &#8211;benchmark &#8211;<br \/>\n-numbodies=256000 -numdevices=2<br \/>\nRun &#8220;nbody -benchmark [-numbodies=&lt;numBodies&gt;]&#8221; to measure performance.<br \/>\n-fullscreen (run n-body simulation in fullscreen mode)<br \/>\n-fp64 (use double precision floating point values for simulation)<br \/>\n-hostmem (stores simulation data in host memory)<br \/>\n-benchmark (run benchmark to measure performance)<br \/>\n-numbodies=&lt;N&gt; (number of bodies (&gt;= 1) to run in simulation)<br \/>\n-device=&lt;d&gt; (where d=0,1,2&#8230;. for the CUDA device to use)<br \/>\n-numdevices=&lt;i&gt; (where i=(number of CUDA devices &gt; 0) to use for simulation)<br \/>\n-compare (compares simulation results running once on the default GPU and once on the CPU)<br \/>\n-cpu (run n-body simulation on the CPU)<br \/>\n-tipsy=&lt;file.bin&gt; (load a tipsy model file for simulation)<\/p>\n<p>NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.<\/p>\n<p>number of CUDA devices = 2<br \/>\n&gt; Windowed mode<br \/>\n&gt; Simulation data stored in system memory<br \/>\n&gt; Single precision floating point simulation<br \/>\n&gt; 2 Devices used for simulation<br \/>\nGPU Device 0: &#8220;Volta&#8221; with compute capability 7.0<\/p>\n<p>&gt; Compute 7.0 CUDA device: [TITAN V]<br \/>\n&gt; Compute 7.0 CUDA device: [TITAN V]<br \/>\nnumber of bodies = 256000<br \/>\n256000 bodies, total time for 10 iterations: 1042.558 ms<br \/>\n= 628.608 billion interactions per second<br \/>\n<strong>= 12572.154 single-precision GFLOP\/s at 20 flops per interaction<\/strong><br \/>\nchibi@1604:~\/NVIDIA_CUDA-10.2_Samples\/5_Simulations\/nbody$ .\/nbody -fp64 &#8211;bench<br \/>\nmark &#8211;numbodies=256000 -numdevices=2<br \/>\nRun &#8220;nbody -benchmark [-numbodies=&lt;numBodies&gt;]&#8221; to measure performance.<br \/>\n-fullscreen (run n-body simulation in fullscreen mode)<br \/>\n-fp64 (use double precision floating point values for simulation)<br \/>\n-hostmem (stores simulation data in host memory)<br \/>\n-benchmark (run benchmark to measure performance)<br \/>\n-numbodies=&lt;N&gt; (number of bodies (&gt;= 1) to run in simulation)<br \/>\n-device=&lt;d&gt; (where d=0,1,2&#8230;. for the CUDA device to use)<br \/>\n-numdevices=&lt;i&gt; (where i=(number of CUDA devices &gt; 0) to use for simulation)<br \/>\n-compare (compares simulation results running once on the default GPU and once on the CPU)<br \/>\n-cpu (run n-body simulation on the CPU)<br \/>\n-tipsy=&lt;file.bin&gt; (load a tipsy model file for simulation)<\/p>\n<p>NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.<\/p>\n<p>number of CUDA devices = 2<br \/>\n&gt; Windowed mode<br \/>\n&gt; Simulation data stored in system memory<br \/>\n&gt; Double precision floating point simulation<br \/>\n&gt; 2 Devices used for simulation<br \/>\nGPU Device 0: &#8220;Volta&#8221; with compute capability 7.0<\/p>\n<blockquote><p>Compute 7.0 CUDA device: [TITAN V]<br \/>\n&gt; Compute 7.0 CUDA device: [TITAN V]<br \/>\nnumber of bodies = 256000<br \/>\n256000 bodies, total time for 10 iterations: 3589.571 ms<br \/>\n= 182.573 billion interactions per second<br \/>\n<strong>= 5477.201 double-precision GFLOP\/s at 30 flops per interaction<\/strong><br \/>\nchibi@1604:~\/NVIDIA_CUDA-10.2_Samples\/5_Simulations\/nbody$<\/p><\/blockquote>\n<p><img loading=\"lazy\" decoding=\"async\" class=\"alignnone size-full wp-image-971\" src=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2020\/03\/nbody-\u500d\u7cbe\u5ea6.jpg\" alt=\"\" width=\"1920\" height=\"1080\" \/><\/p>\n<p>GPU\u6e29\u5ea6\u63a8\u79fb <a href=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2020\/03\/Ubuntu16.04.6-TITAN-V-x2-CUDA-10.2-Samples-nbodybenchmark-TITaN-V-x2-\u5358\u7cbe\u5ea612572.154-GFLOP-s-TITAN-V-x2-\u500d\u7cbe\u5ea65477.201-GFLOP-s-nvidia-smi.txt\">Ubuntu16.04.6 TITAN V x2 CUDA 10.2 Samples nbodybenchmark TITaN V x2 \u5358\u7cbe\u5ea6=12572.154 GFLOP s TITAN V x2 \u500d\u7cbe\u5ea6=5477.201 GFLOP s nvidia-smi<\/a><\/p>\n","protected":false},"excerpt":{"rendered":"<p>chibi@1604:~\/NVIDIA_CUDA-10.2_Samples\/5_Simulations\/nbody$ cat \/etc\/os-release NAME=&#8221;Ubuntu&#8221; VERSI &hellip; <a href=\"https:\/\/wp.study3.biz\/?p=965\">\u7d9a\u304d\u3092\u8aad\u3080 <span class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_newsletter_access":"","_jetpack_dont_email_post_to_subs":false,"_jetpack_newsletter_tier_id":0,"_jetpack_memberships_contains_paywalled_content":false,"_jetpack_feature_clip_id":0,"_jetpack_memberships_contains_paid_content":false,"footnotes":"","jetpack_post_was_ever_published":false},"categories":[18,17],"tags":[],"class_list":["post-965","post","type-post","status-publish","format-standard","hentry","category-nvidia","category-ubuntu"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/965","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=965"}],"version-history":[{"count":3,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/965\/revisions"}],"predecessor-version":[{"id":1006,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/965\/revisions\/1006"}],"wp:attachment":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=965"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=965"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=965"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}