{"id":2880,"date":"2020-08-09T03:39:18","date_gmt":"2020-08-08T18:39:18","guid":{"rendered":"https:\/\/wp.study3.biz\/?p=2880"},"modified":"2020-08-09T03:46:19","modified_gmt":"2020-08-08T18:46:19","slug":"amd-ryzen-threadripper-3990x-64-core-processor-centos-linux-release-8-2-titan-rtx-x2-rtx2080ti-x2-cuda-11-0-namd-2-12-171025-stmv-virus-benchmark-1066628-atoms-periodic-pme%e3%82%92%e4%bb%96","status":"publish","type":"post","link":"https:\/\/wp.study3.biz\/?p=2880","title":{"rendered":"AMD Ryzen Threadripper 3990X 64-Core Processor CentOS Linux release 8.2 TITAN RTX x2 RTX2080Ti x2 CUDA 11.0 namd 2.12-171025  STMV (virus) benchmark (1,066,628 atoms, periodic, PME), run for comparison with other CPUs: 0.265825 days\/ns"},"content":{"rendered":"<p>[chibi@centos8 ~]$ sudo nvidia-docker run -it --rm nvcr.io\/hpc\/namd:2.12-171025 \/opt\/namd\/namd-multicore-memopt +p40 +setcpuaffinity +idlepoll \/workspace\/examples\/stmv\/stmv_pmecuda.namd<br \/>\n[sudo] password for chibi:<br \/>\nCharm++: standalone mode (not using charmrun)<br \/>\nCharm++> Running in Multicore mode:  40 threads<br \/>\nCharm++> Using recursive bisection (scheme 3) for topology aware partitions<br \/>\nConverse\/Charm++ Commit ID: v6.8.2<br \/>\nWarning> Randomization of virtual memory (ASLR) is turned on in the kernel, thread migration may not work! 
Run 'echo 0 > \/proc\/sys\/kernel\/randomize_va_space' as root to disable it, or try running with '+isomalloc_sync'.<br \/>\nCharmLB> Load balancer assumes all CPUs are same.<br \/>\nCharm++> cpu affinity enabled.<br \/>\nCharm++> Running on 1 unique compute nodes (128-way SMP).<br \/>\nCharm++> cpu topology info is gathered in 0.004 seconds.<br \/>\nInfo: Built with CUDA version 9000<br \/>\nDid not find +devices i,j,k,... argument, using all<br \/>\nPe 19 physical rank 19 will use CUDA device of pe 16<br \/>\nPe 3 physical rank 3 will use CUDA device of pe 8<br \/>\nPe 28 physical rank 28 will use CUDA device of pe 24<br \/>\nPe 18 physical rank 18 will use CUDA device of pe 16<br \/>\nPe 22 physical rank 22 will use CUDA device of pe 24<br \/>\nPe 23 physical rank 23 will use CUDA device of pe 24<br \/>\nPe 4 physical rank 4 will use CUDA device of pe 8<br \/>\nPe 5 physical rank 5 will use CUDA device of pe 8<br \/>\nPe 12 physical rank 12 will use CUDA device of pe 16<br \/>\nPe 21 physical rank 21 will use CUDA device of pe 24<br \/>\nPe 2 physical rank 2 will use CUDA device of pe 8<br \/>\nPe 36 physical rank 36 will use CUDA device of pe 32<br \/>\nPe 29 physical rank 29 will use CUDA device of pe 24<br \/>\nPe 1 physical rank 1 will use CUDA device of pe 8<br \/>\nPe 17 physical rank 17 will use CUDA device of pe 16<br \/>\nPe 35 physical rank 35 will use CUDA device of pe 32<br \/>\nPe 0 physical rank 0 will use CUDA device of pe 8<br \/>\nPe 10 physical rank 10 will use CUDA device of pe 16<br \/>\nPe 33 physical rank 33 will use CUDA device of pe 32<br \/>\nPe 9 physical rank 9 will use CUDA device of pe 8<br \/>\nPe 34 physical rank 34 will use CUDA device of pe 32<br \/>\nPe 20 physical rank 20 will use CUDA device of pe 24<br \/>\nPe 11 physical rank 11 will use CUDA device of pe 16<br \/>\nPe 31 physical rank 31 will use CUDA device of pe 32<br \/>\nPe 6 physical rank 6 will use CUDA device of pe 8<br \/>\nPe 30 physical 
rank 30 will use CUDA device of pe 32<br \/>\nPe 15 physical rank 15 will use CUDA device of pe 16<br \/>\nPe 39 physical rank 39 will use CUDA device of pe 32<br \/>\nPe 38 physical rank 38 will use CUDA device of pe 32<br \/>\nPe 7 physical rank 7 will use CUDA device of pe 8<br \/>\nPe 14 physical rank 14 will use CUDA device of pe 16<br \/>\nPe 25 physical rank 25 will use CUDA device of pe 24<br \/>\nPe 27 physical rank 27 will use CUDA device of pe 24<br \/>\nPe 26 physical rank 26 will use CUDA device of pe 24<br \/>\nPe 37 physical rank 37 will use CUDA device of pe 32<br \/>\nPe 13 physical rank 13 will use CUDA device of pe 16<br \/>\nPe 32 physical rank 32 binding to CUDA device 3 on c22b80e21951: 'GeForce RTX 2080 Ti'  Mem: 11019MB  Rev: 7.5<br \/>\nPe 24 physical rank 24 binding to CUDA device 2 on c22b80e21951: 'GeForce RTX 2080 Ti'  Mem: 11019MB  Rev: 7.5<br \/>\nPe 8 physical rank 8 binding to CUDA device 0 on c22b80e21951: 'TITAN RTX'  Mem: 24219MB  Rev: 7.5<br \/>\nPe 16 physical rank 16 binding to CUDA device 1 on c22b80e21951: 'TITAN RTX'  Mem: 24220MB  Rev: 7.5<br \/>\nInfo: NAMD 2.12 for Linux-x86_64-multicore-CUDA-memopt<br \/>\nWarning:<br \/>\nWarning:        ***  EXPERIMENTAL MEMORY OPTIMIZED VERSION  ***<br \/>\nWarning:<br \/>\nInfo:<br \/>\nInfo: Please visit http:\/\/www.ks.uiuc.edu\/Research\/namd\/<br \/>\nInfo: for updates, documentation, and support information.<br \/>\nInfo:<br \/>\nInfo: Please cite Phillips et al., J. Comp. Chem. 
26:1781-1802 (2005)<br \/>\nInfo: in all publications reporting results obtained with NAMD.<br \/>\nInfo:<br \/>\nInfo: Based on Charm++\/Converse 60800 for multicore-linux64-gcc<br \/>\nInfo: Built Tue Nov 21 02:03:10 UTC 2017 by  on a02d2dbfe66b<br \/>\nInfo: 1 NAMD  2.12  Linux-x86_64-multicore-CUDA-memopt  40    c22b80e21951  root<br \/>\nInfo: Running on 40 processors, 1 nodes, 1 physical nodes.<br \/>\nInfo: CPU topology information available.<br \/>\nInfo: Charm++\/Converse parallel runtime startup completed at 0.340343 s<br \/>\nCkLoopLib is used in SMP with a simple dynamic scheduling (converse-level notification) but not using node-level queue<br \/>\nInfo: 39.5391 MB of memory in use based on \/proc\/self\/stat<br \/>\nInfo: Configuration file is \/workspace\/examples\/stmv\/stmv_pmecuda.namd<br \/>\nInfo: Changed directory to \/workspace\/examples\/stmv<br \/>\nTCL: Suspending until startup complete.<br \/>\nInfo: SIMULATION PARAMETERS:<br \/>\nInfo: TIMESTEP               1<br \/>\nInfo: NUMBER OF STEPS        800<br \/>\nInfo: STEPS PER CYCLE        20<br \/>\nInfo: PERIODIC CELL BASIS 1  216.832 0 0<br \/>\nInfo: PERIODIC CELL BASIS 2  0 216.832 0<br \/>\nInfo: PERIODIC CELL BASIS 3  0 0 216.832<br \/>\nInfo: PERIODIC CELL CENTER   0 0 0<br \/>\nInfo: LOAD BALANCER  Hybrid<br \/>\nInfo: LOAD BALANCING STRATEGY  New Load Balancers -- DEFAULT<br \/>\nInfo: LDB PERIOD             4000 steps<br \/>\nInfo: FIRST LDB TIMESTEP     100<br \/>\nInfo: HYBRIDLB GROUP SIZE     512<br \/>\nInfo: LAST LDB TIMESTEP     -1<br \/>\nInfo: LDB BACKGROUND SCALING 1<br \/>\nInfo: HOM BACKGROUND SCALING 1<br \/>\nInfo: PME BACKGROUND SCALING 1<br \/>\nInfo: MAX SELF PARTITIONS    1<br \/>\nInfo: MAX PAIR PARTITIONS    1<br \/>\nInfo: SELF PARTITION ATOMS   154<br \/>\nInfo: SELF2 PARTITION ATOMS   154<br \/>\nInfo: PAIR PARTITION ATOMS   318<br \/>\nInfo: PAIR2 PARTITION ATOMS  637<br \/>\nInfo: MIN ATOMS PER PATCH    40<br \/>\nInfo: INITIAL TEMPERATURE    298<br 
\/>\nInfo: CENTER OF MASS MOVING INITIALLY? NO<br \/>\nInfo: DIELECTRIC             1<br \/>\nInfo: EXCLUDE                SCALED ONE-FOUR<br \/>\nInfo: 1-4 ELECTROSTATICS SCALED BY 1<br \/>\nInfo: MODIFIED 1-4 VDW PARAMETERS WILL BE USED<br \/>\nInfo: NO DCD TRAJECTORY OUTPUT<br \/>\nInfo: NO EXTENDED SYSTEM TRAJECTORY OUTPUT<br \/>\nInfo: NO VELOCITY DCD OUTPUT<br \/>\nInfo: NO FORCE DCD OUTPUT<br \/>\nInfo: OUTPUT FILENAME        \/workspace\/examples\/stmv\/stmv-output<br \/>\nInfo: BINARY OUTPUT FILES WILL BE USED<br \/>\nInfo: NO RESTART FILE<br \/>\nInfo: SWITCHING ACTIVE<br \/>\nInfo: SWITCHING ON           10<br \/>\nInfo: SWITCHING OFF          12<br \/>\nInfo: PAIRLIST DISTANCE      13.5<br \/>\nInfo: PAIRLIST SHRINK RATE   0.01<br \/>\nInfo: PAIRLIST GROW RATE     0.01<br \/>\nInfo: PAIRLIST TRIGGER       0.3<br \/>\nInfo: PAIRLISTS PER CYCLE    2<br \/>\nInfo: PAIRLISTS ENABLED<br \/>\nInfo: MARGIN                 0.48<br \/>\nInfo: HYDROGEN GROUP CUTOFF  2.5<br \/>\nInfo: PATCH DIMENSION        16.48<br \/>\nInfo: ENERGY OUTPUT STEPS    200<br \/>\nInfo: CROSSTERM ENERGY INCLUDED IN DIHEDRAL<br \/>\nInfo: TIMING OUTPUT STEPS    1<br \/>\nInfo: LANGEVIN DYNAMICS ACTIVE<br \/>\nInfo: LANGEVIN TEMPERATURE   298<br \/>\nInfo: LANGEVIN USING BBK INTEGRATOR<br \/>\nInfo: LANGEVIN DAMPING COEFFICIENT IS 5 INVERSE PS<br \/>\nInfo: LANGEVIN DYNAMICS NOT APPLIED TO HYDROGENS<br \/>\nInfo: LANGEVIN PISTON PRESSURE CONTROL ACTIVE<br \/>\nInfo:        TARGET PRESSURE IS 1.01325 BAR<br \/>\nInfo:     OSCILLATION PERIOD IS 200 FS<br \/>\nInfo:             DECAY TIME IS 100 FS<br \/>\nInfo:     PISTON TEMPERATURE IS 298 K<br \/>\nInfo:       PRESSURE CONTROL IS GROUP-BASED<br \/>\nInfo:    INITIAL STRAIN RATE IS 0 0 0<br \/>\nInfo:       CELL FLUCTUATION IS ISOTROPIC<br \/>\nInfo: PARTICLE MESH EWALD (PME) ACTIVE<br \/>\nInfo: PME TOLERANCE               1e-06<br \/>\nInfo: PME EWALD COEFFICIENT       0.257952<br \/>\nInfo: PME INTERPOLATION ORDER     8<br \/>\nInfo: 
PME GRID DIMENSIONS         108 108 108<br \/>\nInfo: PME MAXIMUM GRID SPACING    2.1<br \/>\nInfo: FULL ELECTROSTATIC EVALUATION FREQUENCY      4<br \/>\nInfo: USING VERLET I (r-RESPA) MTS SCHEME.<br \/>\nInfo: C1 SPLITTING OF LONG RANGE ELECTROSTATICS<br \/>\nInfo: PLACING ATOMS IN PATCHES BY HYDROGEN GROUPS<br \/>\nInfo: RIGID BONDS TO HYDROGEN : ALL<br \/>\nInfo:         ERROR TOLERANCE : 1e-08<br \/>\nInfo:          MAX ITERATIONS : 100<br \/>\nInfo: RIGID WATER USING SETTLE ALGORITHM<br \/>\nInfo: NONBONDED FORCES EVALUATED EVERY 2 STEPS<br \/>\nInfo: RANDOM NUMBER SEED     3141<br \/>\nInfo: USE HYDROGEN BONDS?    NO<br \/>\nInfo: STRUCTURE FILE         stmv.psf.inter<br \/>\nInfo: PARAMETER file: CHARMM format!<br \/>\nInfo: PARAMETERS             par_all27_prot_na.inp<br \/>\nInfo: USING ARITHMETIC MEAN TO COMBINE L-J SIGMA PARAMETERS<br \/>\nInfo: BINARY COORDINATES     stmv.coor<br \/>\nInfo: SUMMARY OF PARAMETERS:<br \/>\nInfo: 250 BONDS<br \/>\nInfo: 622 ANGLES<br \/>\nInfo: 1049 DIHEDRAL<br \/>\nInfo: 73 IMPROPER<br \/>\nInfo: 0 CROSSTERM<br \/>\nInfo: 130 VDW<br \/>\nInfo: 0 VDW_PAIRS<br \/>\nInfo: 0 NBTHOLE_PAIRS<br \/>\nInfo: TIME FOR READING PSF FILE: 0.0028851<br \/>\nInfo:<br \/>\nInfo: Entering startup at 0.359806 s, 85.5703 MB of memory in use<br \/>\nInfo: Startup phase 0 took 8.01086e-05 s, 85.5703 MB of memory in use<br \/>\nWarning: an empty exclusion signature with index 709!<br \/>\nInfo: Startup phase 1 took 0.000342846 s, 85.5703 MB of memory in use<br \/>\nInfo: NONBONDED TABLE R-SQUARED SPACING: 0.0625<br \/>\nInfo: NONBONDED TABLE SIZE: 769 POINTS<br \/>\nInfo: INCONSISTENCY IN FAST TABLE ENERGY VS FORCE: 0.000325096 AT 11.9556<br \/>\nInfo: INCONSISTENCY IN SCOR TABLE ENERGY VS FORCE: 0.000324844 AT 11.9556<br \/>\nInfo: INCONSISTENCY IN VDWA TABLE ENERGY VS FORCE: 0.0040507 AT 0.251946<br \/>\nInfo: INCONSISTENCY IN VDWB TABLE ENERGY VS FORCE: 0.00150189 AT 0.251946<br \/>\nInfo: Running with 2 input processors.<br \/>\nInfo: 
Running with 1 output processors (1 of them will output simultaneously).<br \/>\nInfo: INPUT PROC LOCATIONS: 8 16<br \/>\nInfo: OUTPUT PROC LOCATIONS: 32<br \/>\nInfo: Startup phase 2 took 0.00796604 s, 241.273 MB of memory in use<br \/>\nInfo: Startup phase 3 took 0.049921 s, 244.605 MB of memory in use<br \/>\nInfo: PATCH GRID IS 13 (PERIODIC) BY 13 (PERIODIC) BY 13 (PERIODIC)<br \/>\nInfo: PATCH GRID IS 1-AWAY BY 1-AWAY BY 1-AWAY<br \/>\nInfo: LOADED 1810196 TOTAL EXCLUSIONS<br \/>\nInfo: REMOVING COM VELOCITY -0.00436736 -0.0116608 0.0017952<br \/>\nInfo: Startup phase 4 took 0.111289 s, 559.586 MB of memory in use<br \/>\nInfo: ****************************<br \/>\nInfo: STRUCTURE SUMMARY:<br \/>\nInfo: 1066628 ATOMS<br \/>\nInfo: 769956 BONDS<br \/>\nInfo: 605872 ANGLES<br \/>\nInfo: 450875 DIHEDRALS<br \/>\nInfo: 24578 IMPROPERS<br \/>\nInfo: 0 CROSSTERMS<br \/>\nInfo: 0 EXCLUSIONS<br \/>\nInfo: 977416 RIGID BONDS<br \/>\nInfo: 2222468 DEGREES OF FREEDOM<br \/>\nInfo: 389067 HYDROGEN GROUPS<br \/>\nInfo: 4 ATOMS IN LARGEST HYDROGEN GROUP<br \/>\nInfo: 389067 MIGRATION GROUPS<br \/>\nInfo: 4 ATOMS IN LARGEST MIGRATION GROUP<br \/>\nInfo: TOTAL MASS = 6.69877e+06 amu<br \/>\nInfo: TOTAL CHARGE = 0.000168104 e<br \/>\nInfo: MASS DENSITY = 1.09115 g\/cm^3<br \/>\nInfo: ATOM DENSITY = 0.104627 atoms\/A^3<br \/>\nInfo: *****************************<br \/>\nInfo: LARGEST PATCH (1044) HAS 541 ATOMS<br \/>\nInfo: Startup phase 5 took 0.0533659 s, 565.773 MB of memory in use<br \/>\nInfo: TORUS A SIZE 1 USING 0<br \/>\nInfo: TORUS B SIZE 1 USING 0<br \/>\nInfo: TORUS C SIZE 1 USING 0<br \/>\nInfo: TORUS MINIMAL MESH SIZE IS 1 BY 1 BY 1<br \/>\nInfo: Placed 100% of base nodes on same physical node as patch<br \/>\nInfo: Startup phase 6 took 0.00464511 s, 573.051 MB of memory in use<br \/>\nInfo: PME using 1 x 1 x 1 pencil grid for FFT and reciprocal sum.<br \/>\nInfo: Updated CUDA force table with 4096 elements.<br \/>\nInfo: Updated CUDA LJ table with 130 x 130 
elements.<br \/>\nInfo: Updated CUDA force table with 4096 elements.<br \/>\nInfo: Updated CUDA LJ table with 130 x 130 elements.<br \/>\nInfo: Updated CUDA force table with 4096 elements.<br \/>\nInfo: Updated CUDA LJ table with 130 x 130 elements.<br \/>\nInfo: Updated CUDA force table with 4096 elements.<br \/>\nInfo: Updated CUDA LJ table with 130 x 130 elements.<br \/>\nInfo: Startup phase 7 took 3.33094 s, 1593.53 MB of memory in use<br \/>\nInfo: Startup phase 8 took 0.00655293 s, 1595.91 MB of memory in use<br \/>\nLDB: Hybrid LB being created...<br \/>\nHybridBaseLB: ThreeLevelTree is created.<br \/>\nInfo: Startup phase 9 took 0.0024569 s, 1595.91 MB of memory in use<br \/>\nInfo: CREATING 46457 COMPUTE OBJECTS<br \/>\nInfo: Found 333 unique exclusion lists needing 1076 bytes<br \/>\nInfo: Found 333 unique exclusion lists needing 1076 bytes<br \/>\nInfo: Found 333 unique exclusion lists needing 1076 bytes<br \/>\nInfo: Found 333 unique exclusion lists needing 1076 bytes<br \/>\nInfo: useSync: 0 useProxySync: 0<br \/>\nInfo: Startup phase 10 took 0.068258 s, 1634.8 MB of memory in use<br \/>\nInfo: Startup phase 11 took 0.000103951 s, 1634.8 MB of memory in use<br \/>\nInfo: Startup phase 12 took 0.00266719 s, 1641.88 MB of memory in use<br \/>\nInfo: Finished startup at 3.99839 s, 1641.88 MB of memory in use<br \/>\n(output omitted)<br \/>\nInfo: Benchmark time: 40 CPUs 0.0229673 s\/step <strong>0.265825 days\/ns<\/strong> 2834.73 MB memory<br \/>\n\u2190days\/ns, Less Is Better<br \/>\n<img loading=\"lazy\" decoding=\"async\" src=\"https:\/\/wp.study3.biz\/wp-content\/uploads\/2020\/06\/namd-2.13-cpu.jpg\" alt=\"\" width=\"904\" height=\"408\" class=\"alignnone size-full wp-image-2889\" \/><\/p>\n","protected":false},"excerpt":{"rendered":"<p>[chibi@centos8 ~]$ sudo nvidia-docker run -it --rm nvcr.io\/hpc\/namd:2.12-171025 \/opt\/namd\/namd-multicore- &hellip; <a href=\"https:\/\/wp.study3.biz\/?p=2880\">Continue reading <span 
class=\"meta-nav\">&rarr;<\/span><\/a><\/p>\n","protected":false},"author":1,"featured_media":0,"comment_status":"open","ping_status":"open","sticky":false,"template":"","format":"standard","meta":{"_jetpack_memberships_contains_paid_content":false,"footnotes":""},"categories":[5,18],"tags":[],"class_list":["post-2880","post","type-post","status-publish","format-standard","hentry","category-centos8","category-nvidia"],"jetpack_featured_media_url":"","jetpack_sharing_enabled":true,"_links":{"self":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/2880","targetHints":{"allow":["GET"]}}],"collection":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts"}],"about":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/types\/post"}],"author":[{"embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/users\/1"}],"replies":[{"embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcomments&post=2880"}],"version-history":[{"count":3,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/2880\/revisions"}],"predecessor-version":[{"id":3491,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=\/wp\/v2\/posts\/2880\/revisions\/3491"}],"wp:attachment":[{"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fmedia&parent=2880"}],"wp:term":[{"taxonomy":"category","embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Fcategories&post=2880"},{"taxonomy":"post_tag","embeddable":true,"href":"https:\/\/wp.study3.biz\/index.php?rest_route=%2Fwp%2Fv2%2Ftags&post=2880"}],"curies":[{"name":"wp","href":"https:\/\/api.w.org\/{rel}","templated":true}]}}