Issue
GAMER GPU test on the new cluster (caler) suggests that performance (validated using `Perf_Overall` in `Record__Performance`) keeps increasing without saturation as the number of MPI ranks increases. The best performance is achieved with 16 MPI ranks, while using 32 MPI ranks exceeds the GPU memory limit. The time-step-averaged `Perf_Overall` summary is given below (`TIMING_SOLVER` off).
Performance test
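For anyone wanting to reproduce the time-step-averaged `Perf_Overall` numbers quoted in this issue, here is a minimal Python sketch. It assumes `Record__Performance` is a whitespace-separated table with one `#`-prefixed header line naming the columns, followed by one numeric row per time step. The helper name `average_column` is hypothetical and not part of GAMER.

```python
from statistics import mean


def average_column(text, column="Perf_Overall"):
    """Return the mean of `column` over all data rows in `text`.

    Assumed layout (not verified against every GAMER version):
    a '#'-prefixed header line naming the columns, then
    whitespace-separated numeric rows, one per time step.
    """
    names = None
    values = []
    for line in text.splitlines():
        stripped = line.strip()
        if not stripped:
            continue  # skip blank lines
        if stripped.startswith("#"):
            fields = stripped.lstrip("#").split()
            # Use the first comment line that mentions the column as the header.
            if names is None and column in fields:
                names = fields
            continue
        if names is None:
            raise ValueError(f"no header line containing {column!r} found before data")
        row = stripped.split()
        values.append(float(row[names.index(column)]))
    if not values:
        raise ValueError("no data rows found")
    return mean(values)
```

Typical use would be `average_column(open("Record__Performance").read())`, run once per MPI-rank configuration. Whether the first few steps should be excluded from the average is a separate choice that this sketch does not make.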
ClusterMerger problem with the patch number further increased (by adjusting the AMR refinement criteria) compared to the default test problem setting
FLUID + PARTICLE
20
`NX0_TOT_X`: 128; `NX0_TOT_Y`: 128; `NX0_TOT_Z`: 128
`MAX_LEVEL`: 3
`Input__Flag_Rho`, `Input__Flag_NParPatch` and `Input__Flag_Lohner` (Dens). The content of each file is shown below:
Input__Flag_Rho
Input__Flag_NParPatch
Input__Flag_Lohner