Closed erwincoumans closed 3 years ago
On a MacBook Pro 14" with an M1 Pro (8 cores: 6 performance, 2 efficiency), training a custom Ant runs at almost 100k steps per second, using OpenMP and C code generated with CppADCodeGen. Compile with the Homebrew clang from this change, then run examples/ars/ars_train_policy_omp_ant
Time     Iteration  AverageReward  MaxRewardRollout  MinRewardRollout  timesteps
23.93    10         818.099707     1074.45           42.9172           1932001
51.526   20         275.611223     679.283           -403.851          4333386
79.205   30         1024.878382    1303.88           106.159           6824583
106.478  40         1276.533419    1570.6            -249.706          9472280
133.704  50         1756.318418    1846.07           1428.89           12146026
162.837  60         1778.257093    1884.86           1524.56           14825677
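As a rough check of the "almost 100k steps per second" figure, the log can be turned into an average throughput. This is only a sketch: it assumes the Time column is elapsed wall-clock seconds and that timesteps is cumulative, neither of which the log states explicitly. The helper name steps_per_second is made up for illustration.

```shell
#!/bin/sh
# Hypothetical helper: average throughput = cumulative timesteps / elapsed time.
# Assumes Time is in seconds and timesteps is a running total (not stated in the log).
steps_per_second() {
  awk -v steps="$1" -v secs="$2" 'BEGIN { printf "%.0f\n", steps / secs }'
}

steps_per_second 1932001 23.93      # first logged row (iteration 10)
steps_per_second 14825677 162.837   # last logged row (iteration 60), prints 91046
```

Under those assumptions the run averages roughly 91k steps per second over the first 60 iterations, which is consistent with the figure quoted above.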
Allow using clang with OpenMP on macOS (tested on a Mac M1 Pro, ARM):

    brew update
    brew install llvm libomp
    cmake -DCMAKE_C_COMPILER="/opt/homebrew/opt/llvm/bin/clang" -DCMAKE_CXX_COMPILER="/opt/homebrew/opt/llvm/bin/clang++" -DUSE_APPLE_HOMEBREW_CLANG=ON ..
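The compiler paths above are hard-coded to /opt/homebrew, which is where Homebrew installs on Apple Silicon. A minimal sketch of deriving the same flags from a prefix might look like the following. The function name homebrew_clang_flags and the build directory are assumptions for illustration, not part of this change; `brew --prefix llvm` is the standard Homebrew command for looking up a formula's install prefix.

```shell
#!/bin/sh
# Hypothetical helper: print the cmake flags from this change for a given LLVM prefix.
# Defaults to the Apple Silicon Homebrew location used in the commands above.
homebrew_clang_flags() {
  llvm_prefix="${1:-/opt/homebrew/opt/llvm}"
  printf '%s\n' \
    "-DCMAKE_C_COMPILER=${llvm_prefix}/bin/clang" \
    "-DCMAKE_CXX_COMPILER=${llvm_prefix}/bin/clang++" \
    "-DUSE_APPLE_HOMEBREW_CLANG=ON"
}

# Example use (not executed here; assumes an out-of-source "build" directory):
#   mkdir -p build && cd build
#   cmake $(homebrew_clang_flags "$(brew --prefix llvm)") ..
```

This has only been reasoned through, not tested on an Intel Mac, where Homebrew typically lives under /usr/local instead.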