pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
https://pytorch.org/rl
MIT License
2.27k stars 302 forks source link

[BugFix] Fix another ctx test #2284

Closed vmoens closed 3 months ago

pytorch-bot[bot] commented 3 months ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2284

Note: Links to docs will display an error until the docs builds have been completed.

:x: 1 New Failure, 1 Cancelled Job, 1 Unrelated Failure

As of commit 980c015ea75f3c8db30e1f4e542a0af44f7b15d7 with merge base 28acf6159d81c59c313c2626d8cd8a72a8212ee4 (image):

NEW FAILURE - The following job has failed:

* [Habitat Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2284#27266833552) ([gh](https://github.com/pytorch/rl/actions/runs/9873791033/job/27266833552)) `RuntimeError: Command docker exec -t 4bba8cb5bb00b77a014c5a35629316407c28e58ec9903e92f8f102114641e195 /exec failed with exit code 139`

CANCELLED JOB - The following job was cancelled. Please retry:

* [Unit-tests on Linux / tests-gpu (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2284#27266848303) ([gh](https://github.com/pytorch/rl/actions/runs/9873791023/job/27266848303)) `##[error]The operation was canceled.`

BROKEN TRUNK - The following job failed but were present on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

* [Unit-tests on Windows / unittests-cpu / windows-job](https://hud.pytorch.org/pr/pytorch/rl/2284#27266832483) ([gh](https://github.com/pytorch/rl/actions/runs/9873791024/job/27266832483)) ([trunk failure](https://hud.pytorch.org/pytorch/rl/commit/28acf6159d81c59c313c2626d8cd8a72a8212ee4#27263815657)) `test/test_transforms.py::TestActionDiscretizer::test_trans_parallel_env_check[False]`

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions[bot] commented 3 months ago

$\color{#D29922}\textsf{\Large\⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 91. Improved: $\large\color{#35bf28}12$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results | Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change | | ----------------------------------------------------------------------------------------- | --------- | --------- | --------------- | ------------------ | ----------------------------------- | | test_single | 0.1236s | 62.0274ms | 16.1219 Ops/s | 17.1996 Ops/s | $\textbf{\color{#d91a1a}-6.27\\%}$ | | test_sync | 39.2521ms | 32.7140ms | 30.5680 Ops/s | 30.4356 Ops/s | $\color{#35bf28}+0.43\\%$ | | test_async | 59.6516ms | 29.4464ms | 33.9600 Ops/s | 33.1581 Ops/s | $\color{#35bf28}+2.42\\%$ | | test_simple | 0.3993s | 0.3968s | 2.5202 Ops/s | 2.5535 Ops/s | $\color{#d91a1a}-1.31\\%$ | | test_transformed | 0.5711s | 0.5691s | 1.7571 Ops/s | 1.7764 Ops/s | $\color{#d91a1a}-1.09\\%$ | | test_serial | 1.4117s | 1.3558s | 0.7376 Ops/s | 0.7727 Ops/s | $\color{#d91a1a}-4.54\\%$ | | test_parallel | 1.1936s | 1.1184s | 0.8942 Ops/s | 0.8691 Ops/s | $\color{#35bf28}+2.88\\%$ | | test_step_mdp_speed[True-True-True-True-True] | 0.2078ms | 22.1102μs | 45.2280 KOps/s | 44.6475 KOps/s | $\color{#35bf28}+1.30\\%$ | | test_step_mdp_speed[True-True-True-True-False] | 62.7160μs | 13.2335μs | 75.5659 KOps/s | 76.5695 KOps/s | $\color{#d91a1a}-1.31\\%$ | | test_step_mdp_speed[True-True-True-False-True] | 42.2880μs | 12.9841μs | 77.0171 KOps/s | 76.7369 KOps/s | $\color{#35bf28}+0.37\\%$ | | test_step_mdp_speed[True-True-True-False-False] | 67.5960μs | 7.6460μs | 130.7879 KOps/s | 130.8563 KOps/s | $\color{#d91a1a}-0.05\\%$ | | test_step_mdp_speed[True-True-False-True-True] | 77.1330μs | 23.8074μs | 42.0038 KOps/s | 41.6350 KOps/s | $\color{#35bf28}+0.89\\%$ | | test_step_mdp_speed[True-True-False-True-False] | 84.6670μs | 14.5382μs | 68.7844 KOps/s | 68.5273 KOps/s | $\color{#35bf28}+0.38\\%$ | | test_step_mdp_speed[True-True-False-False-True] | 56.3950μs | 14.2862μs | 69.9976 KOps/s | 69.7266 KOps/s | $\color{#35bf28}+0.39\\%$ | | test_step_mdp_speed[True-True-False-False-False] | 58.2480μs | 8.8723μs | 112.7102 KOps/s | 112.2053 KOps/s | $\color{#35bf28}+0.45\\%$ | | test_step_mdp_speed[True-False-True-True-True] | 59.6010μs | 25.2372μs | 39.6240 KOps/s | 39.6373 KOps/s | $\color{#d91a1a}-0.03\\%$ | | test_step_mdp_speed[True-False-True-True-False] | 69.0680μs | 15.9094μs | 62.8558 KOps/s | 62.7034 KOps/s | $\color{#35bf28}+0.24\\%$ | | test_step_mdp_speed[True-False-True-False-True] | 70.7610μs | 14.3102μs | 69.8801 KOps/s | 70.4176 KOps/s | $\color{#d91a1a}-0.76\\%$ | | test_step_mdp_speed[True-False-True-False-False] | 72.1140μs | 8.7916μs | 113.7444 KOps/s | 112.1206 KOps/s | $\color{#35bf28}+1.45\\%$ | | test_step_mdp_speed[True-False-False-True-True] | 73.7770μs | 26.4330μs | 37.8314 KOps/s | 38.0538 KOps/s | $\color{#d91a1a}-0.58\\%$ | | test_step_mdp_speed[True-False-False-True-False] | 96.5490μs | 17.1331μs | 58.3666 KOps/s | 58.8289 KOps/s | $\color{#d91a1a}-0.79\\%$ | | test_step_mdp_speed[True-False-False-False-True] | 0.1107ms | 15.6151μs | 64.0407 KOps/s | 65.2447 KOps/s | $\color{#d91a1a}-1.85\\%$ | | test_step_mdp_speed[True-False-False-False-False] | 39.9350μs | 10.0390μs | 99.6116 KOps/s | 99.4532 KOps/s | $\color{#35bf28}+0.16\\%$ | | test_step_mdp_speed[False-True-True-True-True] | 76.4220μs | 25.1483μs | 39.7641 KOps/s | 39.9090 KOps/s | $\color{#d91a1a}-0.36\\%$ | | test_step_mdp_speed[False-True-True-True-False] | 80.0090μs | 15.7940μs | 63.3152 KOps/s | 63.2266 KOps/s | $\color{#35bf28}+0.14\\%$ | | test_step_mdp_speed[False-True-True-False-True] | 60.8330μs | 16.3904μs | 61.0114 KOps/s | 60.7839 KOps/s | $\color{#35bf28}+0.37\\%$ | | test_step_mdp_speed[False-True-True-False-False] | 59.2900μs | 10.0519μs | 99.4840 KOps/s | 99.2647 KOps/s | $\color{#35bf28}+0.22\\%$ | | test_step_mdp_speed[False-True-False-True-True] | 64.3700μs | 26.2623μs | 38.0774 KOps/s | 38.0491 KOps/s | $\color{#35bf28}+0.07\\%$ | | test_step_mdp_speed[False-True-False-True-False] | 67.1350μs | 17.0262μs | 58.7332 KOps/s | 58.5565 KOps/s | $\color{#35bf28}+0.30\\%$ | | test_step_mdp_speed[False-True-False-False-True] | 0.2321ms | 17.8593μs | 55.9931 KOps/s | 56.2293 KOps/s | $\color{#d91a1a}-0.42\\%$ | | test_step_mdp_speed[False-True-False-False-False] | 77.2730μs | 11.3025μs | 88.4761 KOps/s | 87.9374 KOps/s | $\color{#35bf28}+0.61\\%$ | | test_step_mdp_speed[False-False-True-True-True] | 64.8700μs | 27.7398μs | 36.0492 KOps/s | 35.9301 KOps/s | $\color{#35bf28}+0.33\\%$ | | test_step_mdp_speed[False-False-True-True-False] | 74.6790μs | 18.3779μs | 54.4132 KOps/s | 54.8701 KOps/s | $\color{#d91a1a}-0.83\\%$ | | test_step_mdp_speed[False-False-True-False-True] | 46.1050μs | 17.7325μs | 56.3937 KOps/s | 56.4269 KOps/s | $\color{#d91a1a}-0.06\\%$ | | test_step_mdp_speed[False-False-True-False-False] | 54.9520μs | 11.2956μs | 88.5302 KOps/s | 88.7565 KOps/s | $\color{#d91a1a}-0.25\\%$ | | test_step_mdp_speed[False-False-False-True-True] | 42.8990μs | 29.2665μs | 34.1687 KOps/s | 34.5921 KOps/s | $\color{#d91a1a}-1.22\\%$ | | test_step_mdp_speed[False-False-False-True-False] | 73.8370μs | 19.6535μs | 50.8815 KOps/s | 49.8150 KOps/s | $\color{#35bf28}+2.14\\%$ | | test_step_mdp_speed[False-False-False-False-True] | 74.6680μs | 18.7096μs | 53.4484 KOps/s | 52.9851 KOps/s | $\color{#35bf28}+0.87\\%$ | | test_step_mdp_speed[False-False-False-False-False] | 67.7260μs | 12.5017μs | 79.9892 KOps/s | 79.7274 KOps/s | $\color{#35bf28}+0.33\\%$ | | test_values[generalized_advantage_estimate-True-True] | 15.1041ms | 10.0623ms | 99.3812 Ops/s | 103.1299 Ops/s | $\color{#d91a1a}-3.63\\%$ | | test_values[vec_generalized_advantage_estimate-True-True] | 38.0457ms | 33.8792ms | 29.5166 Ops/s | 27.7131 Ops/s | $\textbf{\color{#35bf28}+6.51\\%}$ | | test_values[td0_return_estimate-False-False] | 0.2631ms | 0.1832ms | 5.4570 KOps/s | 5.3875 KOps/s | $\color{#35bf28}+1.29\\%$ | | test_values[td1_return_estimate-False-False] | 26.6514ms | 24.9959ms | 40.0066 Ops/s | 40.9028 Ops/s | $\color{#d91a1a}-2.19\\%$ | | test_values[vec_td1_return_estimate-False-False] | 35.9110ms | 34.0032ms | 29.4090 Ops/s | 27.8177 Ops/s | $\textbf{\color{#35bf28}+5.72\\%}$ | | test_values[td_lambda_return_estimate-True-False] | 39.5583ms | 36.0758ms | 27.7194 Ops/s | 28.3793 Ops/s | $\color{#d91a1a}-2.33\\%$ | | test_values[vec_td_lambda_return_estimate-True-False] | 36.1962ms | 34.0335ms | 29.3828 Ops/s | 27.7935 Ops/s | $\textbf{\color{#35bf28}+5.72\\%}$ | | test_gae_speed[generalized_advantage_estimate-False-1-512] | 11.5392ms | 8.5474ms | 116.9940 Ops/s | 121.0462 Ops/s | $\color{#d91a1a}-3.35\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.8239ms | 2.1028ms | 475.5614 Ops/s | 495.6666 Ops/s | $\color{#d91a1a}-4.06\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.4712ms | 0.3629ms | 2.7556 KOps/s | 2.7464 KOps/s | $\color{#35bf28}+0.33\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 43.2333ms | 40.7101ms | 24.5639 Ops/s | 21.0087 Ops/s | $\textbf{\color{#35bf28}+16.92\\%}$ | | test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 4.0149ms | 3.0465ms | 328.2403 Ops/s | 324.6104 Ops/s | $\color{#35bf28}+1.12\\%$ | | test_dqn_speed | 1.8656ms | 1.3892ms | 719.8485 Ops/s | 717.4081 Ops/s | $\color{#35bf28}+0.34\\%$ | | test_ddpg_speed | 3.6852ms | 2.9593ms | 337.9164 Ops/s | 341.5548 Ops/s | $\color{#d91a1a}-1.07\\%$ | | test_sac_speed | 11.0745ms | 9.0280ms | 110.7662 Ops/s | 113.5806 Ops/s | $\color{#d91a1a}-2.48\\%$ | | test_redq_speed | 22.9840ms | 14.2266ms | 70.2908 Ops/s | 63.3450 Ops/s | $\textbf{\color{#35bf28}+10.96\\%}$ | | test_redq_deprec_speed | 16.6398ms | 14.1655ms | 70.5942 Ops/s | 68.9229 Ops/s | $\color{#35bf28}+2.42\\%$ | | test_td3_speed | 19.1551ms | 8.8762ms | 112.6607 Ops/s | 109.5721 Ops/s | $\color{#35bf28}+2.82\\%$ | | test_cql_speed | 48.2205ms | 38.7583ms | 25.8009 Ops/s | 26.1949 Ops/s | $\color{#d91a1a}-1.50\\%$ | | test_a2c_speed | 8.6923ms | 7.8064ms | 128.0997 Ops/s | 119.8297 Ops/s | $\textbf{\color{#35bf28}+6.90\\%}$ | | test_ppo_speed | 10.2109ms | 8.1437ms | 122.7948 Ops/s | 116.0275 Ops/s | $\textbf{\color{#35bf28}+5.83\\%}$ | | test_reinforce_speed | 7.8250ms | 6.8092ms | 146.8606 Ops/s | 138.5118 Ops/s | $\textbf{\color{#35bf28}+6.03\\%}$ | | test_iql_speed | 35.3283ms | 33.7710ms | 29.6112 Ops/s | 29.1461 Ops/s | $\color{#35bf28}+1.60\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.8580ms | 3.8519ms | 259.6141 Ops/s | 257.6367 Ops/s | $\color{#35bf28}+0.77\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.7240ms | 0.5188ms | 1.9274 KOps/s | 1.8849 KOps/s | $\color{#35bf28}+2.25\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 7.4821ms | 0.5027ms | 1.9893 KOps/s | 1.9929 KOps/s | $\color{#d91a1a}-0.18\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.8372ms | 3.7494ms | 266.7120 Ops/s | 259.0538 Ops/s | $\color{#35bf28}+2.96\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.8122ms | 0.5164ms | 1.9365 KOps/s | 1.8854 KOps/s | $\color{#35bf28}+2.71\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 7.5376ms | 0.5003ms | 1.9987 KOps/s | 2.0222 KOps/s | $\color{#d91a1a}-1.16\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.9993ms | 1.7806ms | 561.6175 Ops/s | 561.4024 Ops/s | $\color{#35bf28}+0.04\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.1704ms | 1.6858ms | 593.1794 Ops/s | 595.4538 Ops/s | $\color{#d91a1a}-0.38\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.8431ms | 3.8987ms | 256.4973 Ops/s | 252.2047 Ops/s | $\color{#35bf28}+1.70\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.9856ms | 0.6641ms | 1.5058 KOps/s | 1.4784 KOps/s | $\color{#35bf28}+1.86\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 7.8828ms | 0.6429ms | 1.5555 KOps/s | 1.5221 KOps/s | $\color{#35bf28}+2.19\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 3.9805ms | 3.7409ms | 267.3175 Ops/s | 246.8247 Ops/s | $\textbf{\color{#35bf28}+8.30\\%}$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.8919ms | 0.5247ms | 1.9057 KOps/s | 1.8569 KOps/s | $\color{#35bf28}+2.63\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 7.5303ms | 0.5102ms | 1.9602 KOps/s | 1.9603 KOps/s | $-0.00\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.8450ms | 3.8185ms | 261.8861 Ops/s | 261.4273 Ops/s | $\color{#35bf28}+0.18\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.0922ms | 0.5224ms | 1.9143 KOps/s | 1.8782 KOps/s | $\color{#35bf28}+1.93\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6484ms | 0.4893ms | 2.0438 KOps/s | 1.9836 KOps/s | $\color{#35bf28}+3.03\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.2372ms | 3.9266ms | 254.6738 Ops/s | 245.2135 Ops/s | $\color{#35bf28}+3.86\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.0393ms | 0.6719ms | 1.4884 KOps/s | 1.4693 KOps/s | $\color{#35bf28}+1.30\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 7.6451ms | 0.6476ms | 1.5442 KOps/s | 1.4643 KOps/s | $\textbf{\color{#35bf28}+5.46\\%}$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.1178s | 6.0009ms | 166.6428 Ops/s | 150.5103 Ops/s | $\textbf{\color{#35bf28}+10.72\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 18.0291ms | 13.5588ms | 73.7527 Ops/s | 73.7555 Ops/s | $-0.00\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 1.9554ms | 1.1642ms | 858.9301 Ops/s | 840.9823 Ops/s | $\color{#35bf28}+2.13\\%$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.1276s | 8.5393ms | 117.1053 Ops/s | 112.6172 Ops/s | $\color{#35bf28}+3.99\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 20.4603ms | 14.2386ms | 70.2317 Ops/s | 73.9609 Ops/s | $\textbf{\color{#d91a1a}-5.04\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 1.9660ms | 1.2639ms | 791.2116 Ops/s | 850.9677 Ops/s | $\textbf{\color{#d91a1a}-7.02\\%}$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.1262s | 6.4135ms | 155.9211 Ops/s | 152.4801 Ops/s | $\color{#35bf28}+2.26\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 18.8567ms | 13.9840ms | 71.5102 Ops/s | 74.0828 Ops/s | $\color{#d91a1a}-3.47\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 2.0954ms | 1.2794ms | 781.6162 Ops/s | 739.2218 Ops/s | $\textbf{\color{#35bf28}+5.73\\%}$ |
github-actions[bot] commented 3 months ago

$\color{#D29922}\textsf{\Large\⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 94. Improved: $\large\color{#35bf28}3$. Worsened: $\large\color{#d91a1a}4$.

Expand to view detailed results | Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change | | ----------------------------------------------------------------------------------------- | --------- | --------- | -------------- | ------------------ | ----------------------------------- | | test_single | 0.1649s | 0.1108s | 9.0279 Ops/s | 9.3925 Ops/s | $\color{#d91a1a}-3.88\\%$ | | test_sync | 94.5710ms | 91.6296ms | 10.9135 Ops/s | 10.9712 Ops/s | $\color{#d91a1a}-0.53\\%$ | | test_async | 0.1784s | 89.9748ms | 11.1142 Ops/s | 11.7030 Ops/s | $\textbf{\color{#d91a1a}-5.03\\%}$ | | test_single_pixels | 0.1162s | 0.1154s | 8.6675 Ops/s | 8.6539 Ops/s | $\color{#35bf28}+0.16\\%$ | | test_sync_pixels | 71.2782ms | 69.5777ms | 14.3724 Ops/s | 14.2193 Ops/s | $\color{#35bf28}+1.08\\%$ | | test_async_pixels | 0.1218s | 57.7622ms | 17.3124 Ops/s | 17.4680 Ops/s | $\color{#d91a1a}-0.89\\%$ | | test_simple | 0.7679s | 0.7406s | 1.3503 Ops/s | 1.3201 Ops/s | $\color{#35bf28}+2.29\\%$ | | test_transformed | 1.0598s | 0.9998s | 1.0002 Ops/s | 1.0004 Ops/s | $\color{#d91a1a}-0.03\\%$ | | test_serial | 2.2356s | 2.1976s | 0.4550 Ops/s | 0.4530 Ops/s | $\color{#35bf28}+0.45\\%$ | | test_parallel | 1.9348s | 1.8889s | 0.5294 Ops/s | 0.5269 Ops/s | $\color{#35bf28}+0.48\\%$ | | test_step_mdp_speed[True-True-True-True-True] | 91.4810μs | 33.8828μs | 29.5135 KOps/s | 29.4986 KOps/s | $\color{#35bf28}+0.05\\%$ | | test_step_mdp_speed[True-True-True-True-False] | 46.2910μs | 19.9273μs | 50.1823 KOps/s | 51.3215 KOps/s | $\color{#d91a1a}-2.22\\%$ | | test_step_mdp_speed[True-True-True-False-True] | 40.9710μs | 19.1678μs | 52.1710 KOps/s | 52.1135 KOps/s | $\color{#35bf28}+0.11\\%$ | | test_step_mdp_speed[True-True-True-False-False] | 32.2010μs | 11.0959μs | 90.1234 KOps/s | 90.5069 KOps/s | $\color{#d91a1a}-0.42\\%$ | | test_step_mdp_speed[True-True-False-True-True] | 73.1810μs | 35.4563μs | 28.2037 KOps/s | 27.6178 KOps/s | $\color{#35bf28}+2.12\\%$ | | test_step_mdp_speed[True-True-False-True-False] | 46.3900μs | 21.6718μs | 46.1429 KOps/s | 47.0432 KOps/s | $\color{#d91a1a}-1.91\\%$ | | test_step_mdp_speed[True-True-False-False-True] | 51.9210μs | 20.8841μs | 47.8833 KOps/s | 46.9472 KOps/s | $\color{#35bf28}+1.99\\%$ | | test_step_mdp_speed[True-True-False-False-False] | 45.5900μs | 12.9847μs | 77.0135 KOps/s | 77.5116 KOps/s | $\color{#d91a1a}-0.64\\%$ | | test_step_mdp_speed[True-False-True-True-True] | 80.0110μs | 37.5408μs | 26.6377 KOps/s | 26.7141 KOps/s | $\color{#d91a1a}-0.29\\%$ | | test_step_mdp_speed[True-False-True-True-False] | 88.6830μs | 23.4728μs | 42.6024 KOps/s | 43.1224 KOps/s | $\color{#d91a1a}-1.21\\%$ | | test_step_mdp_speed[True-False-True-False-True] | 57.5710μs | 20.9862μs | 47.6505 KOps/s | 47.1282 KOps/s | $\color{#35bf28}+1.11\\%$ | | test_step_mdp_speed[True-False-True-False-False] | 27.7110μs | 12.9098μs | 77.4605 KOps/s | 77.5313 KOps/s | $\color{#d91a1a}-0.09\\%$ | | test_step_mdp_speed[True-False-False-True-True] | 76.4410μs | 39.2365μs | 25.4865 KOps/s | 25.4536 KOps/s | $\color{#35bf28}+0.13\\%$ | | test_step_mdp_speed[True-False-False-True-False] | 45.7610μs | 25.4896μs | 39.2316 KOps/s | 40.2201 KOps/s | $\color{#d91a1a}-2.46\\%$ | | test_step_mdp_speed[True-False-False-False-True] | 39.3910μs | 22.9537μs | 43.5659 KOps/s | 43.6912 KOps/s | $\color{#d91a1a}-0.29\\%$ | | test_step_mdp_speed[True-False-False-False-False] | 45.5110μs | 14.7285μs | 67.8954 KOps/s | 67.7910 KOps/s | $\color{#35bf28}+0.15\\%$ | | test_step_mdp_speed[False-True-True-True-True] | 75.3110μs | 37.3366μs | 26.7834 KOps/s | 26.5137 KOps/s | $\color{#35bf28}+1.02\\%$ | | test_step_mdp_speed[False-True-True-True-False] | 47.0410μs | 23.4297μs | 42.6808 KOps/s | 42.7297 KOps/s | $\color{#d91a1a}-0.11\\%$ | | test_step_mdp_speed[False-True-True-False-True] | 50.2910μs | 24.8580μs | 40.2285 KOps/s | 39.9805 KOps/s | $\color{#35bf28}+0.62\\%$ | | test_step_mdp_speed[False-True-True-False-False] | 32.5710μs | 14.7463μs | 67.8134 KOps/s | 67.9382 KOps/s | $\color{#d91a1a}-0.18\\%$ | | test_step_mdp_speed[False-True-False-True-True] | 64.3310μs | 39.2443μs | 25.4814 KOps/s | 25.3891 KOps/s | $\color{#35bf28}+0.36\\%$ | | test_step_mdp_speed[False-True-False-True-False] | 52.4710μs | 25.1561μs | 39.7518 KOps/s | 39.6748 KOps/s | $\color{#35bf28}+0.19\\%$ | | test_step_mdp_speed[False-True-False-False-True] | 44.3900μs | 26.3631μs | 37.9318 KOps/s | 36.5189 KOps/s | $\color{#35bf28}+3.87\\%$ | | test_step_mdp_speed[False-True-False-False-False] | 34.3310μs | 16.6374μs | 60.1056 KOps/s | 60.2745 KOps/s | $\color{#d91a1a}-0.28\\%$ | | test_step_mdp_speed[False-False-True-True-True] | 80.4910μs | 41.3009μs | 24.2126 KOps/s | 23.8889 KOps/s | $\color{#35bf28}+1.35\\%$ | | test_step_mdp_speed[False-False-True-True-False] | 58.1010μs | 27.1734μs | 36.8007 KOps/s | 36.6018 KOps/s | $\color{#35bf28}+0.54\\%$ | | test_step_mdp_speed[False-False-True-False-True] | 56.2510μs | 26.4956μs | 37.7420 KOps/s | 36.4272 KOps/s | $\color{#35bf28}+3.61\\%$ | | test_step_mdp_speed[False-False-True-False-False] | 37.4610μs | 16.7244μs | 59.7927 KOps/s | 58.9105 KOps/s | $\color{#35bf28}+1.50\\%$ | | test_step_mdp_speed[False-False-False-True-True] | 58.6110μs | 43.1311μs | 23.1851 KOps/s | 22.5913 KOps/s | $\color{#35bf28}+2.63\\%$ | | test_step_mdp_speed[False-False-False-True-False] | 66.5810μs | 29.1291μs | 34.3299 KOps/s | 34.7631 KOps/s | $\color{#d91a1a}-1.25\\%$ | | test_step_mdp_speed[False-False-False-False-True] | 52.8110μs | 28.2797μs | 35.3611 KOps/s | 35.3624 KOps/s | $-0.00\\%$ | | test_step_mdp_speed[False-False-False-False-False] | 35.9720μs | 18.4977μs | 54.0608 KOps/s | 54.9780 KOps/s | $\color{#d91a1a}-1.67\\%$ | | test_values[generalized_advantage_estimate-True-True] | 25.9854ms | 24.5307ms | 40.7652 Ops/s | 41.0089 Ops/s | $\color{#d91a1a}-0.59\\%$ | | test_values[vec_generalized_advantage_estimate-True-True] | 98.9109ms | 2.8807ms | 347.1378 Ops/s | 354.5405 Ops/s | $\color{#d91a1a}-2.09\\%$ | | test_values[td0_return_estimate-False-False] | 92.7920μs | 66.6403μs | 15.0059 KOps/s | 15.1235 KOps/s | $\color{#d91a1a}-0.78\\%$ | | test_values[td1_return_estimate-False-False] | 56.9670ms | 54.9723ms | 18.1910 Ops/s | 18.4626 Ops/s | $\color{#d91a1a}-1.47\\%$ | | test_values[vec_td1_return_estimate-False-False] | 1.4745ms | 1.0925ms | 915.3208 Ops/s | 921.3087 Ops/s | $\color{#d91a1a}-0.65\\%$ | | test_values[td_lambda_return_estimate-True-False] | 89.1192ms | 87.9241ms | 11.3735 Ops/s | 11.5175 Ops/s | $\color{#d91a1a}-1.25\\%$ | | test_values[vec_td_lambda_return_estimate-True-False] | 1.3136ms | 1.0825ms | 923.8172 Ops/s | 916.0633 Ops/s | $\color{#35bf28}+0.85\\%$ | | test_gae_speed[generalized_advantage_estimate-False-1-512] | 25.2400ms | 24.5477ms | 40.7370 Ops/s | 41.0352 Ops/s | $\color{#d91a1a}-0.73\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 0.9567ms | 0.7220ms | 1.3850 KOps/s | 1.3440 KOps/s | $\color{#35bf28}+3.05\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7889ms | 0.6724ms | 1.4872 KOps/s | 1.4922 KOps/s | $\color{#d91a1a}-0.34\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5591ms | 1.4731ms | 678.8327 Ops/s | 681.8067 Ops/s | $\color{#d91a1a}-0.44\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.7483ms | 0.6856ms | 1.4586 KOps/s | 1.4490 KOps/s | $\color{#35bf28}+0.66\\%$ | | test_dqn_speed | 7.8880ms | 1.4893ms | 671.4669 Ops/s | 683.2024 Ops/s | $\color{#d91a1a}-1.72\\%$ | | test_ddpg_speed | 3.2435ms | 2.9629ms | 337.5026 Ops/s | 337.1663 Ops/s | $\color{#35bf28}+0.10\\%$ | | test_sac_speed | 9.1684ms | 8.4314ms | 118.6047 Ops/s | 117.1391 Ops/s | $\color{#35bf28}+1.25\\%$ | | test_redq_speed | 0.1043s | 11.7619ms | 85.0206 Ops/s | 91.8707 Ops/s | $\textbf{\color{#d91a1a}-7.46\\%}$ | | test_redq_deprec_speed | 12.2648ms | 11.6680ms | 85.7046 Ops/s | 77.1717 Ops/s | $\textbf{\color{#35bf28}+11.06\\%}$ | | test_td3_speed | 8.4718ms | 8.3876ms | 119.2231 Ops/s | 119.4538 Ops/s | $\color{#d91a1a}-0.19\\%$ | | test_cql_speed | 26.3330ms | 25.7867ms | 38.7797 Ops/s | 38.7382 Ops/s | $\color{#35bf28}+0.11\\%$ | | test_a2c_speed | 6.0039ms | 5.7652ms | 173.4532 Ops/s | 174.1314 Ops/s | $\color{#d91a1a}-0.39\\%$ | | test_ppo_speed | 6.2540ms | 6.0367ms | 165.6537 Ops/s | 166.3632 Ops/s | $\color{#d91a1a}-0.43\\%$ | | test_reinforce_speed | 5.0146ms | 4.7324ms | 211.3111 Ops/s | 213.2725 Ops/s | $\color{#d91a1a}-0.92\\%$ | | test_iql_speed | 20.3268ms | 19.7241ms | 50.6993 Ops/s | 50.2035 Ops/s | $\color{#35bf28}+0.99\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.8827ms | 4.6506ms | 215.0275 Ops/s | 213.7783 Ops/s | $\color{#35bf28}+0.58\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.1101s | 0.6303ms | 1.5865 KOps/s | 1.8084 KOps/s | $\textbf{\color{#d91a1a}-12.27\\%}$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.8596ms | 0.5271ms | 1.8973 KOps/s | 1.8862 KOps/s | $\color{#35bf28}+0.59\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.8370ms | 4.5952ms | 217.6175 Ops/s | 216.4987 Ops/s | $\color{#35bf28}+0.52\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.6901ms | 0.5429ms | 1.8421 KOps/s | 1.8284 KOps/s | $\color{#35bf28}+0.75\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.7028ms | 0.5233ms | 1.9110 KOps/s | 1.8999 KOps/s | $\color{#35bf28}+0.58\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 3.0315ms | 2.0841ms | 479.8278 Ops/s | 498.5816 Ops/s | $\color{#d91a1a}-3.76\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 2.1120ms | 1.8946ms | 527.8046 Ops/s | 519.7651 Ops/s | $\color{#35bf28}+1.55\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.9131ms | 4.7730ms | 209.5122 Ops/s | 208.9940 Ops/s | $\color{#35bf28}+0.25\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.5402ms | 0.7001ms | 1.4283 KOps/s | 1.2209 KOps/s | $\textbf{\color{#35bf28}+16.98\\%}$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.9082ms | 0.6783ms | 1.4744 KOps/s | 1.4714 KOps/s | $\color{#35bf28}+0.20\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.8103ms | 4.6591ms | 214.6320 Ops/s | 213.4697 Ops/s | $\color{#35bf28}+0.54\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.3697ms | 0.5542ms | 1.8045 KOps/s | 1.8011 KOps/s | $\color{#35bf28}+0.19\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6612ms | 0.5312ms | 1.8826 KOps/s | 1.8883 KOps/s | $\color{#d91a1a}-0.30\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.8534ms | 4.6299ms | 215.9855 Ops/s | 214.3663 Ops/s | $\color{#35bf28}+0.76\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.6698ms | 0.5446ms | 1.8363 KOps/s | 1.8218 KOps/s | $\color{#35bf28}+0.80\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.8500ms | 0.5236ms | 1.9099 KOps/s | 1.8730 KOps/s | $\color{#35bf28}+1.97\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.9955ms | 4.8201ms | 207.4629 Ops/s | 207.3519 Ops/s | $\color{#35bf28}+0.05\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.5704ms | 0.7003ms | 1.4280 KOps/s | 1.4195 KOps/s | $\color{#35bf28}+0.60\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.9317ms | 0.6816ms | 1.4670 KOps/s | 1.4677 KOps/s | $\color{#d91a1a}-0.04\\%$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.1485s | 7.7251ms | 129.4482 Ops/s | 128.5497 Ops/s | $\color{#35bf28}+0.70\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 18.1373ms | 15.4712ms | 64.6364 Ops/s | 64.1407 Ops/s | $\color{#35bf28}+0.77\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 1.2310ms | 1.1318ms | 883.5308 Ops/s | 883.2468 Ops/s | $\color{#35bf28}+0.03\\%$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.1268s | 7.3479ms | 136.0930 Ops/s | 135.3397 Ops/s | $\color{#35bf28}+0.56\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 17.9645ms | 15.3484ms | 65.1532 Ops/s | 55.9657 Ops/s | $\textbf{\color{#35bf28}+16.42\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 7.7437ms | 1.2862ms | 777.5048 Ops/s | 787.5908 Ops/s | $\color{#d91a1a}-1.28\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.1273s | 7.5168ms | 133.0350 Ops/s | 133.0068 Ops/s | $\color{#35bf28}+0.02\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 18.5390ms | 15.5203ms | 64.4316 Ops/s | 63.9651 Ops/s | $\color{#35bf28}+0.73\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 7.8469ms | 1.4751ms | 677.9012 Ops/s | 750.7941 Ops/s | $\textbf{\color{#d91a1a}-9.71\\%}$ |