[BugFix] Fix strict_length in prioritized slice sampler

pytorch-bot[bot] commented 1 month ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2194

:page_facing_up: Preview Python docs built from this PR

Note: Links to docs will display an error until the docs builds have been completed.

:x: 10 New Failures, 9 Unrelated Failures

As of commit 56590017b4fcb97be57fb759d7c40e7b04a59a42 with merge base 3e6cb8419df56d9263d1daa48f9c3be5f01eaea6 ():

NEW FAILURES - The following jobs have failed:

* [Generate documentation / build-docs (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779477475) ([gh](https://github.com/pytorch/rl/actions/runs/9365067675/job/25779477475)) `No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.` * [Unit-tests on Windows / unittests-cpu / windows-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779476338) ([gh](https://github.com/pytorch/rl/actions/runs/9365067679/job/25779476338)) `The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 128` * [Wheels / test-wheel (linux, ubuntu-20.04, 3.10)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779569263) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779569263)) `##[error]The operation was canceled.` * [Wheels / test-wheel (linux, ubuntu-20.04, 3.11)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779569683) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779569683)) `ModuleNotFoundError: No module named 'dm_env'` * [Wheels / test-wheel (linux, ubuntu-20.04, 3.8)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779568466) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779568466)) * [Wheels / test-wheel (linux, ubuntu-20.04, 3.9)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779568924) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779568924)) * [Wheels / test-wheel-windows (3.10)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779673053) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779673053)) `ModuleNotFoundError: No module named 'dm_env'` * [Wheels / test-wheel-windows (3.11)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779673394) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779673394)) * [Wheels / test-wheel-windows (3.8)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779672327) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779672327)) `ModuleNotFoundError: No module named 'dm_env'` * [Wheels / test-wheel-windows (3.9)](https://hud.pytorch.org/pr/pytorch/rl/2194#25779672709) ([gh](https://github.com/pytorch/rl/actions/runs/9365067651/job/25779672709))

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

* [Examples Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779478116) ([gh](https://github.com/pytorch/rl/actions/runs/9365067678/job/25779478116)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Habitat Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779476852) ([gh](https://github.com/pytorch/rl/actions/runs/9365067682/job/25779476852)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Libs Tests on Linux / unittests-gym (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779483987) ([gh](https://github.com/pytorch/rl/actions/runs/9365067677/job/25779483987)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Libs Tests on Linux / unittests-sklearn (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779484227) ([gh](https://github.com/pytorch/rl/actions/runs/9365067677/job/25779484227)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [RLHF Tests on Linux / unittests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779476840) ([gh](https://github.com/pytorch/rl/actions/runs/9365067692/job/25779476840)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Unit-tests on Linux / tests-gpu (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779480729) ([gh](https://github.com/pytorch/rl/actions/runs/9365067674/job/25779480729)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Unit-tests on Linux / tests-olddeps (3.8, 11.6) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779480977) ([gh](https://github.com/pytorch/rl/actions/runs/9365067674/job/25779480977)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Unit-tests on Linux / tests-optdeps (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779481226) ([gh](https://github.com/pytorch/rl/actions/runs/9365067674/job/25779481226)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128` * [Unit-tests on Linux / tests-stable-gpu (3.10, 11.8) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2194#25779481484) ([gh](https://github.com/pytorch/rl/actions/runs/9365067674/job/25779481484)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions[bot] commented 1 month ago

$\color{#D29922}\textsf{\Large\⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 91. Improved: $\large\color{#35bf28}7$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results

| Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change | | ----------------------------------------------------------------------------------------- | --------- | --------- | --------------- | ------------------ | ----------------------------------- | | test_single | 0.1088s | 59.9196ms | 16.6890 Ops/s | 17.6010 Ops/s | $\textbf{\color{#d91a1a}-5.18\\%}$ | | test_sync | 41.2009ms | 31.9943ms | 31.2556 Ops/s | 31.4979 Ops/s | $\color{#d91a1a}-0.77\\%$ | | test_async | 54.6765ms | 27.6045ms | 36.2260 Ops/s | 35.3370 Ops/s | $\color{#35bf28}+2.52\\%$ | | test_simple | 0.4525s | 0.3956s | 2.5281 Ops/s | 2.6419 Ops/s | $\color{#d91a1a}-4.31\\%$ | | test_transformed | 0.5381s | 0.5365s | 1.8639 Ops/s | 1.8065 Ops/s | $\color{#35bf28}+3.18\\%$ | | test_serial | 1.3295s | 1.2815s | 0.7803 Ops/s | 0.7852 Ops/s | $\color{#d91a1a}-0.62\\%$ | | test_parallel | 1.1425s | 1.0723s | 0.9326 Ops/s | 0.9314 Ops/s | $\color{#35bf28}+0.13\\%$ | | test_step_mdp_speed[True-True-True-True-True] | 0.1888ms | 22.1613μs | 45.1236 KOps/s | 46.1921 KOps/s | $\color{#d91a1a}-2.31\\%$ | | test_step_mdp_speed[True-True-True-True-False] | 55.1830μs | 13.0749μs | 76.4824 KOps/s | 75.9968 KOps/s | $\color{#35bf28}+0.64\\%$ | | test_step_mdp_speed[True-True-True-False-True] | 44.4630μs | 12.6257μs | 79.2033 KOps/s | 77.9766 KOps/s | $\color{#35bf28}+1.57\\%$ | | test_step_mdp_speed[True-True-True-False-False] | 27.1300μs | 7.6613μs | 130.5258 KOps/s | 130.1141 KOps/s | $\color{#35bf28}+0.32\\%$ | | test_step_mdp_speed[True-True-False-True-True] | 56.6350μs | 23.0702μs | 43.3460 KOps/s | 43.5025 KOps/s | $\color{#d91a1a}-0.36\\%$ | | test_step_mdp_speed[True-True-False-True-False] | 0.2071ms | 14.5516μs | 68.7211 KOps/s | 69.1018 KOps/s | $\color{#d91a1a}-0.55\\%$ | | test_step_mdp_speed[True-True-False-False-True] | 42.0590μs | 14.0145μs | 71.3545 KOps/s | 70.0615 KOps/s | $\color{#35bf28}+1.85\\%$ | | test_step_mdp_speed[True-True-False-False-False] | 31.7690μs | 8.9724μs | 111.4528 KOps/s | 110.4699 KOps/s | $\color{#35bf28}+0.89\\%$ | | test_step_mdp_speed[True-False-True-True-True] | 51.9970μs | 24.4159μs | 40.9570 KOps/s | 40.6226 KOps/s | $\color{#35bf28}+0.82\\%$ | | test_step_mdp_speed[True-False-True-True-False] | 61.8660μs | 15.7601μs | 63.4512 KOps/s | 62.9673 KOps/s | $\color{#35bf28}+0.77\\%$ | | test_step_mdp_speed[True-False-True-False-True] | 56.5760μs | 13.8212μs | 72.3528 KOps/s | 70.8508 KOps/s | $\color{#35bf28}+2.12\\%$ | | test_step_mdp_speed[True-False-True-False-False] | 46.7970μs | 8.9198μs | 112.1100 KOps/s | 110.6007 KOps/s | $\color{#35bf28}+1.36\\%$ | | test_step_mdp_speed[True-False-False-True-True] | 54.0210μs | 25.1965μs | 39.6881 KOps/s | 39.0289 KOps/s | $\color{#35bf28}+1.69\\%$ | | test_step_mdp_speed[True-False-False-True-False] | 66.3620μs | 16.9909μs | 58.8551 KOps/s | 58.1133 KOps/s | $\color{#35bf28}+1.28\\%$ | | test_step_mdp_speed[True-False-False-False-True] | 38.3120μs | 15.1170μs | 66.1507 KOps/s | 64.9034 KOps/s | $\color{#35bf28}+1.92\\%$ | | test_step_mdp_speed[True-False-False-False-False] | 30.2960μs | 10.1072μs | 98.9397 KOps/s | 96.7453 KOps/s | $\color{#35bf28}+2.27\\%$ | | test_step_mdp_speed[False-True-True-True-True] | 89.1650μs | 24.1416μs | 41.4222 KOps/s | 40.7280 KOps/s | $\color{#35bf28}+1.70\\%$ | | test_step_mdp_speed[False-True-True-True-False] | 38.7320μs | 15.7190μs | 63.6172 KOps/s | 63.1559 KOps/s | $\color{#35bf28}+0.73\\%$ | | test_step_mdp_speed[False-True-True-False-True] | 41.9180μs | 16.2753μs | 61.4427 KOps/s | 60.8096 KOps/s | $\color{#35bf28}+1.04\\%$ | | test_step_mdp_speed[False-True-True-False-False] | 30.7770μs | 10.0710μs | 99.2954 KOps/s | 97.9863 KOps/s | $\color{#35bf28}+1.34\\%$ | | test_step_mdp_speed[False-True-False-True-True] | 53.2800μs | 25.3831μs | 39.3963 KOps/s | 38.7690 KOps/s | $\color{#35bf28}+1.62\\%$ | | test_step_mdp_speed[False-True-False-True-False] | 83.1750μs | 16.9367μs | 59.0436 KOps/s | 58.3294 KOps/s | $\color{#35bf28}+1.22\\%$ | | test_step_mdp_speed[False-True-False-False-True] | 52.5090μs | 17.5094μs | 57.1123 KOps/s | 56.8139 KOps/s | $\color{#35bf28}+0.53\\%$ | | test_step_mdp_speed[False-True-False-False-False] | 44.9440μs | 11.3579μs | 88.0446 KOps/s | 87.5312 KOps/s | $\color{#35bf28}+0.59\\%$ | | test_step_mdp_speed[False-False-True-True-True] | 56.2350μs | 26.6299μs | 37.5518 KOps/s | 37.3474 KOps/s | $\color{#35bf28}+0.55\\%$ | | test_step_mdp_speed[False-False-True-True-False] | 45.5750μs | 18.2938μs | 54.6633 KOps/s | 54.2269 KOps/s | $\color{#35bf28}+0.80\\%$ | | test_step_mdp_speed[False-False-True-False-True] | 39.8540μs | 17.4362μs | 57.3518 KOps/s | 57.1067 KOps/s | $\color{#35bf28}+0.43\\%$ | | test_step_mdp_speed[False-False-True-False-False] | 36.5780μs | 11.3125μs | 88.3981 KOps/s | 87.9064 KOps/s | $\color{#35bf28}+0.56\\%$ | | test_step_mdp_speed[False-False-False-True-True] | 41.3980μs | 28.5809μs | 34.9884 KOps/s | 34.6130 KOps/s | $\color{#35bf28}+1.08\\%$ | | test_step_mdp_speed[False-False-False-True-False] | 56.9270μs | 19.3325μs | 51.7263 KOps/s | 51.3578 KOps/s | $\color{#35bf28}+0.72\\%$ | | test_step_mdp_speed[False-False-False-False-True] | 48.6610μs | 18.3740μs | 54.4248 KOps/s | 53.6460 KOps/s | $\color{#35bf28}+1.45\\%$ | | test_step_mdp_speed[False-False-False-False-False] | 40.2050μs | 12.4531μs | 80.3010 KOps/s | 79.2848 KOps/s | $\color{#35bf28}+1.28\\%$ | | test_values[generalized_advantage_estimate-True-True] | 10.9184ms | 9.5582ms | 104.6227 Ops/s | 104.0153 Ops/s | $\color{#35bf28}+0.58\\%$ | | test_values[vec_generalized_advantage_estimate-True-True] | 35.2712ms | 33.3193ms | 30.0127 Ops/s | 28.3472 Ops/s | $\textbf{\color{#35bf28}+5.88\\%}$ | | test_values[td0_return_estimate-False-False] | 0.2130ms | 0.1649ms | 6.0660 KOps/s | 6.0739 KOps/s | $\color{#d91a1a}-0.13\\%$ | | test_values[td1_return_estimate-False-False] | 25.7394ms | 24.2233ms | 41.2826 Ops/s | 40.8849 Ops/s | $\color{#35bf28}+0.97\\%$ | | test_values[vec_td1_return_estimate-False-False] | 34.3838ms | 33.6216ms | 29.7428 Ops/s | 28.3962 Ops/s | $\color{#35bf28}+4.74\\%$ | | test_values[td_lambda_return_estimate-True-False] | 35.4160ms | 34.8928ms | 28.6592 Ops/s | 28.1690 Ops/s | $\color{#35bf28}+1.74\\%$ | | test_values[vec_td_lambda_return_estimate-True-False] | 35.3653ms | 33.6508ms | 29.7170 Ops/s | 28.3522 Ops/s | $\color{#35bf28}+4.81\\%$ | | test_gae_speed[generalized_advantage_estimate-False-1-512] | 8.5625ms | 8.4055ms | 118.9695 Ops/s | 117.3254 Ops/s | $\color{#35bf28}+1.40\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.1360ms | 1.7977ms | 556.2643 Ops/s | 523.7379 Ops/s | $\textbf{\color{#35bf28}+6.21\\%}$ | | test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.4747ms | 0.3615ms | 2.7659 KOps/s | 2.7744 KOps/s | $\color{#d91a1a}-0.30\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 46.0336ms | 43.1777ms | 23.1601 Ops/s | 21.5753 Ops/s | $\textbf{\color{#35bf28}+7.35\\%}$ | | test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 5.0093ms | 3.1592ms | 316.5379 Ops/s | 329.3042 Ops/s | $\color{#d91a1a}-3.88\\%$ | | test_dqn_speed | 1.6010ms | 1.3708ms | 729.4943 Ops/s | 716.9809 Ops/s | $\color{#35bf28}+1.75\\%$ | | test_ddpg_speed | 3.7614ms | 2.9090ms | 343.7563 Ops/s | 343.1693 Ops/s | $\color{#35bf28}+0.17\\%$ | | test_sac_speed | 9.1264ms | 8.6430ms | 115.7008 Ops/s | 116.2022 Ops/s | $\color{#d91a1a}-0.43\\%$ | | test_redq_speed | 15.0114ms | 13.4831ms | 74.1668 Ops/s | 75.2284 Ops/s | $\color{#d91a1a}-1.41\\%$ | | test_redq_deprec_speed | 14.4841ms | 13.6731ms | 73.1365 Ops/s | 65.8990 Ops/s | $\textbf{\color{#35bf28}+10.98\\%}$ | | test_td3_speed | 16.3614ms | 8.6124ms | 116.1119 Ops/s | 116.8474 Ops/s | $\color{#d91a1a}-0.63\\%$ | | test_cql_speed | 40.5719ms | 36.9441ms | 27.0680 Ops/s | 27.2739 Ops/s | $\color{#d91a1a}-0.76\\%$ | | test_a2c_speed | 10.1033ms | 7.6116ms | 131.3779 Ops/s | 132.6672 Ops/s | $\color{#d91a1a}-0.97\\%$ | | test_ppo_speed | 8.5277ms | 7.8126ms | 127.9980 Ops/s | 126.9382 Ops/s | $\color{#35bf28}+0.83\\%$ | | test_reinforce_speed | 7.7727ms | 6.9036ms | 144.8521 Ops/s | 148.0792 Ops/s | $\color{#d91a1a}-2.18\\%$ | | test_iql_speed | 33.6617ms | 33.1158ms | 30.1971 Ops/s | 30.2802 Ops/s | $\color{#d91a1a}-0.27\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 11.3760ms | 3.6601ms | 273.2187 Ops/s | 281.5313 Ops/s | $\color{#d91a1a}-2.95\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.8934ms | 0.5040ms | 1.9843 KOps/s | 1.9396 KOps/s | $\color{#35bf28}+2.30\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7656ms | 0.4804ms | 2.0817 KOps/s | 2.0771 KOps/s | $\color{#35bf28}+0.22\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 3.9139ms | 3.6494ms | 274.0195 Ops/s | 284.5835 Ops/s | $\color{#d91a1a}-3.71\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.9307ms | 0.5023ms | 1.9907 KOps/s | 2.0100 KOps/s | $\color{#d91a1a}-0.96\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 2.2454ms | 0.4871ms | 2.0531 KOps/s | 2.0920 KOps/s | $\color{#d91a1a}-1.86\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.0710ms | 1.2862ms | 777.5070 Ops/s | 794.2253 Ops/s | $\color{#d91a1a}-2.10\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.6703ms | 1.2065ms | 828.8616 Ops/s | 842.5823 Ops/s | $\color{#d91a1a}-1.63\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.5705ms | 3.6877ms | 271.1685 Ops/s | 270.9803 Ops/s | $\color{#35bf28}+0.07\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.9794ms | 0.6379ms | 1.5677 KOps/s | 1.6085 KOps/s | $\color{#d91a1a}-2.53\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.7511ms | 0.5960ms | 1.6779 KOps/s | 1.6831 KOps/s | $\color{#d91a1a}-0.31\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.1162ms | 3.5822ms | 279.1565 Ops/s | 281.3693 Ops/s | $\color{#d91a1a}-0.79\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.8105ms | 0.5045ms | 1.9823 KOps/s | 1.9738 KOps/s | $\color{#35bf28}+0.43\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 3.7688ms | 0.4867ms | 2.0545 KOps/s | 2.0538 KOps/s | $\color{#35bf28}+0.04\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.2831ms | 3.5581ms | 281.0524 Ops/s | 279.3081 Ops/s | $\color{#35bf28}+0.62\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.7605ms | 0.4991ms | 2.0036 KOps/s | 2.0091 KOps/s | $\color{#d91a1a}-0.27\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6204ms | 0.4729ms | 2.1146 KOps/s | 2.1025 KOps/s | $\color{#35bf28}+0.57\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.4911ms | 3.6509ms | 273.9087 Ops/s | 270.8404 Ops/s | $\color{#35bf28}+1.13\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.8679ms | 0.6230ms | 1.6050 KOps/s | 1.6090 KOps/s | $\color{#d91a1a}-0.25\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8727ms | 0.5930ms | 1.6862 KOps/s | 1.6438 KOps/s | $\color{#35bf28}+2.58\\%$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.1036s | 5.6882ms | 175.8039 Ops/s | 125.5020 Ops/s | $\textbf{\color{#35bf28}+40.08\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 14.9732ms | 12.8846ms | 77.6117 Ops/s | 77.5694 Ops/s | $\color{#35bf28}+0.05\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 5.2019ms | 2.4868ms | 402.1201 Ops/s | 486.2555 Ops/s | $\textbf{\color{#d91a1a}-17.30\\%}$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.1015s | 7.5287ms | 132.8258 Ops/s | 175.7936 Ops/s | $\textbf{\color{#d91a1a}-24.44\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 14.8971ms | 12.7481ms | 78.4433 Ops/s | 77.7531 Ops/s | $\color{#35bf28}+0.89\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 5.3335ms | 1.8152ms | 550.8963 Ops/s | 416.3103 Ops/s | $\textbf{\color{#35bf28}+32.33\\%}$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.1102s | 6.3509ms | 157.4589 Ops/s | 124.2316 Ops/s | $\textbf{\color{#35bf28}+26.75\\%}$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 15.9559ms | 13.4011ms | 74.6206 Ops/s | 76.1325 Ops/s | $\color{#d91a1a}-1.99\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 5.9096ms | 2.7490ms | 363.7724 Ops/s | 378.7641 Ops/s | $\color{#d91a1a}-3.96\\%$ |

github-actions[bot] commented 1 month ago

$\color{#D29922}\textsf{\Large\⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 94. Improved: $\large\color{#35bf28}6$. Worsened: $\large\color{#d91a1a}3$.

Expand to view detailed results

| Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change | | ----------------------------------------------------------------------------------------- | --------- | --------- | -------------- | ------------------ | ----------------------------------- | | test_single | 0.1177s | 0.1175s | 8.5082 Ops/s | 8.3306 Ops/s | $\color{#35bf28}+2.13\\%$ | | test_sync | 0.1051s | 0.1039s | 9.6217 Ops/s | 9.4880 Ops/s | $\color{#35bf28}+1.41\\%$ | | test_async | 0.1961s | 95.3590ms | 10.4867 Ops/s | 10.9033 Ops/s | $\color{#d91a1a}-3.82\\%$ | | test_single_pixels | 0.1304s | 0.1283s | 7.7961 Ops/s | 7.7203 Ops/s | $\color{#35bf28}+0.98\\%$ | | test_sync_pixels | 85.6050ms | 83.0983ms | 12.0339 Ops/s | 11.9023 Ops/s | $\color{#35bf28}+1.11\\%$ | | test_async_pixels | 0.1612s | 69.5490ms | 14.3783 Ops/s | 14.2574 Ops/s | $\color{#35bf28}+0.85\\%$ | | test_simple | 0.8844s | 0.8249s | 1.2122 Ops/s | 1.1960 Ops/s | $\color{#35bf28}+1.36\\%$ | | test_transformed | 1.1382s | 1.0797s | 0.9262 Ops/s | 0.9116 Ops/s | $\color{#35bf28}+1.61\\%$ | | test_serial | 2.5370s | 2.4809s | 0.4031 Ops/s | 0.3960 Ops/s | $\color{#35bf28}+1.79\\%$ | | test_parallel | 2.4134s | 2.3485s | 0.4258 Ops/s | 0.4206 Ops/s | $\color{#35bf28}+1.25\\%$ | | test_step_mdp_speed[True-True-True-True-True] | 0.1022ms | 33.3833μs | 29.9551 KOps/s | 29.3629 KOps/s | $\color{#35bf28}+2.02\\%$ | | test_step_mdp_speed[True-True-True-True-False] | 44.7100μs | 19.7258μs | 50.6949 KOps/s | 49.2375 KOps/s | $\color{#35bf28}+2.96\\%$ | | test_step_mdp_speed[True-True-True-False-True] | 40.3010μs | 18.9958μs | 52.6433 KOps/s | 49.8905 KOps/s | $\textbf{\color{#35bf28}+5.52\\%}$ | | test_step_mdp_speed[True-True-True-False-False] | 26.6400μs | 11.3285μs | 88.2730 KOps/s | 84.6251 KOps/s | $\color{#35bf28}+4.31\\%$ | | test_step_mdp_speed[True-True-False-True-True] | 63.1910μs | 35.1030μs | 28.4876 KOps/s | 27.6114 KOps/s | $\color{#35bf28}+3.17\\%$ | | test_step_mdp_speed[True-True-False-True-False] | 44.9400μs | 21.8839μs | 45.6957 KOps/s | 45.3316 KOps/s | $\color{#35bf28}+0.80\\%$ | | test_step_mdp_speed[True-True-False-False-True] | 39.3110μs | 20.9046μs | 47.8365 KOps/s | 46.9420 KOps/s | $\color{#35bf28}+1.91\\%$ | | test_step_mdp_speed[True-True-False-False-False] | 36.3810μs | 13.4110μs | 74.5658 KOps/s | 74.1807 KOps/s | $\color{#35bf28}+0.52\\%$ | | test_step_mdp_speed[True-False-True-True-True] | 58.8710μs | 37.2974μs | 26.8115 KOps/s | 26.1407 KOps/s | $\color{#35bf28}+2.57\\%$ | | test_step_mdp_speed[True-False-True-True-False] | 50.3210μs | 23.7746μs | 42.0617 KOps/s | 41.0686 KOps/s | $\color{#35bf28}+2.42\\%$ | | test_step_mdp_speed[True-False-True-False-True] | 46.6100μs | 20.8853μs | 47.8805 KOps/s | 46.0152 KOps/s | $\color{#35bf28}+4.05\\%$ | | test_step_mdp_speed[True-False-True-False-False] | 34.7300μs | 13.3806μs | 74.7350 KOps/s | 74.5354 KOps/s | $\color{#35bf28}+0.27\\%$ | | test_step_mdp_speed[True-False-False-True-True] | 61.5810μs | 39.2912μs | 25.4510 KOps/s | 25.3874 KOps/s | $\color{#35bf28}+0.25\\%$ | | test_step_mdp_speed[True-False-False-True-False] | 53.6410μs | 25.9308μs | 38.5642 KOps/s | 38.9193 KOps/s | $\color{#d91a1a}-0.91\\%$ | | test_step_mdp_speed[True-False-False-False-True] | 48.7600μs | 22.5325μs | 44.3804 KOps/s | 42.4007 KOps/s | $\color{#35bf28}+4.67\\%$ | | test_step_mdp_speed[True-False-False-False-False] | 51.6910μs | 15.2233μs | 65.6887 KOps/s | 64.4651 KOps/s | $\color{#35bf28}+1.90\\%$ | | test_step_mdp_speed[False-True-True-True-True] | 64.2310μs | 37.4017μs | 26.7368 KOps/s | 26.4352 KOps/s | $\color{#35bf28}+1.14\\%$ | | test_step_mdp_speed[False-True-True-True-False] | 51.7500μs | 23.9845μs | 41.6937 KOps/s | 41.2166 KOps/s | $\color{#35bf28}+1.16\\%$ | | test_step_mdp_speed[False-True-True-False-True] | 51.9700μs | 24.6033μs | 40.6450 KOps/s | 38.2890 KOps/s | $\textbf{\color{#35bf28}+6.15\\%}$ | | test_step_mdp_speed[False-True-True-False-False] | 36.2400μs | 15.0341μs | 66.5153 KOps/s | 64.4543 KOps/s | $\color{#35bf28}+3.20\\%$ | | test_step_mdp_speed[False-True-False-True-True] | 65.4510μs | 38.8674μs | 25.7285 KOps/s | 25.1531 KOps/s | $\color{#35bf28}+2.29\\%$ | | test_step_mdp_speed[False-True-False-True-False] | 45.3800μs | 25.6838μs | 38.9350 KOps/s | 38.0122 KOps/s | $\color{#35bf28}+2.43\\%$ | | test_step_mdp_speed[False-True-False-False-True] | 48.7810μs | 26.4615μs | 37.7908 KOps/s | 36.1538 KOps/s | $\color{#35bf28}+4.53\\%$ | | test_step_mdp_speed[False-True-False-False-False] | 36.2900μs | 16.7891μs | 59.5625 KOps/s | 58.0601 KOps/s | $\color{#35bf28}+2.59\\%$ | | test_step_mdp_speed[False-False-True-True-True] | 64.7010μs | 41.2743μs | 24.2281 KOps/s | 24.1535 KOps/s | $\color{#35bf28}+0.31\\%$ | | test_step_mdp_speed[False-False-True-True-False] | 52.5410μs | 27.4585μs | 36.4186 KOps/s | 35.6209 KOps/s | $\color{#35bf28}+2.24\\%$ | | test_step_mdp_speed[False-False-True-False-True] | 47.3210μs | 26.3604μs | 37.9358 KOps/s | 35.9710 KOps/s | $\textbf{\color{#35bf28}+5.46\\%}$ | | test_step_mdp_speed[False-False-True-False-False] | 37.5510μs | 16.7884μs | 59.5649 KOps/s | 58.1661 KOps/s | $\color{#35bf28}+2.40\\%$ | | test_step_mdp_speed[False-False-False-True-True] | 57.8110μs | 42.9904μs | 23.2610 KOps/s | 22.5930 KOps/s | $\color{#35bf28}+2.96\\%$ | | test_step_mdp_speed[False-False-False-True-False] | 56.9010μs | 29.5760μs | 33.8112 KOps/s | 33.5216 KOps/s | $\color{#35bf28}+0.86\\%$ | | test_step_mdp_speed[False-False-False-False-True] | 60.7800μs | 28.1459μs | 35.5292 KOps/s | 34.3773 KOps/s | $\color{#35bf28}+3.35\\%$ | | test_step_mdp_speed[False-False-False-False-False] | 49.2410μs | 18.6047μs | 53.7498 KOps/s | 53.0967 KOps/s | $\color{#35bf28}+1.23\\%$ | | test_values[generalized_advantage_estimate-True-True] | 25.3206ms | 24.7040ms | 40.4793 Ops/s | 38.9446 Ops/s | $\color{#35bf28}+3.94\\%$ | | test_values[vec_generalized_advantage_estimate-True-True] | 81.2806ms | 3.2038ms | 312.1310 Ops/s | 300.7793 Ops/s | $\color{#35bf28}+3.77\\%$ | | test_values[td0_return_estimate-False-False] | 91.3310μs | 64.8814μs | 15.4127 KOps/s | 15.0870 KOps/s | $\color{#35bf28}+2.16\\%$ | | test_values[td1_return_estimate-False-False] | 53.9883ms | 52.9850ms | 18.8733 Ops/s | 18.2585 Ops/s | $\color{#35bf28}+3.37\\%$ | | test_values[vec_td1_return_estimate-False-False] | 2.0565ms | 1.7766ms | 562.8739 Ops/s | 560.2954 Ops/s | $\color{#35bf28}+0.46\\%$ | | test_values[td_lambda_return_estimate-True-False] | 85.4918ms | 84.7664ms | 11.7971 Ops/s | 11.4134 Ops/s | $\color{#35bf28}+3.36\\%$ | | test_values[vec_td_lambda_return_estimate-True-False] | 2.1530ms | 1.7756ms | 563.1848 Ops/s | 559.1379 Ops/s | $\color{#35bf28}+0.72\\%$ | | test_gae_speed[generalized_advantage_estimate-False-1-512] | 23.7590ms | 23.5196ms | 42.5178 Ops/s | 40.8184 Ops/s | $\color{#35bf28}+4.16\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 0.9536ms | 0.7223ms | 1.3845 KOps/s | 1.3913 KOps/s | $\color{#d91a1a}-0.48\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7279ms | 0.6646ms | 1.5048 KOps/s | 1.4860 KOps/s | $\color{#35bf28}+1.26\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.4978ms | 1.4618ms | 684.0668 Ops/s | 679.7693 Ops/s | $\color{#35bf28}+0.63\\%$ | | test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.9479ms | 0.6824ms | 1.4654 KOps/s | 1.4474 KOps/s | $\color{#35bf28}+1.25\\%$ | | test_dqn_speed | 1.5728ms | 1.4322ms | 698.2281 Ops/s | 691.8065 Ops/s | $\color{#35bf28}+0.93\\%$ | | test_ddpg_speed | 3.3945ms | 2.9366ms | 340.5251 Ops/s | 335.2328 Ops/s | $\color{#35bf28}+1.58\\%$ | | test_sac_speed | 96.4613ms | 9.2732ms | 107.8380 Ops/s | 116.8008 Ops/s | $\textbf{\color{#d91a1a}-7.67\\%}$ | | test_redq_speed | 12.4731ms | 10.6219ms | 94.1447 Ops/s | 84.5039 Ops/s | $\textbf{\color{#35bf28}+11.41\\%}$ | | test_redq_deprec_speed | 11.9659ms | 11.4847ms | 87.0727 Ops/s | 83.9303 Ops/s | $\color{#35bf28}+3.74\\%$ | | test_td3_speed | 8.5100ms | 8.3688ms | 119.4913 Ops/s | 117.7272 Ops/s | $\color{#35bf28}+1.50\\%$ | | test_cql_speed | 27.0552ms | 25.7723ms | 38.8014 Ops/s | 38.3657 Ops/s | $\color{#35bf28}+1.14\\%$ | | test_a2c_speed | 6.2470ms | 5.5643ms | 179.7172 Ops/s | 176.0353 Ops/s | $\color{#35bf28}+2.09\\%$ | | test_ppo_speed | 6.9856ms | 5.8525ms | 170.8664 Ops/s | 166.5777 Ops/s | $\color{#35bf28}+2.57\\%$ | | test_reinforce_speed | 5.2094ms | 4.5817ms | 218.2587 Ops/s | 212.8607 Ops/s | $\color{#35bf28}+2.54\\%$ | | test_iql_speed | 20.3109ms | 19.7327ms | 50.6774 Ops/s | 49.7848 Ops/s | $\color{#35bf28}+1.79\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.7957ms | 4.6790ms | 213.7197 Ops/s | 214.2861 Ops/s | $\color{#d91a1a}-0.26\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.3800ms | 0.5887ms | 1.6986 KOps/s | 1.4683 KOps/s | $\textbf{\color{#35bf28}+15.68\\%}$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7451ms | 0.5624ms | 1.7780 KOps/s | 1.7568 KOps/s | $\color{#35bf28}+1.21\\%$ | | test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.7693ms | 4.5971ms | 217.5296 Ops/s | 214.4736 Ops/s | $\color{#35bf28}+1.42\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.3301ms | 0.5797ms | 1.7249 KOps/s | 1.7000 KOps/s | $\color{#35bf28}+1.47\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.7019ms | 0.5571ms | 1.7950 KOps/s | 1.7818 KOps/s | $\color{#35bf28}+0.74\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.8185ms | 1.5201ms | 657.8500 Ops/s | 638.7253 Ops/s | $\color{#35bf28}+2.99\\%$ | | test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 6.5251ms | 1.4583ms | 685.7495 Ops/s | 662.7569 Ops/s | $\color{#35bf28}+3.47\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.8971ms | 4.7845ms | 209.0076 Ops/s | 209.1016 Ops/s | $\color{#d91a1a}-0.04\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.7318ms | 0.7092ms | 1.4101 KOps/s | 1.3911 KOps/s | $\color{#35bf28}+1.36\\%$ | | test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8878ms | 0.6831ms | 1.4639 KOps/s | 1.4352 KOps/s | $\color{#35bf28}+2.00\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.8026ms | 4.6396ms | 215.5343 Ops/s | 212.7605 Ops/s | $\color{#35bf28}+1.30\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.5815ms | 0.5908ms | 1.6927 KOps/s | 1.6741 KOps/s | $\color{#35bf28}+1.11\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7023ms | 0.5683ms | 1.7595 KOps/s | 1.7496 KOps/s | $\color{#35bf28}+0.56\\%$ | | test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.8209ms | 4.6215ms | 216.3815 Ops/s | 214.8930 Ops/s | $\color{#35bf28}+0.69\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.6799ms | 0.5798ms | 1.7246 KOps/s | 1.7032 KOps/s | $\color{#35bf28}+1.25\\%$ | | test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.8372ms | 0.5596ms | 1.7871 KOps/s | 1.7695 KOps/s | $\color{#35bf28}+1.00\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.8802ms | 4.7773ms | 209.3218 Ops/s | 207.8802 Ops/s | $\color{#35bf28}+0.69\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 1.6727ms | 0.7135ms | 1.4016 KOps/s | 1.3699 KOps/s | $\color{#35bf28}+2.31\\%$ | | test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8448ms | 0.6879ms | 1.4537 KOps/s | 1.4062 KOps/s | $\color{#35bf28}+3.37\\%$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.1386s | 7.5409ms | 132.6102 Ops/s | 132.5380 Ops/s | $\color{#35bf28}+0.05\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 19.2611ms | 15.5462ms | 64.3245 Ops/s | 54.9399 Ops/s | $\textbf{\color{#35bf28}+17.08\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 1.3880ms | 1.2764ms | 783.4352 Ops/s | 769.3237 Ops/s | $\color{#35bf28}+1.83\\%$ | | test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.1194s | 9.4286ms | 106.0606 Ops/s | 136.8065 Ops/s | $\textbf{\color{#d91a1a}-22.47\\%}$ | | test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 17.8203ms | 15.5272ms | 64.4032 Ops/s | 63.5374 Ops/s | $\color{#35bf28}+1.36\\%$ | | test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 2.3609ms | 1.3045ms | 766.5895 Ops/s | 764.0506 Ops/s | $\color{#35bf28}+0.33\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.1189s | 7.5099ms | 133.1578 Ops/s | 130.9536 Ops/s | $\color{#35bf28}+1.68\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 18.3277ms | 15.7710ms | 63.4077 Ops/s | 61.5563 Ops/s | $\color{#35bf28}+3.01\\%$ | | test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 7.1833ms | 1.7966ms | 556.5996 Ops/s | 601.1269 Ops/s | $\textbf{\color{#d91a1a}-7.41\\%}$ |

pytorch / rl