pytorch / rl

A modular, primitive-first, python-first PyTorch library for Reinforcement Learning.
https://pytorch.org/rl
MIT License
2.05k stars 273 forks

[Doc] Dynamic envs #2191

Closed: vmoens closed this pull request 1 month ago

pytorch-bot[bot] commented 1 month ago

:link: Helpful Links

:test_tube: See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2191

Note: Links to docs will display an error until the docs builds have been completed.

:x: 2 New Failures, 12 Unrelated Failures

As of commit 833b9483a47b6bad6a070eef9e5045e63571ed81 with merge base 7b145b509f47b3ce8f043dc4e10c19c8c8a9a617:

NEW FAILURES - The following jobs have failed:

* [Generate documentation / build-docs (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656012230) ([gh](https://github.com/pytorch/rl/actions/runs/9320058143/job/25656012230)) `No files were found with the provided path: /home/ec2-user/actions-runner/_work/_temp/artifacts/. No artifacts will be uploaded.`
* [Unit-tests on Windows / unittests-cpu / windows-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656011689) ([gh](https://github.com/pytorch/rl/actions/runs/9320058128/job/25656011689)) `The process 'C:\Program Files\Git\cmd\git.exe' failed with exit code 128`

FLAKY - The following jobs failed but were likely due to flakiness present on trunk:

* [Examples Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656012947) ([gh](https://github.com/pytorch/rl/actions/runs/9320058136/job/25656012947)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Habitat Tests on Linux / tests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656012566) ([gh](https://github.com/pytorch/rl/actions/runs/9320058130/job/25656012566)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Libs Tests on Linux / unittests-gym (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656018708) ([gh](https://github.com/pytorch/rl/actions/runs/9320058123/job/25656018708)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Libs Tests on Linux / unittests-sklearn (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656018977) ([gh](https://github.com/pytorch/rl/actions/runs/9320058123/job/25656018977)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [RLHF Tests on Linux / unittests (3.9, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656012953) ([gh](https://github.com/pytorch/rl/actions/runs/9320058140/job/25656012953)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-cpu (3.11) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656013826) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656013826)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-cpu (3.8) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656014498) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656014498)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-cpu (3.9) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656014947) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656014947)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-gpu (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656015358) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656015358)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-olddeps (3.8, 11.6) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656015613) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656015613)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-optdeps (3.10, 12.1) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656015883) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656015883)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`
* [Unit-tests on Linux / tests-stable-gpu (3.10, 11.8) / linux-job](https://hud.pytorch.org/pr/pytorch/rl/2191#25656016150) ([gh](https://github.com/pytorch/rl/actions/runs/9320058118/job/25656016150)) (matched **linux** rule in [flaky-rules.json](https://github.com/pytorch/test-infra/blob/generated-stats/stats/flaky-rules.json)) `The process '/usr/bin/git' failed with exit code 128`

This comment was automatically generated by Dr. CI and updates every 15 minutes.

github-actions[bot] commented 1 month ago

$\color{#D29922}\textsf{\Large\⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 91. Improved: $\large\color{#35bf28}3$. Worsened: $\large\color{#d91a1a}6$.

| Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change |
| --- | --- | --- | --- | --- | --- |
| test_single | 0.1228s | 59.2277ms | 16.8840 Ops/s | 18.0146 Ops/s | $\textbf{\color{#d91a1a}-6.28\\%}$ |
| test_sync | 33.5546ms | 31.7092ms | 31.5366 Ops/s | 31.5638 Ops/s | $\color{#d91a1a}-0.09\\%$ |
| test_async | 72.0078ms | 29.2420ms | 34.1974 Ops/s | 34.1845 Ops/s | $\color{#35bf28}+0.04\\%$ |
| test_simple | 0.4402s | 0.3830s | 2.6112 Ops/s | 2.6815 Ops/s | $\color{#d91a1a}-2.62\\%$ |
| test_transformed | 0.5283s | 0.5255s | 1.9031 Ops/s | 1.8554 Ops/s | $\color{#35bf28}+2.57\\%$ |
| test_serial | 1.3077s | 1.2501s | 0.7999 Ops/s | 0.7839 Ops/s | $\color{#35bf28}+2.04\\%$ |
| test_parallel | 1.1285s | 1.0752s | 0.9300 Ops/s | 0.9099 Ops/s | $\color{#35bf28}+2.21\\%$ |
| test_step_mdp_speed[True-True-True-True-True] | 0.1335ms | 21.2817μs | 46.9888 KOps/s | 46.3092 KOps/s | $\color{#35bf28}+1.47\\%$ |
| test_step_mdp_speed[True-True-True-True-False] | 36.0180μs | 13.0881μs | 76.4052 KOps/s | 75.2997 KOps/s | $\color{#35bf28}+1.47\\%$ |
| test_step_mdp_speed[True-True-True-False-True] | 47.4180μs | 12.7338μs | 78.5314 KOps/s | 78.3742 KOps/s | $\color{#35bf28}+0.20\\%$ |
| test_step_mdp_speed[True-True-True-False-False] | 35.2370μs | 7.6265μs | 131.1212 KOps/s | 128.5038 KOps/s | $\color{#35bf28}+2.04\\%$ |
| test_step_mdp_speed[True-True-False-True-True] | 53.8400μs | 22.9557μs | 43.5622 KOps/s | 43.3961 KOps/s | $\color{#35bf28}+0.38\\%$ |
| test_step_mdp_speed[True-True-False-True-False] | 38.7830μs | 14.3403μs | 69.7337 KOps/s | 68.3979 KOps/s | $\color{#35bf28}+1.95\\%$ |
| test_step_mdp_speed[True-True-False-False-True] | 58.8200μs | 13.9224μs | 71.8267 KOps/s | 70.6354 KOps/s | $\color{#35bf28}+1.69\\%$ |
| test_step_mdp_speed[True-True-False-False-False] | 34.8950μs | 8.9563μs | 111.6530 KOps/s | 110.7205 KOps/s | $\color{#35bf28}+0.84\\%$ |
| test_step_mdp_speed[True-False-True-True-True] | 68.6080μs | 23.9743μs | 41.7114 KOps/s | 41.4148 KOps/s | $\color{#35bf28}+0.72\\%$ |
| test_step_mdp_speed[True-False-True-True-False] | 49.9630μs | 15.7890μs | 63.3351 KOps/s | 63.8599 KOps/s | $\color{#d91a1a}-0.82\\%$ |
| test_step_mdp_speed[True-False-True-False-True] | 70.3720μs | 13.8959μs | 71.9635 KOps/s | 72.0221 KOps/s | $\color{#d91a1a}-0.08\\%$ |
| test_step_mdp_speed[True-False-True-False-False] | 33.6630μs | 8.9253μs | 112.0413 KOps/s | 111.7698 KOps/s | $\color{#35bf28}+0.24\\%$ |
| test_step_mdp_speed[True-False-False-True-True] | 64.9210μs | 24.9291μs | 40.1138 KOps/s | 39.6073 KOps/s | $\color{#35bf28}+1.28\\%$ |
| test_step_mdp_speed[True-False-False-True-False] | 45.1040μs | 16.6748μs | 59.9708 KOps/s | 59.5727 KOps/s | $\color{#35bf28}+0.67\\%$ |
| test_step_mdp_speed[True-False-False-False-True] | 58.9100μs | 14.9095μs | 67.0712 KOps/s | 66.4488 KOps/s | $\color{#35bf28}+0.94\\%$ |
| test_step_mdp_speed[True-False-False-False-False] | 59.3520μs | 9.9541μs | 100.4611 KOps/s | 98.3816 KOps/s | $\color{#35bf28}+2.11\\%$ |
| test_step_mdp_speed[False-True-True-True-True] | 59.7320μs | 24.0564μs | 41.5690 KOps/s | 40.9668 KOps/s | $\color{#35bf28}+1.47\\%$ |
| test_step_mdp_speed[False-True-True-True-False] | 34.1540μs | 15.6435μs | 63.9243 KOps/s | 63.3401 KOps/s | $\color{#35bf28}+0.92\\%$ |
| test_step_mdp_speed[False-True-True-False-True] | 38.8130μs | 15.9875μs | 62.5488 KOps/s | 62.3116 KOps/s | $\color{#35bf28}+0.38\\%$ |
| test_step_mdp_speed[False-True-True-False-False] | 48.8520μs | 9.9867μs | 100.1334 KOps/s | 98.4573 KOps/s | $\color{#35bf28}+1.70\\%$ |
| test_step_mdp_speed[False-True-False-True-True] | 60.6340μs | 24.9031μs | 40.1557 KOps/s | 39.7170 KOps/s | $\color{#35bf28}+1.10\\%$ |
| test_step_mdp_speed[False-True-False-True-False] | 80.7410μs | 16.6427μs | 60.0863 KOps/s | 59.2460 KOps/s | $\color{#35bf28}+1.42\\%$ |
| test_step_mdp_speed[False-True-False-False-True] | 61.8960μs | 17.0434μs | 58.6738 KOps/s | 57.4367 KOps/s | $\color{#35bf28}+2.15\\%$ |
| test_step_mdp_speed[False-True-False-False-False] | 43.9630μs | 11.2677μs | 88.7494 KOps/s | 88.7262 KOps/s | $\color{#35bf28}+0.03\\%$ |
| test_step_mdp_speed[False-False-True-True-True] | 72.9660μs | 26.2679μs | 38.0693 KOps/s | 37.9226 KOps/s | $\color{#35bf28}+0.39\\%$ |
| test_step_mdp_speed[False-False-True-True-False] | 40.1460μs | 18.1197μs | 55.1886 KOps/s | 55.5220 KOps/s | $\color{#d91a1a}-0.60\\%$ |
| test_step_mdp_speed[False-False-True-False-True] | 45.0850μs | 17.0753μs | 58.5643 KOps/s | 58.2827 KOps/s | $\color{#35bf28}+0.48\\%$ |
| test_step_mdp_speed[False-False-True-False-False] | 49.5030μs | 11.2670μs | 88.7549 KOps/s | 87.7684 KOps/s | $\color{#35bf28}+1.12\\%$ |
| test_step_mdp_speed[False-False-False-True-True] | 40.4060μs | 27.2957μs | 36.6358 KOps/s | 36.0494 KOps/s | $\color{#35bf28}+1.63\\%$ |
| test_step_mdp_speed[False-False-False-True-False] | 65.2320μs | 19.0937μs | 52.3733 KOps/s | 51.9053 KOps/s | $\color{#35bf28}+0.90\\%$ |
| test_step_mdp_speed[False-False-False-False-True] | 66.0140μs | 18.1276μs | 55.1644 KOps/s | 55.3364 KOps/s | $\color{#d91a1a}-0.31\\%$ |
| test_step_mdp_speed[False-False-False-False-False] | 38.2710μs | 12.3662μs | 80.8657 KOps/s | 80.5076 KOps/s | $\color{#35bf28}+0.44\\%$ |
| test_values[generalized_advantage_estimate-True-True] | 10.4342ms | 9.7147ms | 102.9373 Ops/s | 106.4908 Ops/s | $\color{#d91a1a}-3.34\\%$ |
| test_values[vec_generalized_advantage_estimate-True-True] | 42.3768ms | 36.3551ms | 27.5065 Ops/s | 29.6765 Ops/s | $\textbf{\color{#d91a1a}-7.31\\%}$ |
| test_values[td0_return_estimate-False-False] | 0.2503ms | 0.1763ms | 5.6707 KOps/s | 5.1910 KOps/s | $\textbf{\color{#35bf28}+9.24\\%}$ |
| test_values[td1_return_estimate-False-False] | 25.8863ms | 23.9658ms | 41.7262 Ops/s | 41.6309 Ops/s | $\color{#35bf28}+0.23\\%$ |
| test_values[vec_td1_return_estimate-False-False] | 38.8055ms | 36.2344ms | 27.5981 Ops/s | 29.5236 Ops/s | $\textbf{\color{#d91a1a}-6.52\\%}$ |
| test_values[td_lambda_return_estimate-True-False] | 37.9343ms | 34.6917ms | 28.8253 Ops/s | 28.9872 Ops/s | $\color{#d91a1a}-0.56\\%$ |
| test_values[vec_td_lambda_return_estimate-True-False] | 36.8630ms | 35.6820ms | 28.0254 Ops/s | 29.5575 Ops/s | $\textbf{\color{#d91a1a}-5.18\\%}$ |
| test_gae_speed[generalized_advantage_estimate-False-1-512] | 8.7513ms | 8.6102ms | 116.1410 Ops/s | 120.2714 Ops/s | $\color{#d91a1a}-3.43\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 2.1358ms | 1.8064ms | 553.5986 Ops/s | 541.8076 Ops/s | $\color{#35bf28}+2.18\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.4283ms | 0.3605ms | 2.7741 KOps/s | 2.7766 KOps/s | $\color{#d91a1a}-0.09\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 47.1560ms | 45.4384ms | 22.0078 Ops/s | 24.2756 Ops/s | $\textbf{\color{#d91a1a}-9.34\\%}$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 3.6869ms | 3.0417ms | 328.7654 Ops/s | 325.5931 Ops/s | $\color{#35bf28}+0.97\\%$ |
| test_dqn_speed | 7.4821ms | 1.3844ms | 722.3387 Ops/s | 707.3297 Ops/s | $\color{#35bf28}+2.12\\%$ |
| test_ddpg_speed | 3.7466ms | 2.9169ms | 342.8328 Ops/s | 335.8748 Ops/s | $\color{#35bf28}+2.07\\%$ |
| test_sac_speed | 10.0640ms | 8.5811ms | 116.5350 Ops/s | 113.2439 Ops/s | $\color{#35bf28}+2.91\\%$ |
| test_redq_speed | 16.1525ms | 13.7066ms | 72.9574 Ops/s | 71.6705 Ops/s | $\color{#35bf28}+1.80\\%$ |
| test_redq_deprec_speed | 16.8564ms | 13.8562ms | 72.1697 Ops/s | 72.6548 Ops/s | $\color{#d91a1a}-0.67\\%$ |
| test_td3_speed | 9.4359ms | 8.5210ms | 117.3571 Ops/s | 116.9321 Ops/s | $\color{#35bf28}+0.36\\%$ |
| test_cql_speed | 39.3365ms | 36.9287ms | 27.0792 Ops/s | 27.0934 Ops/s | $\color{#d91a1a}-0.05\\%$ |
| test_a2c_speed | 8.4675ms | 7.4438ms | 134.3392 Ops/s | 132.4888 Ops/s | $\color{#35bf28}+1.40\\%$ |
| test_ppo_speed | 8.4941ms | 7.7580ms | 128.8988 Ops/s | 128.4465 Ops/s | $\color{#35bf28}+0.35\\%$ |
| test_reinforce_speed | 7.7914ms | 6.6468ms | 150.4483 Ops/s | 149.3306 Ops/s | $\color{#35bf28}+0.75\\%$ |
| test_iql_speed | 34.9246ms | 33.6072ms | 29.7555 Ops/s | 29.4074 Ops/s | $\color{#35bf28}+1.18\\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 5.4139ms | 3.4972ms | 285.9417 Ops/s | 283.8405 Ops/s | $\color{#35bf28}+0.74\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.7401ms | 0.5050ms | 1.9802 KOps/s | 1.9883 KOps/s | $\color{#d91a1a}-0.41\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.7913ms | 0.4784ms | 2.0904 KOps/s | 2.0788 KOps/s | $\color{#35bf28}+0.56\\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.3043ms | 3.4561ms | 289.3443 Ops/s | 285.1403 Ops/s | $\color{#35bf28}+1.47\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.9290ms | 0.4960ms | 2.0160 KOps/s | 2.0238 KOps/s | $\color{#d91a1a}-0.39\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 99.6733ms | 0.5450ms | 1.8349 KOps/s | 2.1180 KOps/s | $\textbf{\color{#d91a1a}-13.37\\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 1.8264ms | 1.2612ms | 792.9136 Ops/s | 793.8054 Ops/s | $\color{#d91a1a}-0.11\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.4702ms | 1.1824ms | 845.7648 Ops/s | 839.0430 Ops/s | $\color{#35bf28}+0.80\\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.2142ms | 3.5866ms | 278.8144 Ops/s | 271.2212 Ops/s | $\color{#35bf28}+2.80\\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.9390ms | 0.6198ms | 1.6134 KOps/s | 1.6123 KOps/s | $\color{#35bf28}+0.07\\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 3.6560ms | 0.5947ms | 1.6814 KOps/s | 1.6876 KOps/s | $\color{#d91a1a}-0.37\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.3995ms | 3.4366ms | 290.9888 Ops/s | 284.9995 Ops/s | $\color{#35bf28}+2.10\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.6057ms | 0.5027ms | 1.9893 KOps/s | 1.9732 KOps/s | $\color{#35bf28}+0.82\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 3.7350ms | 0.4849ms | 2.0624 KOps/s | 2.0916 KOps/s | $\color{#d91a1a}-1.40\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 5.5872ms | 3.4871ms | 286.7711 Ops/s | 278.4625 Ops/s | $\color{#35bf28}+2.98\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.1628ms | 0.5062ms | 1.9756 KOps/s | 2.0140 KOps/s | $\color{#d91a1a}-1.91\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.6345ms | 0.4759ms | 2.1011 KOps/s | 2.0946 KOps/s | $\color{#35bf28}+0.31\\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 3.8025ms | 3.5151ms | 284.4831 Ops/s | 271.8325 Ops/s | $\color{#35bf28}+4.65\\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.9575ms | 0.6258ms | 1.5981 KOps/s | 1.3449 KOps/s | $\textbf{\color{#35bf28}+18.82\\%}$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 4.4526ms | 0.5990ms | 1.6694 KOps/s | 1.6727 KOps/s | $\color{#d91a1a}-0.20\\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 98.8239ms | 5.6445ms | 177.1634 Ops/s | 178.7859 Ops/s | $\color{#d91a1a}-0.91\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 15.4292ms | 12.9152ms | 77.4283 Ops/s | 77.4517 Ops/s | $\color{#d91a1a}-0.03\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 1.7739ms | 1.0982ms | 910.5893 Ops/s | 897.5566 Ops/s | $\color{#35bf28}+1.45\\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 93.5237ms | 7.2502ms | 137.9279 Ops/s | 129.9061 Ops/s | $\textbf{\color{#35bf28}+6.18\\%}$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 15.4121ms | 12.8896ms | 77.5822 Ops/s | 78.4375 Ops/s | $\color{#d91a1a}-1.09\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 1.7893ms | 1.1101ms | 900.8102 Ops/s | 880.9559 Ops/s | $\color{#35bf28}+2.25\\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 89.9292ms | 5.8143ms | 171.9890 Ops/s | 174.6399 Ops/s | $\color{#d91a1a}-1.52\\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 16.0214ms | 13.3414ms | 74.9548 Ops/s | 77.0172 Ops/s | $\color{#d91a1a}-2.68\\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 2.1592ms | 1.4929ms | 669.8334 Ops/s | 679.6983 Ops/s | $\color{#d91a1a}-1.45\\%$ |

github-actions[bot] commented 1 month ago

$\color{#35bf28}\textsf{\Large\✔\kern{0.2cm}\normalsize OK}$ Result of GPU Benchmark Tests

Total Benchmarks: 94. Improved: $\large\color{#35bf28}4$. Worsened: $\large\color{#d91a1a}0$.

| Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change |
| --- | --- | --- | --- | --- | --- |
| test_single | 0.1171s | 0.1155s | 8.6555 Ops/s | 8.3625 Ops/s | $\color{#35bf28}+3.50\\%$ |
| test_sync | 0.1098s | 0.1071s | 9.3392 Ops/s | 9.5406 Ops/s | $\color{#d91a1a}-2.11\\%$ |
| test_async | 0.2119s | 84.3319ms | 11.8579 Ops/s | 10.3897 Ops/s | $\textbf{\color{#35bf28}+14.13\\%}$ |
| test_single_pixels | 0.1282s | 0.1276s | 7.8353 Ops/s | 7.6373 Ops/s | $\color{#35bf28}+2.59\\%$ |
| test_sync_pixels | 85.7829ms | 84.3785ms | 11.8514 Ops/s | 12.2310 Ops/s | $\color{#d91a1a}-3.10\\%$ |
| test_async_pixels | 0.1570s | 68.9150ms | 14.5106 Ops/s | 14.7584 Ops/s | $\color{#d91a1a}-1.68\\%$ |
| test_simple | 0.7994s | 0.7981s | 1.2530 Ops/s | 1.2207 Ops/s | $\color{#35bf28}+2.65\\%$ |
| test_transformed | 1.0549s | 1.0522s | 0.9504 Ops/s | 0.9237 Ops/s | $\color{#35bf28}+2.90\\%$ |
| test_serial | 2.5128s | 2.4508s | 0.4080 Ops/s | 0.3991 Ops/s | $\color{#35bf28}+2.22\\%$ |
| test_parallel | 2.4009s | 2.3320s | 0.4288 Ops/s | 0.4237 Ops/s | $\color{#35bf28}+1.21\\%$ |
| test_step_mdp_speed[True-True-True-True-True] | 91.4310μs | 32.3390μs | 30.9224 KOps/s | 29.6443 KOps/s | $\color{#35bf28}+4.31\\%$ |
| test_step_mdp_speed[True-True-True-True-False] | 41.4110μs | 19.5392μs | 51.1792 KOps/s | 50.2287 KOps/s | $\color{#35bf28}+1.89\\%$ |
| test_step_mdp_speed[True-True-True-False-True] | 36.7000μs | 18.9759μs | 52.6985 KOps/s | 51.2928 KOps/s | $\color{#35bf28}+2.74\\%$ |
| test_step_mdp_speed[True-True-True-False-False] | 36.7710μs | 11.2721μs | 88.7143 KOps/s | 87.3743 KOps/s | $\color{#35bf28}+1.53\\%$ |
| test_step_mdp_speed[True-True-False-True-True] | 53.7510μs | 34.3335μs | 29.1260 KOps/s | 28.7077 KOps/s | $\color{#35bf28}+1.46\\%$ |
| test_step_mdp_speed[True-True-False-True-False] | 44.8210μs | 21.7751μs | 45.9240 KOps/s | 46.2905 KOps/s | $\color{#d91a1a}-0.79\\%$ |
| test_step_mdp_speed[True-True-False-False-True] | 37.9810μs | 20.5682μs | 48.6188 KOps/s | 47.0601 KOps/s | $\color{#35bf28}+3.31\\%$ |
| test_step_mdp_speed[True-True-False-False-False] | 39.8300μs | 13.2866μs | 75.2637 KOps/s | 74.6232 KOps/s | $\color{#35bf28}+0.86\\%$ |
| test_step_mdp_speed[True-False-True-True-True] | 62.3310μs | 36.8989μs | 27.1010 KOps/s | 27.0030 KOps/s | $\color{#35bf28}+0.36\\%$ |
| test_step_mdp_speed[True-False-True-True-False] | 48.0710μs | 23.7855μs | 42.0424 KOps/s | 41.8926 KOps/s | $\color{#35bf28}+0.36\\%$ |
| test_step_mdp_speed[True-False-True-False-True] | 41.3300μs | 20.8066μs | 48.0617 KOps/s | 46.4127 KOps/s | $\color{#35bf28}+3.55\\%$ |
| test_step_mdp_speed[True-False-True-False-False] | 37.4810μs | 13.2205μs | 75.6401 KOps/s | 77.1199 KOps/s | $\color{#d91a1a}-1.92\\%$ |
| test_step_mdp_speed[True-False-False-True-True] | 60.9110μs | 38.2628μs | 26.1351 KOps/s | 25.6385 KOps/s | $\color{#35bf28}+1.94\\%$ |
| test_step_mdp_speed[True-False-False-True-False] | 48.6110μs | 25.2951μs | 39.5333 KOps/s | 38.7652 KOps/s | $\color{#35bf28}+1.98\\%$ |
| test_step_mdp_speed[True-False-False-False-True] | 49.4820μs | 22.2748μs | 44.8939 KOps/s | 42.9602 KOps/s | $\color{#35bf28}+4.50\\%$ |
| test_step_mdp_speed[True-False-False-False-False] | 34.7300μs | 14.8878μs | 67.1691 KOps/s | 66.9160 KOps/s | $\color{#35bf28}+0.38\\%$ |
| test_step_mdp_speed[False-True-True-True-True] | 57.9110μs | 36.3663μs | 27.4980 KOps/s | 26.6440 KOps/s | $\color{#35bf28}+3.20\\%$ |
| test_step_mdp_speed[False-True-True-True-False] | 43.5910μs | 23.5923μs | 42.3866 KOps/s | 41.6766 KOps/s | $\color{#35bf28}+1.70\\%$ |
| test_step_mdp_speed[False-True-True-False-True] | 42.6210μs | 24.9347μs | 40.1047 KOps/s | 39.1748 KOps/s | $\color{#35bf28}+2.37\\%$ |
| test_step_mdp_speed[False-True-True-False-False] | 40.4310μs | 15.0563μs | 66.4175 KOps/s | 65.5470 KOps/s | $\color{#35bf28}+1.33\\%$ |
| test_step_mdp_speed[False-True-False-True-True] | 60.7510μs | 38.2187μs | 26.1652 KOps/s | 25.6484 KOps/s | $\color{#35bf28}+2.01\\%$ |
| test_step_mdp_speed[False-True-False-True-False] | 59.7410μs | 25.5380μs | 39.1574 KOps/s | 38.9557 KOps/s | $\color{#35bf28}+0.52\\%$ |
| test_step_mdp_speed[False-True-False-False-True] | 47.3410μs | 26.7247μs | 37.4186 KOps/s | 37.4161 KOps/s | $+0.01\\%$ |
| test_step_mdp_speed[False-True-False-False-False] | 35.7810μs | 16.8557μs | 59.3271 KOps/s | 59.8172 KOps/s | $\color{#d91a1a}-0.82\\%$ |
| test_step_mdp_speed[False-False-True-True-True] | 66.8610μs | 40.0593μs | 24.9630 KOps/s | 24.4004 KOps/s | $\color{#35bf28}+2.31\\%$ |
| test_step_mdp_speed[False-False-True-True-False] | 50.3110μs | 27.3937μs | 36.5047 KOps/s | 35.9549 KOps/s | $\color{#35bf28}+1.53\\%$ |
| test_step_mdp_speed[False-False-True-False-True] | 45.1910μs | 26.3926μs | 37.8894 KOps/s | 37.7569 KOps/s | $\color{#35bf28}+0.35\\%$ |
| test_step_mdp_speed[False-False-True-False-False] | 60.7110μs | 16.8428μs | 59.3727 KOps/s | 59.0337 KOps/s | $\color{#35bf28}+0.57\\%$ |
| test_step_mdp_speed[False-False-False-True-True] | 71.9010μs | 43.7824μs | 22.8402 KOps/s | 22.1075 KOps/s | $\color{#35bf28}+3.31\\%$ |
| test_step_mdp_speed[False-False-False-True-False] | 54.8510μs | 29.0130μs | 34.4673 KOps/s | 33.2717 KOps/s | $\color{#35bf28}+3.59\\%$ |
| test_step_mdp_speed[False-False-False-False-True] | 54.2410μs | 28.2826μs | 35.3574 KOps/s | 34.6058 KOps/s | $\color{#35bf28}+2.17\\%$ |
| test_step_mdp_speed[False-False-False-False-False] | 42.3110μs | 18.6200μs | 53.7057 KOps/s | 53.7002 KOps/s | $\color{#35bf28}+0.01\\%$ |
| test_values[generalized_advantage_estimate-True-True] | 25.2782ms | 24.7481ms | 40.4072 Ops/s | 39.6337 Ops/s | $\color{#35bf28}+1.95\\%$ |
| test_values[vec_generalized_advantage_estimate-True-True] | 82.5155ms | 3.2179ms | 310.7662 Ops/s | 307.1912 Ops/s | $\color{#35bf28}+1.16\\%$ |
| test_values[td0_return_estimate-False-False] | 87.0920μs | 63.3622μs | 15.7823 KOps/s | 15.3549 KOps/s | $\color{#35bf28}+2.78\\%$ |
| test_values[td1_return_estimate-False-False] | 53.1463ms | 52.2421ms | 19.1417 Ops/s | 18.7242 Ops/s | $\color{#35bf28}+2.23\\%$ |
| test_values[vec_td1_return_estimate-False-False] | 2.0914ms | 1.7579ms | 568.8522 Ops/s | 561.6246 Ops/s | $\color{#35bf28}+1.29\\%$ |
| test_values[td_lambda_return_estimate-True-False] | 84.3973ms | 83.0125ms | 12.0464 Ops/s | 11.7500 Ops/s | $\color{#35bf28}+2.52\\%$ |
| test_values[vec_td_lambda_return_estimate-True-False] | 2.0583ms | 1.7608ms | 567.9216 Ops/s | 565.1837 Ops/s | $\color{#35bf28}+0.48\\%$ |
| test_gae_speed[generalized_advantage_estimate-False-1-512] | 23.9255ms | 23.5617ms | 42.4418 Ops/s | 41.2954 Ops/s | $\color{#35bf28}+2.78\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-1-512] | 0.8961ms | 0.6893ms | 1.4508 KOps/s | 1.4283 KOps/s | $\color{#35bf28}+1.58\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-1-512] | 0.7063ms | 0.6503ms | 1.5379 KOps/s | 1.5024 KOps/s | $\color{#35bf28}+2.36\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-True-32-512] | 1.5064ms | 1.4420ms | 693.4874 Ops/s | 685.5486 Ops/s | $\color{#35bf28}+1.16\\%$ |
| test_gae_speed[vec_generalized_advantage_estimate-False-32-512] | 0.9511ms | 0.6669ms | 1.4994 KOps/s | 1.4654 KOps/s | $\color{#35bf28}+2.32\\%$ |
| test_dqn_speed | 1.7825ms | 1.4069ms | 710.7587 Ops/s | 615.0341 Ops/s | $\textbf{\color{#35bf28}+15.56\\%}$ |
| test_ddpg_speed | 3.1400ms | 2.9077ms | 343.9097 Ops/s | 334.0920 Ops/s | $\color{#35bf28}+2.94\\%$ |
| test_sac_speed | 8.5901ms | 8.2728ms | 120.8774 Ops/s | 117.1606 Ops/s | $\color{#35bf28}+3.17\\%$ |
| test_redq_speed | 12.2713ms | 10.5346ms | 94.9252 Ops/s | 94.1559 Ops/s | $\color{#35bf28}+0.82\\%$ |
| test_redq_deprec_speed | 11.9432ms | 11.4149ms | 87.6046 Ops/s | 86.0924 Ops/s | $\color{#35bf28}+1.76\\%$ |
| test_td3_speed | 8.3068ms | 8.1637ms | 122.4937 Ops/s | 118.9615 Ops/s | $\color{#35bf28}+2.97\\%$ |
| test_cql_speed | 26.5142ms | 25.5040ms | 39.2096 Ops/s | 38.4143 Ops/s | $\color{#35bf28}+2.07\\%$ |
| test_a2c_speed | 5.8455ms | 5.5671ms | 179.6274 Ops/s | 175.8474 Ops/s | $\color{#35bf28}+2.15\\%$ |
| test_ppo_speed | 6.2387ms | 5.9625ms | 167.7147 Ops/s | 165.5029 Ops/s | $\color{#35bf28}+1.34\\%$ |
| test_reinforce_speed | 4.7688ms | 4.5528ms | 219.6445 Ops/s | 212.1346 Ops/s | $\color{#35bf28}+3.54\\%$ |
| test_iql_speed | 20.3544ms | 19.5163ms | 51.2393 Ops/s | 49.8280 Ops/s | $\color{#35bf28}+2.83\\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.7254ms | 4.5457ms | 219.9858 Ops/s | 208.0138 Ops/s | $\textbf{\color{#35bf28}+5.76\\%}$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 1.5035ms | 0.5901ms | 1.6947 KOps/s | 1.6801 KOps/s | $\color{#35bf28}+0.87\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.8036ms | 0.5673ms | 1.7629 KOps/s | 1.7492 KOps/s | $\color{#35bf28}+0.78\\%$ |
| test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.8058ms | 4.5341ms | 220.5520 Ops/s | 212.9968 Ops/s | $\color{#35bf28}+3.55\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 1.5574ms | 0.5802ms | 1.7235 KOps/s | 1.6871 KOps/s | $\color{#35bf28}+2.16\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 0.7574ms | 0.5575ms | 1.7939 KOps/s | 1.7641 KOps/s | $\color{#35bf28}+1.69\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] | 2.0630ms | 1.4993ms | 666.9939 Ops/s | 648.6846 Ops/s | $\color{#35bf28}+2.82\\%$ |
| test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] | 1.5499ms | 1.4285ms | 700.0326 Ops/s | 680.6048 Ops/s | $\color{#35bf28}+2.85\\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 5.0144ms | 4.7004ms | 212.7498 Ops/s | 205.0481 Ops/s | $\color{#35bf28}+3.76\\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.8259ms | 0.7127ms | 1.4031 KOps/s | 1.3854 KOps/s | $\color{#35bf28}+1.28\\%$ |
| test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 4.6039ms | 0.6978ms | 1.4330 KOps/s | 1.4262 KOps/s | $\color{#35bf28}+0.48\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] | 4.8953ms | 4.5737ms | 218.6414 Ops/s | 209.8962 Ops/s | $\color{#35bf28}+4.17\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] | 0.8597ms | 0.5919ms | 1.6896 KOps/s | 1.6891 KOps/s | $\color{#35bf28}+0.03\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] | 0.6541ms | 0.5668ms | 1.7643 KOps/s | 1.7428 KOps/s | $\color{#35bf28}+1.23\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] | 4.8767ms | 4.5431ms | 220.1129 Ops/s | 208.9843 Ops/s | $\textbf{\color{#35bf28}+5.33\\%}$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] | 0.6822ms | 0.5846ms | 1.7105 KOps/s | 1.6950 KOps/s | $\color{#35bf28}+0.91\\%$ |
| test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] | 4.7800ms | 0.5643ms | 1.7721 KOps/s | 1.7503 KOps/s | $\color{#35bf28}+1.24\\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] | 4.8528ms | 4.7045ms | 212.5619 Ops/s | 204.8250 Ops/s | $\color{#35bf28}+3.78\\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] | 0.8354ms | 0.7140ms | 1.4005 KOps/s | 1.3903 KOps/s | $\color{#35bf28}+0.74\\%$ |
| test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] | 0.8322ms | 0.6914ms | 1.4463 KOps/s | 1.4314 KOps/s | $\color{#35bf28}+1.04\\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] | 0.1150s | 9.1529ms | 109.2548 Ops/s | 106.5584 Ops/s | $\color{#35bf28}+2.53\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] | 17.6234ms | 15.1860ms | 65.8503 Ops/s | 64.3408 Ops/s | $\color{#35bf28}+2.35\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] | 2.3181ms | 1.2596ms | 793.9225 Ops/s | 762.4095 Ops/s | $\color{#35bf28}+4.13\\%$ |
| test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] | 0.1060s | 6.8648ms | 145.6697 Ops/s | 144.2686 Ops/s | $\color{#35bf28}+0.97\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] | 17.9927ms | 15.1991ms | 65.7932 Ops/s | 64.3440 Ops/s | $\color{#35bf28}+2.25\\%$ |
| test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] | 2.5254ms | 1.2811ms | 780.6047 Ops/s | 756.7288 Ops/s | $\color{#35bf28}+3.16\\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] | 0.1109s | 9.3959ms | 106.4296 Ops/s | 105.9160 Ops/s | $\color{#35bf28}+0.48\\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] | 18.2568ms | 15.5238ms | 64.4171 Ops/s | 62.7972 Ops/s | $\color{#35bf28}+2.58\\%$ |
| test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] | 2.9196ms | 1.6388ms | 610.2095 Ops/s | 595.8575 Ops/s | $\color{#35bf28}+2.41\\%$ |
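
For readers checking the numbers, the Change column in the benchmark results appears to be the relative difference between the Ops column and the Ops on Repo `HEAD` column. The sketch below recomputes it for a few GPU rows. The function names are hypothetical, and the 5% cut-off for counting a row as improved or worsened is an assumption inferred from which rows are bolded and from the Improved/Worsened totals; it is not taken from the benchmark workflow itself.

```python
def percent_change(ops: float, ops_head: float) -> float:
    """Relative change of the PR throughput versus the HEAD baseline, in percent."""
    return (ops - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a row. The 5% threshold is an assumption inferred from the bolded rows."""
    if change_pct > threshold_pct:
        return "improved"
    if change_pct < -threshold_pct:
        return "worsened"
    return "unchanged"


# Rows copied from the GPU results: (name, Ops, Ops on Repo HEAD), both in Ops/s.
rows = [
    ("test_single", 8.6555, 8.3625),
    ("test_sync", 9.3392, 9.5406),
    ("test_async", 11.8579, 10.3897),
    ("test_dqn_speed", 710.7587, 615.0341),
]

for name, ops, ops_head in rows:
    change = percent_change(ops, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Under these assumptions the recomputed values agree with the reported +3.50%, -2.11%, +14.13% and +15.56%, and only the last two rows cross the assumed threshold.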