pytorch / tensordict

TensorDict is a PyTorch-dedicated tensor container.
MIT License

[Refactor] Remove non-ascii characters from code base #928

Closed by vmoens 3 months ago

vmoens commented 3 months ago

The only remaining non-ASCII characters are in the tutorials, which should not be a problem for compiled jobs. In tensordict_memory.py:

  tensordict
  ├── a.memmap
  ├── b
  │   ├── c.memmap
  │   └── meta.json
  └── meta.json
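A minimal sketch of how one might check a code base for leftover non-ASCII characters. This is not the script used in this PR; the helper name `find_non_ascii` and the choice to scan only `.py` files are assumptions made for illustration.

```python
from pathlib import Path


def find_non_ascii(root, suffixes=(".py",)):
    """Return (path, line, column, char) for the first non-ASCII character on each offending line.

    Hypothetical helper: not part of tensordict.
    """
    hits = []
    for path in sorted(Path(root).rglob("*")):
        # Only look at regular files with one of the requested suffixes.
        if not path.is_file() or path.suffix not in suffixes:
            continue
        text = path.read_text(encoding="utf-8", errors="replace")
        for lineno, line in enumerate(text.splitlines(), start=1):
            if line.isascii():
                continue
            # Locate the first character outside the 7-bit ASCII range (1-based column).
            col, ch = next((i, c) for i, c in enumerate(line, start=1) if ord(c) > 127)
            hits.append((str(path), lineno, col, ch))
    return hits
```

Calling `find_non_ascii("tutorials")` on a tree that contains the directory listing above would be expected to report the box-drawing characters (such as `├` and `└`), since they fall outside the ASCII range.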
github-actions[bot] commented 3 months ago

$\color{#D29922}\textsf{\Large ⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 219. Improved: $\large\color{#35bf28}13$. Worsened: $\large\color{#d91a1a}17$.

Detailed results:

| Name | Max | Mean | Ops | Ops on Repo `HEAD` | Change |
| --- | --- | --- | --- | --- | --- |
| test_plain_set_nested | 55.8950μs | 20.9135μs | 47.8160 KOps/s | 48.0711 KOps/s | $\color{#d91a1a}-0.53\\%$ |
| test_plain_set_stack_nested | 49.7940μs | 21.1195μs | 47.3496 KOps/s | 46.6807 KOps/s | $\color{#35bf28}+1.43\\%$ |
| test_plain_set_nested_inplace | 66.1050μs | 23.2324μs | 43.0433 KOps/s | 43.4740 KOps/s | $\color{#d91a1a}-0.99\\%$ |
| test_plain_set_stack_nested_inplace | 60.0430μs | 23.1648μs | 43.1690 KOps/s | 43.5240 KOps/s | $\color{#d91a1a}-0.82\\%$ |
| test_items | 24.3460μs | 2.6786μs | 373.3312 KOps/s | 376.4264 KOps/s | $\color{#d91a1a}-0.82\\%$ |
| test_items_nested | 0.4165ms | 0.3338ms | 2.9957 KOps/s | 2.8461 KOps/s | $\textbf{\color{#35bf28}+5.26\\%}$ |
| test_items_nested_locked | 1.7827ms | 0.3391ms | 2.9493 KOps/s | 2.9434 KOps/s | $\color{#35bf28}+0.20\\%$ |
| test_items_nested_leaf | 0.1447ms | 83.6917μs | 11.9486 KOps/s | 11.9549 KOps/s | $\color{#d91a1a}-0.05\\%$ |
| test_items_stack_nested | 0.6778ms | 0.3399ms | 2.9423 KOps/s | 2.9312 KOps/s | $\color{#35bf28}+0.38\\%$ |
| test_items_stack_nested_leaf | 0.1496ms | 82.4186μs | 12.1332 KOps/s | 11.5075 KOps/s | $\textbf{\color{#35bf28}+5.44\\%}$ |
| test_items_stack_nested_locked | 0.4828ms | 0.3370ms | 2.9670 KOps/s | 2.9326 KOps/s | $\color{#35bf28}+1.18\\%$ |
| test_keys | 33.7940μs | 3.9569μs | 252.7238 KOps/s | 257.2194 KOps/s | $\color{#d91a1a}-1.75\\%$ |
| test_keys_nested | 0.2336ms | 0.1439ms | 6.9499 KOps/s | 6.9969 KOps/s | $\color{#d91a1a}-0.67\\%$ |
| test_keys_nested_locked | 0.7937ms | 0.1490ms | 6.7115 KOps/s | 6.7243 KOps/s | $\color{#d91a1a}-0.19\\%$ |
| test_keys_nested_leaf | 0.2065ms | 0.1230ms | 8.1297 KOps/s | 8.0761 KOps/s | $\color{#35bf28}+0.66\\%$ |
| test_keys_stack_nested | 0.3017ms | 0.1439ms | 6.9489 KOps/s | 6.9290 KOps/s | $\color{#35bf28}+0.29\\%$ |
| test_keys_stack_nested_leaf | 0.2205ms | 0.1239ms | 8.0697 KOps/s | 8.2628 KOps/s | $\color{#d91a1a}-2.34\\%$ |
| test_keys_stack_nested_locked | 0.2430ms | 0.1490ms | 6.7111 KOps/s | 6.7337 KOps/s | $\color{#d91a1a}-0.34\\%$ |
| test_values | 8.7288μs | 1.2979μs | 770.5033 KOps/s | 863.8743 KOps/s | $\textbf{\color{#d91a1a}-10.81\\%}$ |
| test_values_nested | 91.1120μs | 50.3569μs | 19.8582 KOps/s | 20.0785 KOps/s | $\color{#d91a1a}-1.10\\%$ |
| test_values_nested_locked | 87.1540μs | 50.1274μs | 19.9492 KOps/s | 19.9610 KOps/s | $\color{#d91a1a}-0.06\\%$ |
| test_values_nested_leaf | 95.0090μs | 44.8082μs | 22.3173 KOps/s | 22.4302 KOps/s | $\color{#d91a1a}-0.50\\%$ |
| test_values_stack_nested | 0.1135ms | 51.2187μs | 19.5241 KOps/s | 19.2609 KOps/s | $\color{#35bf28}+1.37\\%$ |
| test_values_stack_nested_leaf | 82.5950μs | 44.2620μs | 22.5927 KOps/s | 22.4460 KOps/s | $\color{#35bf28}+0.65\\%$ |
| test_values_stack_nested_locked | 89.6990μs | 51.5094μs | 19.4139 KOps/s | 19.7988 KOps/s | $\color{#d91a1a}-1.94\\%$ |
| test_membership | 7.0676μs | 0.7451μs | 1.3420 MOps/s | 1.1002 MOps/s | $\textbf{\color{#35bf28}+21.97\\%}$ |
| test_membership_nested | 25.6590μs | 2.6071μs | 383.5746 KOps/s | 390.7015 KOps/s | $\color{#d91a1a}-1.82\\%$ |
| test_membership_nested_leaf | 32.4310μs | 2.5988μs | 384.7977 KOps/s | 384.4348 KOps/s | $\color{#35bf28}+0.09\\%$ |
| test_membership_stacked_nested | 20.5890μs | 2.5891μs | 386.2343 KOps/s | 387.2802 KOps/s | $\color{#d91a1a}-0.27\\%$ |
| test_membership_stacked_nested_leaf | 21.2200μs | 2.5976μs | 384.9716 KOps/s | 388.2333 KOps/s | $\color{#d91a1a}-0.84\\%$ |
| test_membership_nested_last | 45.1750μs | 3.9175μs | 255.2680 KOps/s | 258.0816 KOps/s | $\color{#d91a1a}-1.09\\%$ |
| test_membership_nested_leaf_last | 29.0650μs | 3.9208μs | 255.0492 KOps/s | 246.1454 KOps/s | $\color{#35bf28}+3.62\\%$ |
| test_membership_stacked_nested_last | 36.9500μs | 12.6883μs | 78.8127 KOps/s | 202.0898 KOps/s | $\textbf{\color{#d91a1a}-61.00\\%}$ |
| test_membership_stacked_nested_leaf_last | 44.2230μs | 12.5975μs | 79.3805 KOps/s | 200.7357 KOps/s | $\textbf{\color{#d91a1a}-60.46\\%}$ |
| test_nested_getleaf | 39.9650μs | 10.4000μs | 96.1534 KOps/s | 94.0152 KOps/s | $\color{#35bf28}+2.27\\%$ |
| test_nested_get | 34.2440μs | 9.9430μs | 100.5732 KOps/s | 98.4070 KOps/s | $\color{#35bf28}+2.20\\%$ |
| test_stacked_getleaf | 44.1730μs | 10.4317μs | 95.8618 KOps/s | 93.8310 KOps/s | $\color{#35bf28}+2.16\\%$ |
| test_stacked_get | 37.5510μs | 9.8203μs | 101.8303 KOps/s | 100.4981 KOps/s | $\color{#35bf28}+1.33\\%$ |
| test_nested_getitemleaf | 34.4950μs | 11.1240μs | 89.8957 KOps/s | 90.3141 KOps/s | $\color{#d91a1a}-0.46\\%$ |
| test_nested_getitem | 36.4280μs | 10.2585μs | 97.4798 KOps/s | 96.7080 KOps/s | $\color{#35bf28}+0.80\\%$ |
| test_stacked_getitemleaf | 37.2200μs | 11.1032μs | 90.0641 KOps/s | 90.5660 KOps/s | $\color{#d91a1a}-0.55\\%$ |
| test_stacked_getitem | 35.3760μs | 10.0074μs | 99.9265 KOps/s | 97.5860 KOps/s | $\color{#35bf28}+2.40\\%$ |
| test_lock_nested | 86.1274ms | 0.5791ms | 1.7269 KOps/s | 1.9784 KOps/s | $\textbf{\color{#d91a1a}-12.71\\%}$ |
| test_lock_stack_nested | 0.6589ms | 0.4458ms | 2.2433 KOps/s | 2.1268 KOps/s | $\textbf{\color{#35bf28}+5.48\\%}$ |
| test_unlock_nested | 86.4025ms | 0.5030ms | 1.9880 KOps/s | 2.4306 KOps/s | $\textbf{\color{#d91a1a}-18.21\\%}$ |
| test_unlock_stack_nested | 0.6935ms | 0.3653ms | 2.7373 KOps/s | 2.6223 KOps/s | $\color{#35bf28}+4.39\\%$ |
| test_flatten_speed | 0.5884ms | 0.1043ms | 9.5909 KOps/s | 9.7046 KOps/s | $\color{#d91a1a}-1.17\\%$ |
| test_unflatten_speed | 0.8891ms | 0.4360ms | 2.2936 KOps/s | 2.3091 KOps/s | $\color{#d91a1a}-0.67\\%$ |
| test_common_ops | 5.2726ms | 1.0992ms | 909.7854 Ops/s | 937.8169 Ops/s | $\color{#d91a1a}-2.99\\%$ |
| test_creation | 25.9090μs | 2.1440μs | 466.4166 KOps/s | 499.7376 KOps/s | $\textbf{\color{#d91a1a}-6.67\\%}$ |
| test_creation_empty | 59.8730μs | 17.3999μs | 57.4714 KOps/s | 61.5791 KOps/s | $\textbf{\color{#d91a1a}-6.67\\%}$ |
| test_creation_nested_1 | 61.0150μs | 20.6670μs | 48.3864 KOps/s | 52.1536 KOps/s | $\textbf{\color{#d91a1a}-7.22\\%}$ |
| test_creation_nested_2 | 60.9150μs | 24.4505μs | 40.8989 KOps/s | 43.1526 KOps/s | $\textbf{\color{#d91a1a}-5.22\\%}$ |
| test_clone | 0.1475ms | 16.7068μs | 59.8559 KOps/s | 59.4522 KOps/s | $\color{#35bf28}+0.68\\%$ |
| test_getitem[int] | 1.2735ms | 16.7847μs | 59.5779 KOps/s | 60.8232 KOps/s | $\color{#d91a1a}-2.05\\%$ |
| test_getitem[slice_int] | 0.1365ms | 31.5971μs | 31.6485 KOps/s | 32.9158 KOps/s | $\color{#d91a1a}-3.85\\%$ |
| test_getitem[range] | 0.2190ms | 55.5106μs | 18.0146 KOps/s | 17.0609 KOps/s | $\textbf{\color{#35bf28}+5.59\\%}$ |
| test_getitem[tuple] | 0.1620ms | 25.3508μs | 39.4465 KOps/s | 39.9816 KOps/s | $\color{#d91a1a}-1.34\\%$ |
| test_getitem[list] | 0.2423ms | 50.5918μs | 19.7660 KOps/s | 19.2905 KOps/s | $\color{#35bf28}+2.47\\%$ |
| test_setitem_dim[int] | 81.1320μs | 40.6198μs | 24.6185 KOps/s | 25.4427 KOps/s | $\color{#d91a1a}-3.24\\%$ |
| test_setitem_dim[slice_int] | 0.1210ms | 71.8755μs | 13.9130 KOps/s | 13.8046 KOps/s | $\color{#35bf28}+0.78\\%$ |
| test_setitem_dim[range] | 0.1543ms | 92.4978μs | 10.8111 KOps/s | 10.8165 KOps/s | $\color{#d91a1a}-0.05\\%$ |
| test_setitem_dim[tuple] | 0.1273ms | 59.4550μs | 16.8194 KOps/s | 17.2425 KOps/s | $\color{#d91a1a}-2.45\\%$ |
| test_setitem | 0.1114ms | 28.6108μs | 34.9519 KOps/s | 35.4679 KOps/s | $\color{#d91a1a}-1.45\\%$ |
| test_set | 0.1375ms | 27.9890μs | 35.7284 KOps/s | 36.2040 KOps/s | $\color{#d91a1a}-1.31\\%$ |
| test_set_shared | 1.1774ms | 0.2150ms | 4.6504 KOps/s | 4.6281 KOps/s | $\color{#35bf28}+0.48\\%$ |
| test_update | 0.1687ms | 34.9897μs | 28.5798 KOps/s | 29.2789 KOps/s | $\color{#d91a1a}-2.39\\%$ |
| test_update_nested | 0.1752ms | 44.8940μs | 22.2747 KOps/s | 22.6412 KOps/s | $\color{#d91a1a}-1.62\\%$ |
| test_update__nested | 0.1271ms | 34.3340μs | 29.1256 KOps/s | 29.2021 KOps/s | $\color{#d91a1a}-0.26\\%$ |
| test_set_nested | 0.1679ms | 29.9586μs | 33.3793 KOps/s | 33.0539 KOps/s | $\color{#35bf28}+0.98\\%$ |
| test_set_nested_new | 0.1553ms | 35.2342μs | 28.3815 KOps/s | 28.7684 KOps/s | $\color{#d91a1a}-1.34\\%$ |
| test_select | 0.1770ms | 51.8190μs | 19.2979 KOps/s | 19.2344 KOps/s | $\color{#35bf28}+0.33\\%$ |
| test_select_nested | 0.1255ms | 59.0332μs | 16.9396 KOps/s | 16.7965 KOps/s | $\color{#35bf28}+0.85\\%$ |
| test_exclude_nested | 0.1720ms | 79.2461μs | 12.6189 KOps/s | 13.1332 KOps/s | $\color{#d91a1a}-3.92\\%$ |
| test_empty[True] | 0.7368ms | 0.3333ms | 3.0000 KOps/s | 3.1437 KOps/s | $\color{#d91a1a}-4.57\\%$ |
| test_empty[False] | 11.3372μs | 1.1416μs | 875.9374 KOps/s | 867.4028 KOps/s | $\color{#35bf28}+0.98\\%$ |
| test_unbind_speed | 0.4897ms | 0.3034ms | 3.2960 KOps/s | 3.2468 KOps/s | $\color{#35bf28}+1.51\\%$ |
| test_unbind_speed_stack0 | 0.6677ms | 0.2919ms | 3.4264 KOps/s | 3.3461 KOps/s | $\color{#35bf28}+2.40\\%$ |
| test_unbind_speed_stack1 | 89.4388ms | 0.7655ms | 1.3063 KOps/s | 1.3908 KOps/s | $\textbf{\color{#d91a1a}-6.08\\%}$ |
| test_split | 88.0522ms | 2.1247ms | 470.6457 Ops/s | 469.0615 Ops/s | $\color{#35bf28}+0.34\\%$ |
| test_chunk | 90.1936ms | 2.1176ms | 472.2249 Ops/s | 466.8412 Ops/s | $\color{#35bf28}+1.15\\%$ |
| test_creation[device0] | 5.0300ms | 0.1241ms | 8.0597 KOps/s | 8.3812 KOps/s | $\color{#d91a1a}-3.84\\%$ |
| test_creation_from_tensor | 0.2492ms | 0.1218ms | 8.2072 KOps/s | 8.1513 KOps/s | $\color{#35bf28}+0.69\\%$ |
| test_add_one[memmap_tensor0] | 0.1382ms | 7.2398μs | 138.1250 KOps/s | 130.2737 KOps/s | $\textbf{\color{#35bf28}+6.03\\%}$ |
| test_contiguous[memmap_tensor0] | 26.6100μs | 2.0646μs | 484.3538 KOps/s | 499.8383 KOps/s | $\color{#d91a1a}-3.10\\%$ |
| test_stack[memmap_tensor0] | 46.0260μs | 5.4874μs | 182.2353 KOps/s | 180.0454 KOps/s | $\color{#35bf28}+1.22\\%$ |
| test_memmaptd_index | 1.1911ms | 0.4017ms | 2.4896 KOps/s | 2.4497 KOps/s | $\color{#35bf28}+1.63\\%$ |
| test_memmaptd_index_astensor | 1.0616ms | 0.4848ms | 2.0626 KOps/s | 2.0603 KOps/s | $\color{#35bf28}+0.11\\%$ |
| test_memmaptd_index_op | 1.4618ms | 1.0103ms | 989.7774 Ops/s | 994.9565 Ops/s | $\color{#d91a1a}-0.52\\%$ |
| test_serialize_model | 0.1221s | 0.1178s | 8.4904 Ops/s | 7.2329 Ops/s | $\textbf{\color{#35bf28}+17.39\\%}$ |
| test_serialize_model_pickle | 0.5034s | 0.4016s | 2.4900 Ops/s | 2.5022 Ops/s | $\color{#d91a1a}-0.49\\%$ |
| test_serialize_weights | 0.1281s | 0.1186s | 8.4283 Ops/s | 8.2493 Ops/s | $\color{#35bf28}+2.17\\%$ |
| test_serialize_weights_returnearly | 0.1689s | 0.1614s | 6.1956 Ops/s | 6.3170 Ops/s | $\color{#d91a1a}-1.92\\%$ |
| test_serialize_weights_pickle | 0.4581s | 0.4032s | 2.4801 Ops/s | 2.1725 Ops/s | $\textbf{\color{#35bf28}+14.16\\%}$ |
| test_serialize_weights_filesystem | 0.1479s | 0.1412s | 7.0804 Ops/s | 6.4638 Ops/s | $\textbf{\color{#35bf28}+9.54\\%}$ |
| test_serialize_model_filesystem | 0.2485s | 0.1667s | 5.9997 Ops/s | 6.5307 Ops/s | $\textbf{\color{#d91a1a}-8.13\\%}$ |
| test_reshape_pytree | 94.7090μs | 40.6623μs | 24.5928 KOps/s | 25.1970 KOps/s | $\color{#d91a1a}-2.40\\%$ |
| test_reshape_td | 0.1073ms | 48.1918μs | 20.7504 KOps/s | 21.8049 KOps/s | $\color{#d91a1a}-4.84\\%$ |
| test_view_pytree | 0.1067ms | 40.2549μs | 24.8417 KOps/s | 25.1110 KOps/s | $\color{#d91a1a}-1.07\\%$ |
| test_view_td | 0.1159ms | 54.0255μs | 18.5098 KOps/s | 18.8377 KOps/s | $\color{#d91a1a}-1.74\\%$ |
| test_unbind_pytree | 96.0410μs | 37.8801μs | 26.3991 KOps/s | 26.8013 KOps/s | $\color{#d91a1a}-1.50\\%$ |
| test_unbind_td | 0.3718ms | 46.4291μs | 21.5382 KOps/s | 22.0659 KOps/s | $\color{#d91a1a}-2.39\\%$ |
| test_split_pytree | 97.6230μs | 40.6464μs | 24.6024 KOps/s | 25.2738 KOps/s | $\color{#d91a1a}-2.66\\%$ |
| test_split_td | 0.5417ms | 57.9902μs | 17.2443 KOps/s | 17.6818 KOps/s | $\color{#d91a1a}-2.47\\%$ |
| test_add_pytree | 0.1101ms | 46.6421μs | 21.4398 KOps/s | 21.0834 KOps/s | $\color{#35bf28}+1.69\\%$ |
| test_add_td | 0.1525ms | 84.6597μs | 11.8120 KOps/s | 12.4464 KOps/s | $\textbf{\color{#d91a1a}-5.10\\%}$ |
| test_compile_add_one_nested[tensordict-compile] | 0.1175ms | 55.2936μs | 18.0853 KOps/s | 18.2029 KOps/s | $\color{#d91a1a}-0.65\\%$ |
| test_compile_add_one_nested[tensordict-eager] | 0.3708ms | 0.1968ms | 5.0803 KOps/s | 5.2901 KOps/s | $\color{#d91a1a}-3.97\\%$ |
| test_compile_add_one_nested[pytree-compile] | 0.2413ms | 56.8180μs | 17.6001 KOps/s | 18.3255 KOps/s | $\color{#d91a1a}-3.96\\%$ |
| test_compile_add_one_nested[pytree-eager] | 0.3816ms | 0.1438ms | 6.9550 KOps/s | 6.8048 KOps/s | $\color{#35bf28}+2.21\\%$ |
| test_compile_copy_nested[tensordict-compile] | 70.3720μs | 21.2663μs | 47.0229 KOps/s | 49.2020 KOps/s | $\color{#d91a1a}-4.43\\%$ |
| test_compile_copy_nested[tensordict-eager] | 0.1490ms | 66.1066μs | 15.1271 KOps/s | 15.4880 KOps/s | $\color{#d91a1a}-2.33\\%$ |
| test_compile_copy_nested[pytree-compile] | 6.7715ms | 79.9490μs | 12.5080 KOps/s | 12.4110 KOps/s | $\color{#35bf28}+0.78\\%$ |
| test_compile_copy_nested[pytree-eager] | 0.1415ms | 70.8658μs | 14.1112 KOps/s | 13.8767 KOps/s | $\color{#35bf28}+1.69\\%$ |
| test_compile_add_one_flat[tensordict-compile] | 0.3903ms | 0.1811ms | 5.5218 KOps/s | 5.6172 KOps/s | $\color{#d91a1a}-1.70\\%$ |
| test_compile_add_one_flat[tensordict-eager] | 0.3054ms | 0.2012ms | 4.9705 KOps/s | 5.2433 KOps/s | $\textbf{\color{#d91a1a}-5.20\\%}$ |
| test_compile_add_one_flat[tensorclass-compile] | 0.1310ms | 39.4034μs | 25.3785 KOps/s | 25.2338 KOps/s | $\color{#35bf28}+0.57\\%$ |
| test_compile_add_one_flat[tensorclass-eager] | 0.5019ms | 71.5390μs | 13.9784 KOps/s | 14.4472 KOps/s | $\color{#d91a1a}-3.24\\%$ |
| test_compile_add_one_flat[pytree-compile] | 0.2929ms | 0.1782ms | 5.6111 KOps/s | 5.7122 KOps/s | $\color{#d91a1a}-1.77\\%$ |
| test_compile_add_one_flat[pytree-eager] | 0.4962ms | 0.2908ms | 3.4385 KOps/s | 3.3962 KOps/s | $\color{#35bf28}+1.25\\%$ |
| test_compile_add_self_flat[tensordict-eager] | 0.3847ms | 0.2143ms | 4.6659 KOps/s | 4.8111 KOps/s | $\color{#d91a1a}-3.02\\%$ |
| test_compile_add_self_flat[tensordict-compile] | 0.4167ms | 0.1920ms | 5.2081 KOps/s | 5.6038 KOps/s | $\textbf{\color{#d91a1a}-7.06\\%}$ |
| test_compile_add_self_flat[tensorclass-eager] | 0.8634ms | 65.1023μs | 15.3604 KOps/s | 15.9337 KOps/s | $\color{#d91a1a}-3.60\\%$ |
| test_compile_add_self_flat[tensorclass-compile] | 0.1078ms | 40.7780μs | 24.5230 KOps/s | 25.2604 KOps/s | $\color{#d91a1a}-2.92\\%$ |
| test_compile_add_self_flat[pytree-eager] | 0.3283ms | 0.2412ms | 4.1459 KOps/s | 4.1354 KOps/s | $\color{#35bf28}+0.25\\%$ |
| test_compile_add_self_flat[pytree-compile] | 0.3397ms | 0.1793ms | 5.5786 KOps/s | 5.6946 KOps/s | $\color{#d91a1a}-2.04\\%$ |
| test_compile_copy_flat[tensordict-compile] | 0.1933ms | 0.1116ms | 8.9584 KOps/s | 9.2160 KOps/s | $\color{#d91a1a}-2.79\\%$ |
| test_compile_copy_flat[tensordict-eager] | 0.1149ms | 56.8293μs | 17.5966 KOps/s | 17.8602 KOps/s | $\color{#d91a1a}-1.48\\%$ |
| test_compile_copy_flat[pytree-compile] | 0.1966ms | 80.8568μs | 12.3675 KOps/s | 12.4903 KOps/s | $\color{#d91a1a}-0.98\\%$ |
| test_compile_copy_flat[pytree-eager] | 0.1390ms | 71.9326μs | 13.9019 KOps/s | 13.8367 KOps/s | $\color{#35bf28}+0.47\\%$ |
| test_compile_assign_and_add[tensordict-compile] | 0.2968ms | 0.1886ms | 5.3021 KOps/s | 5.1017 KOps/s | $\color{#35bf28}+3.93\\%$ |
| test_compile_assign_and_add[tensordict-eager] | 2.7095ms | 1.6171ms | 618.3755 Ops/s | 612.1825 Ops/s | $\color{#35bf28}+1.01\\%$ |
| test_compile_assign_and_add[pytree-compile] | 0.2711ms | 0.1896ms | 5.2742 KOps/s | 5.1781 KOps/s | $\color{#35bf28}+1.86\\%$ |
| test_compile_assign_and_add[pytree-eager] | 1.2010ms | 1.0775ms | 928.0374 Ops/s | 912.0161 Ops/s | $\color{#35bf28}+1.76\\%$ |
| test_compile_assign_and_add_stack[compile] | 0.5438ms | 0.4161ms | 2.4035 KOps/s | 2.3810 KOps/s | $\color{#35bf28}+0.94\\%$ |
| test_compile_assign_and_add_stack[eager] | 5.2937ms | 3.7575ms | 266.1316 Ops/s | 270.1539 Ops/s | $\color{#d91a1a}-1.49\\%$ |
| test_compile_indexing[tensor-tensordict-compile] | 0.1052ms | 33.6768μs | 29.6940 KOps/s | 30.2582 KOps/s | $\color{#d91a1a}-1.86\\%$ |
| test_compile_indexing[tensor-tensordict-eager] | 0.6651ms | 48.3530μs | 20.6812 KOps/s | 20.6701 KOps/s | $\color{#35bf28}+0.05\\%$ |
| test_compile_indexing[tensor-tensorclass-compile] | 89.1480μs | 28.6534μs | 34.8999 KOps/s | 35.0313 KOps/s | $\color{#d91a1a}-0.38\\%$ |
| test_compile_indexing[tensor-tensorclass-eager] | 84.5190μs | 30.1280μs | 33.1917 KOps/s | 33.6990 KOps/s | $\color{#d91a1a}-1.51\\%$ |
| test_compile_indexing[tensor-pytree-compile] | 75.1410μs | 28.4928μs | 35.0966 KOps/s | 35.5876 KOps/s | $\color{#d91a1a}-1.38\\%$ |
| test_compile_indexing[tensor-pytree-eager] | 87.7950μs | 29.6702μs | 33.7038 KOps/s | 33.4968 KOps/s | $\color{#35bf28}+0.62\\%$ |
| test_compile_indexing[slice-tensordict-compile] | 0.1568ms | 74.0708μs | 13.5006 KOps/s | 13.8783 KOps/s | $\color{#d91a1a}-2.72\\%$ |
| test_compile_indexing[slice-tensordict-eager] | 0.6984ms | 27.8253μs | 35.9386 KOps/s | 36.4400 KOps/s | $\color{#d91a1a}-1.38\\%$ |
| test_compile_indexing[slice-tensorclass-compile] | 0.1279ms | 68.2677μs | 14.6482 KOps/s | 14.7012 KOps/s | $\color{#d91a1a}-0.36\\%$ |
| test_compile_indexing[slice-tensorclass-eager] | 85.4910μs | 24.9493μs | 40.0814 KOps/s | 40.2148 KOps/s | $\color{#d91a1a}-0.33\\%$ |
| test_compile_indexing[slice-pytree-compile] | 0.1514ms | 68.9228μs | 14.5090 KOps/s | 14.5549 KOps/s | $\color{#d91a1a}-0.32\\%$ |
| test_compile_indexing[slice-pytree-eager] | 0.1043ms | 24.9140μs | 40.1381 KOps/s | 40.3622 KOps/s | $\color{#d91a1a}-0.56\\%$ |
| test_compile_indexing[int-tensordict-compile] | 0.2036ms | 75.0126μs | 13.3311 KOps/s | 13.8032 KOps/s | $\color{#d91a1a}-3.42\\%$ |
| test_compile_indexing[int-tensordict-eager] | 0.9781ms | 28.6248μs | 34.9347 KOps/s | 36.9114 KOps/s | $\textbf{\color{#d91a1a}-5.36\\%}$ |
| test_compile_indexing[int-tensorclass-compile] | 0.1551ms | 67.6268μs | 14.7870 KOps/s | 14.7084 KOps/s | $\color{#35bf28}+0.53\\%$ |
| test_compile_indexing[int-tensorclass-eager] | 0.1360ms | 23.5171μs | 42.5223 KOps/s | 41.7128 KOps/s | $\color{#35bf28}+1.94\\%$ |
| test_compile_indexing[int-pytree-compile] | 0.1716ms | 66.8542μs | 14.9579 KOps/s | 14.7530 KOps/s | $\color{#35bf28}+1.39\\%$ |
| test_compile_indexing[int-pytree-eager] | 65.1030μs | 23.3543μs | 42.8186 KOps/s | 41.3021 KOps/s | $\color{#35bf28}+3.67\\%$ |
| test_mod_add[eager] | 81.4830μs | 24.3843μs | 41.0100 KOps/s | 42.5890 KOps/s | $\color{#d91a1a}-3.71\\%$ |
| test_mod_add[compile] | 84.3590μs | 36.8983μs | 27.1015 KOps/s | 25.5905 KOps/s | $\textbf{\color{#35bf28}+5.90\\%}$ |
| test_mod_add[compile-overhead] | 0.1031ms | 37.9706μs | 26.3361 KOps/s | 26.3700 KOps/s | $\color{#d91a1a}-0.13\\%$ |
| test_mod_wrap[eager] | 0.4913ms | 0.2113ms | 4.7322 KOps/s | 4.5294 KOps/s | $\color{#35bf28}+4.48\\%$ |
| test_mod_wrap[compile] | 1.7616ms | 0.2398ms | 4.1704 KOps/s | 4.2942 KOps/s | $\color{#d91a1a}-2.88\\%$ |
| test_mod_wrap[compile-overhead] | 0.3717ms | 0.2330ms | 4.2916 KOps/s | 4.3338 KOps/s | $\color{#d91a1a}-0.97\\%$ |
| test_mod_wrap_and_backward[eager] | 12.6280ms | 10.9539ms | 91.2916 Ops/s | 89.7051 Ops/s | $\color{#35bf28}+1.77\\%$ |
| test_mod_wrap_and_backward[compile] | 12.9671ms | 11.0561ms | 90.4478 Ops/s | 89.4878 Ops/s | $\color{#35bf28}+1.07\\%$ |
| test_mod_wrap_and_backward[compile-overhead] | 11.6052ms | 11.0307ms | 90.6562 Ops/s | 84.3992 Ops/s | $\textbf{\color{#35bf28}+7.41\\%}$ |
| test_seq_add[eager] | 0.1789ms | 85.6239μs | 11.6790 KOps/s | 11.9758 KOps/s | $\color{#d91a1a}-2.48\\%$ |
| test_seq_add[compile] | 0.1548ms | 62.5514μs | 15.9869 KOps/s | 16.2915 KOps/s | $\color{#d91a1a}-1.87\\%$ |
| test_seq_add[compile-overhead] | 0.1531ms | 60.7957μs | 16.4485 KOps/s | 16.7450 KOps/s | $\color{#d91a1a}-1.77\\%$ |
| test_seq_wrap[eager] | 0.5989ms | 0.3805ms | 2.6282 KOps/s | 2.7098 KOps/s | $\color{#d91a1a}-3.01\\%$ |
| test_seq_wrap[compile] | 0.6426ms | 0.2724ms | 3.6708 KOps/s | 3.8180 KOps/s | $\color{#d91a1a}-3.86\\%$ |
| test_seq_wrap[compile-overhead] | 0.6634ms | 0.2718ms | 3.6790 KOps/s | 3.8092 KOps/s | $\color{#d91a1a}-3.42\\%$ |
| test_func_call_runtime[False-eager] | 0.8109ms | 0.5369ms | 1.8625 KOps/s | 1.8806 KOps/s | $\color{#d91a1a}-0.96\\%$ |
| test_func_call_runtime[False-compile] | 0.6247ms | 0.5097ms | 1.9620 KOps/s | 2.0446 KOps/s | $\color{#d91a1a}-4.04\\%$ |
| test_func_call_runtime[False-compile-overhead] | 0.6398ms | 0.5092ms | 1.9638 KOps/s | 2.0167 KOps/s | $\color{#d91a1a}-2.63\\%$ |
| test_func_call_runtime[True-eager] | 1.2054ms | 0.7714ms | 1.2963 KOps/s | 1.2884 KOps/s | $\color{#35bf28}+0.62\\%$ |
| test_func_call_runtime[True-compile] | 1.0028ms | 0.5237ms | 1.9095 KOps/s | 1.9283 KOps/s | $\color{#d91a1a}-0.98\\%$ |
| test_func_call_runtime[True-compile-overhead] | 0.6837ms | 0.5236ms | 1.9098 KOps/s | 1.9246 KOps/s | $\color{#d91a1a}-0.77\\%$ |
| test_func_call_cm_runtime[False-eager] | 0.9625ms | 0.5237ms | 1.9096 KOps/s | 1.8704 KOps/s | $\color{#35bf28}+2.09\\%$ |
| test_func_call_cm_runtime[False-compile] | 0.6991ms | 0.5004ms | 1.9983 KOps/s | 1.9960 KOps/s | $\color{#35bf28}+0.11\\%$ |
| test_func_call_cm_runtime[False-compile-overhead] | 0.8614ms | 0.5053ms | 1.9790 KOps/s | 2.0125 KOps/s | $\color{#d91a1a}-1.66\\%$ |
| test_func_call_cm_runtime[True-eager] | 1.1757ms | 0.8839ms | 1.1313 KOps/s | 1.1001 KOps/s | $\color{#35bf28}+2.84\\%$ |
| test_func_call_cm_runtime[True-compile] | 1.1295ms | 0.8364ms | 1.1956 KOps/s | 1.1667 KOps/s | $\color{#35bf28}+2.48\\%$ |
| test_func_call_cm_runtime[True-compile-overhead] | 1.1396ms | 0.8347ms | 1.1980 KOps/s | 1.1533 KOps/s | $\color{#35bf28}+3.87\\%$ |
| test_distributed | 0.2849ms | 0.1336ms | 7.4857 KOps/s | 7.4053 KOps/s | $\color{#35bf28}+1.09\\%$ |
| test_tdmodule | 0.1050ms | 16.4703μs | 60.7155 KOps/s | 61.5580 KOps/s | $\color{#d91a1a}-1.37\\%$ |
| test_tdmodule_dispatch | 60.6240μs | 34.9726μs | 28.5938 KOps/s | 28.8049 KOps/s | $\color{#d91a1a}-0.73\\%$ |
| test_tdseq | 33.8630μs | 18.1545μs | 55.0827 KOps/s | 54.2143 KOps/s | $\color{#35bf28}+1.60\\%$ |
| test_tdseq_dispatch | 66.2550μs | 38.6691μs | 25.8605 KOps/s | 26.0538 KOps/s | $\color{#d91a1a}-0.74\\%$ |
| test_instantiation_functorch | 2.4046ms | 1.6278ms | 614.3112 Ops/s | 609.6228 Ops/s | $\color{#35bf28}+0.77\\%$ |
| test_instantiation_td | 1.9927ms | 1.1770ms | 849.5987 Ops/s | 841.3629 Ops/s | $\color{#35bf28}+0.98\\%$ |
| test_exec_functorch | 0.4204ms | 0.1766ms | 5.6612 KOps/s | 5.5206 KOps/s | $\color{#35bf28}+2.55\\%$ |
| test_exec_functional_call | 0.3089ms | 0.1702ms | 5.8770 KOps/s | 5.9096 KOps/s | $\color{#d91a1a}-0.55\\%$ |
| test_exec_td | 0.3076ms | 0.1734ms | 5.7680 KOps/s | 5.7812 KOps/s | $\color{#d91a1a}-0.23\\%$ |
| test_exec_td_decorator | 0.7794ms | 0.2267ms | 4.4103 KOps/s | 4.4262 KOps/s | $\color{#d91a1a}-0.36\\%$ |
| test_vmap_mlp_speed[True-True] | 0.8727ms | 0.5875ms | 1.7022 KOps/s | 1.7072 KOps/s | $\color{#d91a1a}-0.30\\%$ |
| test_vmap_mlp_speed[True-False] | 0.8727ms | 0.5839ms | 1.7127 KOps/s | 1.7249 KOps/s | $\color{#d91a1a}-0.71\\%$ |
| test_vmap_mlp_speed[False-True] | 0.6733ms | 0.4866ms | 2.0552 KOps/s | 2.0843 KOps/s | $\color{#d91a1a}-1.39\\%$ |
| test_vmap_mlp_speed[False-False] | 0.7616ms | 0.4883ms | 2.0480 KOps/s | 2.0838 KOps/s | $\color{#d91a1a}-1.72\\%$ |
| test_vmap_mlp_speed_decorator[True-True] | 1.4934ms | 0.6438ms | 1.5534 KOps/s | 1.5739 KOps/s | $\color{#d91a1a}-1.30\\%$ |
| test_vmap_mlp_speed_decorator[True-False] | 0.9530ms | 0.6475ms | 1.5443 KOps/s | 1.5812 KOps/s | $\color{#d91a1a}-2.33\\%$ |
| test_vmap_mlp_speed_decorator[False-True] | 0.9193ms | 0.5370ms | 1.8621 KOps/s | 1.8935 KOps/s | $\color{#d91a1a}-1.66\\%$ |
| test_vmap_mlp_speed_decorator[False-False] | 0.8013ms | 0.5344ms | 1.8713 KOps/s | 1.8974 KOps/s | $\color{#d91a1a}-1.38\\%$ |
| test_to_module_speed[True] | 2.1656ms | 1.3638ms | 733.2378 Ops/s | 755.3314 Ops/s | $\color{#d91a1a}-2.93\\%$ |
| test_to_module_speed[False] | 2.0045ms | 1.3172ms | 759.1867 Ops/s | 768.9459 Ops/s | $\color{#d91a1a}-1.27\\%$ |
| test_tc_init | 78.5380μs | 44.2864μs | 22.5803 KOps/s | 23.5105 KOps/s | $\color{#d91a1a}-3.96\\%$ |
| test_tc_init_nested | 0.1379ms | 92.5203μs | 10.8084 KOps/s | 11.5971 KOps/s | $\textbf{\color{#d91a1a}-6.80\\%}$ |
| test_tc_first_layer_tensor | 31.8300μs | 1.5099μs | 662.3045 KOps/s | 607.9684 KOps/s | $\textbf{\color{#35bf28}+8.94\\%}$ |
| test_tc_first_layer_nontensor | 31.2290μs | 4.3767μs | 228.4848 KOps/s | 223.7483 KOps/s | $\color{#35bf28}+2.12\\%$ |
| test_tc_second_layer_tensor | 30.5170μs | 2.8028μs | 356.7803 KOps/s | 343.9476 KOps/s | $\color{#35bf28}+3.73\\%$ |
| test_tc_second_layer_nontensor | 33.1720μs | 5.6858μs | 175.8771 KOps/s | 172.8712 KOps/s | $\color{#35bf28}+1.74\\%$ |
| test_unbind | 0.4575s | 13.9270ms | 71.8029 Ops/s | 72.7476 Ops/s | $\color{#d91a1a}-1.30\\%$ |
| test_full_like | 12.8842ms | 7.9694ms | 125.4795 Ops/s | 79.4343 Ops/s | $\textbf{\color{#35bf28}+57.97\\%}$ |
| test_zeros_like | 14.9293ms | 7.6830ms | 130.1583 Ops/s | 143.3534 Ops/s | $\textbf{\color{#d91a1a}-9.20\\%}$ |
| test_ones_like | 15.2471ms | 7.5854ms | 131.8330 Ops/s | 128.2279 Ops/s | $\color{#35bf28}+2.81\\%$ |
| test_clone | 16.6331ms | 9.4598ms | 105.7106 Ops/s | 105.0807 Ops/s | $\color{#35bf28}+0.60\\%$ |
| test_squeeze | 70.5830μs | 13.1774μs | 75.8873 KOps/s | 76.6942 KOps/s | $\color{#d91a1a}-1.05\\%$ |
| test_unsqueeze | 0.1771ms | 93.4534μs | 10.7005 KOps/s | 10.5072 KOps/s | $\color{#35bf28}+1.84\\%$ |
| test_split | 0.4891ms | 0.2020ms | 4.9513 KOps/s | 4.9153 KOps/s | $\color{#35bf28}+0.73\\%$ |
| test_permute | 0.3173ms | 0.2190ms | 4.5664 KOps/s | 4.4866 KOps/s | $\color{#35bf28}+1.78\\%$ |
| test_stack | 30.1809ms | 25.0769ms | 39.8773 Ops/s | 38.8947 Ops/s | $\color{#35bf28}+2.53\\%$ |
| test_cat | 32.7002ms | 25.1038ms | 39.8346 Ops/s | 38.8976 Ops/s | $\color{#35bf28}+2.41\\%$ |
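
The Change column appears to be the relative difference between the Ops value and the Ops on Repo `HEAD` value. The sketch below reproduces it for three rows copied from the table. The 5% cut-off used to label a row as improved or worsened is an assumption inferred from the data (the bolded entries all exceed 5% in magnitude, and their counts match the 13 improved and 17 worsened reported above); the function names are hypothetical and not part of the benchmark tooling.

```python
def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percentage change of the PR throughput relative to the HEAD throughput."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold: float = 5.0) -> str:
    # threshold=5.0 is an assumption inferred from which rows are bolded in the table.
    if change_pct > threshold:
        return "improved"
    if change_pct < -threshold:
        return "worsened"
    return "unchanged"


# (name, Ops, Ops on HEAD); both values of a row share the same unit.
rows = [
    ("test_plain_set_nested", 47.8160, 48.0711),
    ("test_values", 770.5033, 863.8743),
    ("test_full_like", 125.4795, 79.4343),
]

for name, ops_pr, ops_head in rows:
    change = relative_change(ops_pr, ops_head)
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

For these three rows the computed values agree with the table: -0.53%, -10.81% and +57.97%. Rows whose Ops values are shown with fewer significant digits (for example those in MOps/s) may differ from the reported change in the last digit because of rounding.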