[Feature] spec.is_empty(recurse) #2596

Merged 2 commits on Nov 24, 2024

[ghstack-poisoned]

pytorch-bot bot commented Nov 23, 2024

🔗 Helpful Links

🧪 See artifacts and rendered test results at hud.pytorch.org/pr/pytorch/rl/2596

Note: Links to docs will display an error until the docs builds have been completed.

❌ 17 New Failures, 1 Unrelated Failure

As of commit 20cda3e with merge base 152bc81:

NEW FAILURES - The following jobs have failed:

BROKEN TRUNK - The following job failed but was already failing on the merge base:

👉 Rebase onto the `viable/strict` branch to avoid these failures

This comment was automatically generated by Dr. CI and updates every 15 minutes.

@facebook-github-bot added the CLA Signed label on Nov 23, 2024
@vmoens merged commit 20cda3e into gh/vmoens/43/base on Nov 24, 2024
40 of 58 checks passed
@vmoens added a commit that referenced this pull request on Nov 24, 2024
ghstack-source-id: faa3b1df5133c77462d6dd013d3854d684cc7e94
Pull Request resolved: #2596
@vmoens deleted the gh/vmoens/43/head branch on November 24, 2024 08:16

$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of CPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}11$. Worsened: $\large\color{#d91a1a}11$.

Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.4300s 0.4286s 2.3330 Ops/s 2.2371 Ops/s $\color{#35bf28}+4.29\%$
test_transformed 0.6134s 0.6070s 1.6475 Ops/s 1.6387 Ops/s $\color{#35bf28}+0.54\%$
test_serial 1.3443s 1.3417s 0.7453 Ops/s 0.7242 Ops/s $\color{#35bf28}+2.93\%$
test_parallel 1.2692s 1.2620s 0.7924 Ops/s 0.7693 Ops/s $\color{#35bf28}+3.01\%$
test_step_mdp_speed[True-True-True-True-True] 73.0770μs 27.1423μs 36.8428 KOps/s 35.8039 KOps/s $\color{#35bf28}+2.90\%$
test_step_mdp_speed[True-True-True-True-False] 83.1260μs 15.9079μs 62.8620 KOps/s 62.4235 KOps/s $\color{#35bf28}+0.70\%$
test_step_mdp_speed[True-True-True-False-True] 48.3910μs 15.8511μs 63.0870 KOps/s 62.6555 KOps/s $\color{#35bf28}+0.69\%$
test_step_mdp_speed[True-True-True-False-False] 54.5620μs 9.1673μs 109.0837 KOps/s 108.2025 KOps/s $\color{#35bf28}+0.81\%$
test_step_mdp_speed[True-True-False-True-True] 98.0740μs 29.8507μs 33.5000 KOps/s 33.1241 KOps/s $\color{#35bf28}+1.13\%$
test_step_mdp_speed[True-True-False-True-False] 83.1860μs 17.5932μs 56.8402 KOps/s 55.3631 KOps/s $\color{#35bf28}+2.67\%$
test_step_mdp_speed[True-True-False-False-True] 57.9290μs 17.4964μs 57.1545 KOps/s 56.7087 KOps/s $\color{#35bf28}+0.79\%$
test_step_mdp_speed[True-True-False-False-False] 64.7620μs 10.8482μs 92.1810 KOps/s 92.2884 KOps/s $\color{#d91a1a}-0.12\%$
test_step_mdp_speed[True-False-True-True-True] 87.3240μs 30.8709μs 32.3930 KOps/s 31.7727 KOps/s $\color{#35bf28}+1.95\%$
test_step_mdp_speed[True-False-True-True-False] 83.5060μs 19.3153μs 51.7723 KOps/s 51.0114 KOps/s $\color{#35bf28}+1.49\%$
test_step_mdp_speed[True-False-True-False-True] 0.1007ms 17.4833μs 57.1975 KOps/s 56.8025 KOps/s $\color{#35bf28}+0.70\%$
test_step_mdp_speed[True-False-True-False-False] 79.5200μs 10.8997μs 91.7460 KOps/s 93.2607 KOps/s $\color{#d91a1a}-1.62\%$
test_step_mdp_speed[True-False-False-True-True] 65.6030μs 32.2679μs 30.9905 KOps/s 29.6305 KOps/s $\color{#35bf28}+4.59\%$
test_step_mdp_speed[True-False-False-True-False] 73.1780μs 20.7024μs 48.3035 KOps/s 46.9990 KOps/s $\color{#35bf28}+2.78\%$
test_step_mdp_speed[True-False-False-False-True] 62.4570μs 18.9315μs 52.8221 KOps/s 51.4531 KOps/s $\color{#35bf28}+2.66\%$
test_step_mdp_speed[True-False-False-False-False] 39.1730μs 12.3162μs 81.1939 KOps/s 79.9556 KOps/s $\color{#35bf28}+1.55\%$
test_step_mdp_speed[False-True-True-True-True] 94.6200μs 30.9853μs 32.2734 KOps/s 31.4265 KOps/s $\color{#35bf28}+2.69\%$
test_step_mdp_speed[False-True-True-True-False] 52.5390μs 19.2581μs 51.9263 KOps/s 50.8068 KOps/s $\color{#35bf28}+2.20\%$
test_step_mdp_speed[False-True-True-False-True] 87.2440μs 19.8495μs 50.3791 KOps/s 49.1566 KOps/s $\color{#35bf28}+2.49\%$
test_step_mdp_speed[False-True-True-False-False] 51.9880μs 12.0088μs 83.2721 KOps/s 81.6104 KOps/s $\color{#35bf28}+2.04\%$
test_step_mdp_speed[False-True-False-True-True] 0.1019ms 32.4158μs 30.8491 KOps/s 29.7069 KOps/s $\color{#35bf28}+3.84\%$
test_step_mdp_speed[False-True-False-True-False] 82.0320μs 20.7996μs 48.0778 KOps/s 46.8809 KOps/s $\color{#35bf28}+2.55\%$
test_step_mdp_speed[False-True-False-False-True] 3.2246ms 21.3062μs 46.9348 KOps/s 45.3807 KOps/s $\color{#35bf28}+3.42\%$
test_step_mdp_speed[False-True-False-False-False] 73.6590μs 13.6459μs 73.2822 KOps/s 71.7480 KOps/s $\color{#35bf28}+2.14\%$
test_step_mdp_speed[False-False-True-True-True] 76.6540μs 33.8940μs 29.5037 KOps/s 28.2138 KOps/s $\color{#35bf28}+4.57\%$
test_step_mdp_speed[False-False-True-True-False] 83.1560μs 22.1738μs 45.0984 KOps/s 43.0597 KOps/s $\color{#35bf28}+4.73\%$
test_step_mdp_speed[False-False-True-False-True] 56.0050μs 21.2725μs 47.0091 KOps/s 45.5442 KOps/s $\color{#35bf28}+3.22\%$
test_step_mdp_speed[False-False-True-False-False] 72.3660μs 13.4669μs 74.2560 KOps/s 72.1829 KOps/s $\color{#35bf28}+2.87\%$
test_step_mdp_speed[False-False-False-True-True] 71.3840μs 34.8344μs 28.7072 KOps/s 27.1991 KOps/s $\textbf{\color{#35bf28}+5.54\%}$
test_step_mdp_speed[False-False-False-True-False] 72.0550μs 23.6269μs 42.3247 KOps/s 40.5528 KOps/s $\color{#35bf28}+4.37\%$
test_step_mdp_speed[False-False-False-False-True] 80.9720μs 22.4911μs 44.4620 KOps/s 42.8042 KOps/s $\color{#35bf28}+3.87\%$
test_step_mdp_speed[False-False-False-False-False] 50.1740μs 14.8652μs 67.2713 KOps/s 65.0808 KOps/s $\color{#35bf28}+3.37\%$
test_values[generalized_advantage_estimate-True-True] 10.4036ms 10.1612ms 98.4139 Ops/s 101.4015 Ops/s $\color{#d91a1a}-2.95\%$
test_values[vec_generalized_advantage_estimate-True-True] 35.5963ms 33.6132ms 29.7502 Ops/s 26.7489 Ops/s $\textbf{\color{#35bf28}+11.22\%}$
test_values[td0_return_estimate-False-False] 0.2424ms 0.1972ms 5.0711 KOps/s 5.4238 KOps/s $\textbf{\color{#d91a1a}-6.50\%}$
test_values[td1_return_estimate-False-False] 29.2092ms 25.3396ms 39.4639 Ops/s 40.2811 Ops/s $\color{#d91a1a}-2.03\%$
test_values[vec_td1_return_estimate-False-False] 35.9565ms 33.7888ms 29.5956 Ops/s 27.8719 Ops/s $\textbf{\color{#35bf28}+6.18\%}$
test_values[td_lambda_return_estimate-True-False] 36.8527ms 36.1661ms 27.6502 Ops/s 27.8295 Ops/s $\color{#d91a1a}-0.64\%$
test_values[vec_td_lambda_return_estimate-True-False] 35.3464ms 33.6182ms 29.7458 Ops/s 28.0504 Ops/s $\textbf{\color{#35bf28}+6.04\%}$
test_gae_speed[generalized_advantage_estimate-False-1-512] 11.0590ms 8.8580ms 112.8926 Ops/s 116.2679 Ops/s $\color{#d91a1a}-2.90\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 2.3898ms 1.9404ms 515.3556 Ops/s 541.3834 Ops/s $\color{#d91a1a}-4.81\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.4552ms 0.3642ms 2.7456 KOps/s 2.7324 KOps/s $\color{#35bf28}+0.48\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 45.5793ms 43.2037ms 23.1461 Ops/s 22.2142 Ops/s $\color{#35bf28}+4.20\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 3.7781ms 3.0640ms 326.3757 Ops/s 328.3279 Ops/s $\color{#d91a1a}-0.59\%$
test_dqn_speed[False-None] 1.9423ms 1.3497ms 740.9056 Ops/s 731.1240 Ops/s $\color{#35bf28}+1.34\%$
test_dqn_speed[False-backward] 1.9183ms 1.8187ms 549.8319 Ops/s 535.7167 Ops/s $\color{#35bf28}+2.63\%$
test_dqn_speed[True-None] 0.8318ms 0.4710ms 2.1229 KOps/s 2.1152 KOps/s $\color{#35bf28}+0.37\%$
test_dqn_speed[True-backward] 1.0554ms 0.9093ms 1.0998 KOps/s 858.7738 Ops/s $\textbf{\color{#35bf28}+28.07\%}$
test_dqn_speed[reduce-overhead-None] 0.7365ms 0.4711ms 2.1225 KOps/s 2.1430 KOps/s $\color{#d91a1a}-0.96\%$
test_dqn_speed[reduce-overhead-backward] 0.9727ms 0.9013ms 1.1095 KOps/s 1.1102 KOps/s $\color{#d91a1a}-0.06\%$
test_ddpg_speed[False-None] 3.6113ms 2.8086ms 356.0475 Ops/s 349.6408 Ops/s $\color{#35bf28}+1.83\%$
test_ddpg_speed[False-backward] 4.1025ms 3.9354ms 254.1049 Ops/s 247.8475 Ops/s $\color{#35bf28}+2.52\%$
test_ddpg_speed[True-None] 1.4702ms 1.0030ms 997.0141 Ops/s 990.6864 Ops/s $\color{#35bf28}+0.64\%$
test_ddpg_speed[True-backward] 2.1468ms 1.9333ms 517.2613 Ops/s 448.2191 Ops/s $\textbf{\color{#35bf28}+15.40\%}$
test_ddpg_speed[reduce-overhead-None] 1.5691ms 1.0115ms 988.6446 Ops/s 984.2466 Ops/s $\color{#35bf28}+0.45\%$
test_ddpg_speed[reduce-overhead-backward] 2.2793ms 1.9799ms 505.0839 Ops/s 518.2422 Ops/s $\color{#d91a1a}-2.54\%$
test_sac_speed[False-None] 12.2113ms 8.3031ms 120.4364 Ops/s 125.3623 Ops/s $\color{#d91a1a}-3.93\%$
test_sac_speed[False-backward] 12.2596ms 11.0570ms 90.4403 Ops/s 93.2018 Ops/s $\color{#d91a1a}-2.96\%$
test_sac_speed[True-None] 2.6633ms 1.9426ms 514.7837 Ops/s 526.3495 Ops/s $\color{#d91a1a}-2.20\%$
test_sac_speed[True-backward] 4.1089ms 3.8437ms 260.1660 Ops/s 279.1433 Ops/s $\textbf{\color{#d91a1a}-6.80\%}$
test_sac_speed[reduce-overhead-None] 2.3748ms 1.9675ms 508.2714 Ops/s 526.1481 Ops/s $\color{#d91a1a}-3.40\%$
test_sac_speed[reduce-overhead-backward] 4.7044ms 3.8685ms 258.5007 Ops/s 274.0379 Ops/s $\textbf{\color{#d91a1a}-5.67\%}$
test_redq_speed[False-None] 15.4777ms 13.2318ms 75.5752 Ops/s 75.8912 Ops/s $\color{#d91a1a}-0.42\%$
test_redq_speed[False-backward] 24.1070ms 22.9910ms 43.4952 Ops/s 44.1353 Ops/s $\color{#d91a1a}-1.45\%$
test_redq_speed[True-None] 5.8390ms 5.0988ms 196.1240 Ops/s 207.7086 Ops/s $\textbf{\color{#d91a1a}-5.58\%}$
test_redq_speed[True-backward] 13.7417ms 12.0598ms 82.9199 Ops/s 81.7597 Ops/s $\color{#35bf28}+1.42\%$
test_redq_speed[reduce-overhead-None] 5.6340ms 4.7645ms 209.8860 Ops/s 206.3431 Ops/s $\color{#35bf28}+1.72\%$
test_redq_speed[reduce-overhead-backward] 13.2759ms 12.5563ms 79.6413 Ops/s 80.9986 Ops/s $\color{#d91a1a}-1.68\%$
test_redq_deprec_speed[False-None] 13.2258ms 12.4781ms 80.1405 Ops/s 76.4456 Ops/s $\color{#35bf28}+4.83\%$
test_redq_deprec_speed[False-backward] 28.5186ms 19.0505ms 52.4920 Ops/s 53.4353 Ops/s $\color{#d91a1a}-1.77\%$
test_redq_deprec_speed[True-None] 4.4821ms 3.8169ms 261.9953 Ops/s 267.6730 Ops/s $\color{#d91a1a}-2.12\%$
test_redq_deprec_speed[True-backward] 8.7801ms 8.3256ms 120.1120 Ops/s 116.4242 Ops/s $\color{#35bf28}+3.17\%$
test_redq_deprec_speed[reduce-overhead-None] 4.2974ms 3.6201ms 276.2335 Ops/s 267.9940 Ops/s $\color{#35bf28}+3.07\%$
test_redq_deprec_speed[reduce-overhead-backward] 9.1786ms 8.5762ms 116.6011 Ops/s 123.0184 Ops/s $\textbf{\color{#d91a1a}-5.22\%}$
test_td3_speed[False-None] 8.7166ms 8.0445ms 124.3078 Ops/s 127.6186 Ops/s $\color{#d91a1a}-2.59\%$
test_td3_speed[False-backward] 10.4409ms 10.1504ms 98.5185 Ops/s 97.0683 Ops/s $\color{#35bf28}+1.49\%$
test_td3_speed[True-None] 1.8825ms 1.7633ms 567.1205 Ops/s 567.5085 Ops/s $\color{#d91a1a}-0.07\%$
test_td3_speed[True-backward] 3.6470ms 3.4178ms 292.5851 Ops/s 266.0877 Ops/s $\textbf{\color{#35bf28}+9.96\%}$
test_td3_speed[reduce-overhead-None] 2.0376ms 1.7559ms 569.5105 Ops/s 569.8572 Ops/s $\color{#d91a1a}-0.06\%$
test_td3_speed[reduce-overhead-backward] 3.6109ms 3.3848ms 295.4367 Ops/s 290.1853 Ops/s $\color{#35bf28}+1.81\%$
test_cql_speed[False-None] 39.3154ms 35.9586ms 27.8097 Ops/s 27.3616 Ops/s $\color{#35bf28}+1.64\%$
test_cql_speed[False-backward] 49.6202ms 47.0136ms 21.2704 Ops/s 21.7865 Ops/s $\color{#d91a1a}-2.37\%$
test_cql_speed[True-None] 17.4548ms 15.6715ms 63.8099 Ops/s 63.4786 Ops/s $\color{#35bf28}+0.52\%$
test_cql_speed[True-backward] 23.2013ms 22.1758ms 45.0941 Ops/s 43.1734 Ops/s $\color{#35bf28}+4.45\%$
test_cql_speed[reduce-overhead-None] 17.6156ms 15.7880ms 63.3392 Ops/s 62.0250 Ops/s $\color{#35bf28}+2.12\%$
test_cql_speed[reduce-overhead-backward] 27.1138ms 23.1815ms 43.1379 Ops/s 43.8617 Ops/s $\color{#d91a1a}-1.65\%$
test_a2c_speed[False-None] 9.2577ms 7.2238ms 138.4318 Ops/s 140.0127 Ops/s $\color{#d91a1a}-1.13\%$
test_a2c_speed[False-backward] 15.9731ms 14.6377ms 68.3167 Ops/s 69.9329 Ops/s $\color{#d91a1a}-2.31\%$
test_a2c_speed[True-None] 4.7209ms 4.2848ms 233.3826 Ops/s 232.4374 Ops/s $\color{#35bf28}+0.41\%$
test_a2c_speed[True-backward] 12.0062ms 11.0109ms 90.8188 Ops/s 93.9824 Ops/s $\color{#d91a1a}-3.37\%$
test_a2c_speed[reduce-overhead-None] 4.7284ms 4.3441ms 230.1998 Ops/s 235.6404 Ops/s $\color{#d91a1a}-2.31\%$
test_a2c_speed[reduce-overhead-backward] 11.6899ms 10.8411ms 92.2419 Ops/s 93.3737 Ops/s $\color{#d91a1a}-1.21\%$
test_ppo_speed[False-None] 9.1710ms 7.5485ms 132.4767 Ops/s 135.6835 Ops/s $\color{#d91a1a}-2.36\%$
test_ppo_speed[False-backward] 17.4865ms 15.0236ms 66.5618 Ops/s 68.2634 Ops/s $\color{#d91a1a}-2.49\%$
test_ppo_speed[True-None] 4.1380ms 3.7721ms 265.1051 Ops/s 267.4194 Ops/s $\color{#d91a1a}-0.87\%$
test_ppo_speed[True-backward] 10.8664ms 9.6788ms 103.3188 Ops/s 105.3543 Ops/s $\color{#d91a1a}-1.93\%$
test_ppo_speed[reduce-overhead-None] 4.3698ms 3.7541ms 266.3738 Ops/s 264.4556 Ops/s $\color{#35bf28}+0.73\%$
test_ppo_speed[reduce-overhead-backward] 10.8962ms 9.7484ms 102.5809 Ops/s 104.3075 Ops/s $\color{#d91a1a}-1.66\%$
test_reinforce_speed[False-None] 7.7228ms 6.5975ms 151.5717 Ops/s 154.2849 Ops/s $\color{#d91a1a}-1.76\%$
test_reinforce_speed[False-backward] 10.2477ms 9.8495ms 101.5282 Ops/s 103.4426 Ops/s $\color{#d91a1a}-1.85\%$
test_reinforce_speed[True-None] 3.3290ms 2.6914ms 371.5549 Ops/s 372.9364 Ops/s $\color{#d91a1a}-0.37\%$
test_reinforce_speed[True-backward] 9.2542ms 8.7677ms 114.0547 Ops/s 116.4235 Ops/s $\color{#d91a1a}-2.03\%$
test_reinforce_speed[reduce-overhead-None] 3.8775ms 2.7524ms 363.3234 Ops/s 372.2374 Ops/s $\color{#d91a1a}-2.39\%$
test_reinforce_speed[reduce-overhead-backward] 9.8762ms 8.8914ms 112.4687 Ops/s 116.2996 Ops/s $\color{#d91a1a}-3.29\%$
test_iql_speed[False-None] 39.8708ms 33.6889ms 29.6833 Ops/s 31.4383 Ops/s $\textbf{\color{#d91a1a}-5.58\%}$
test_iql_speed[False-backward] 67.1864ms 46.6739ms 21.4252 Ops/s 22.5598 Ops/s $\textbf{\color{#d91a1a}-5.03\%}$
test_iql_speed[True-None] 12.4450ms 11.1515ms 89.6741 Ops/s 90.1259 Ops/s $\color{#d91a1a}-0.50\%$
test_iql_speed[True-backward] 24.2733ms 22.5695ms 44.3075 Ops/s 43.8918 Ops/s $\color{#35bf28}+0.95\%$
test_iql_speed[reduce-overhead-None] 12.0030ms 11.0778ms 90.2705 Ops/s 90.4567 Ops/s $\color{#d91a1a}-0.21\%$
test_iql_speed[reduce-overhead-backward] 23.9670ms 22.1036ms 45.2415 Ops/s 44.3015 Ops/s $\color{#35bf28}+2.12\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 5.3067ms 5.0114ms 199.5437 Ops/s 190.8506 Ops/s $\color{#35bf28}+4.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 1.1398ms 0.5181ms 1.9303 KOps/s 1.9028 KOps/s $\color{#35bf28}+1.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.8026ms 0.4990ms 2.0040 KOps/s 1.9971 KOps/s $\color{#35bf28}+0.34\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.6476ms 4.7905ms 208.7445 Ops/s 201.1124 Ops/s $\color{#35bf28}+3.79\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 3.6914ms 0.5050ms 1.9800 KOps/s 1.9532 KOps/s $\color{#35bf28}+1.37\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7646ms 0.4818ms 2.0756 KOps/s 2.0066 KOps/s $\color{#35bf28}+3.44\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 2.1479ms 1.6514ms 605.5294 Ops/s 587.1885 Ops/s $\color{#35bf28}+3.12\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 2.3527ms 1.6074ms 622.1135 Ops/s 624.5325 Ops/s $\color{#d91a1a}-0.39\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 5.0756ms 4.8138ms 207.7340 Ops/s 198.2189 Ops/s $\color{#35bf28}+4.80\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.1539ms 0.6564ms 1.5236 KOps/s 1.5198 KOps/s $\color{#35bf28}+0.25\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8596ms 0.6306ms 1.5857 KOps/s 1.5773 KOps/s $\color{#35bf28}+0.53\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 4.9905ms 4.6943ms 213.0241 Ops/s 195.5696 Ops/s $\textbf{\color{#35bf28}+8.92\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8669ms 0.5135ms 1.9476 KOps/s 1.8460 KOps/s $\textbf{\color{#35bf28}+5.50\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.7423ms 0.4876ms 2.0507 KOps/s 1.9578 KOps/s $\color{#35bf28}+4.74\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 7.4483ms 4.7661ms 209.8137 Ops/s 198.5058 Ops/s $\textbf{\color{#35bf28}+5.70\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.4706s 1.1450ms 873.3453 Ops/s 1.8944 KOps/s $\textbf{\color{#d91a1a}-53.90\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.7346ms 0.4867ms 2.0545 KOps/s 1.9902 KOps/s $\color{#35bf28}+3.23\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 7.7247ms 4.9085ms 203.7300 Ops/s 194.4081 Ops/s $\color{#35bf28}+4.80\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 2.8189ms 0.6520ms 1.5337 KOps/s 1.5091 KOps/s $\color{#35bf28}+1.63\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.8508ms 0.6234ms 1.6041 KOps/s 1.5944 KOps/s $\color{#35bf28}+0.61\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 7.9243ms 4.3224ms 231.3549 Ops/s 235.9733 Ops/s $\color{#d91a1a}-1.96\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 8.7389ms 2.3506ms 425.4185 Ops/s 424.6092 Ops/s $\color{#35bf28}+0.19\%$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 4.0864ms 1.3301ms 751.8015 Ops/s 789.0513 Ops/s $\color{#d91a1a}-4.72\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4406s 12.9147ms 77.4313 Ops/s 248.9045 Ops/s $\textbf{\color{#d91a1a}-68.89\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 8.4059ms 2.3960ms 417.3617 Ops/s 446.5427 Ops/s $\textbf{\color{#d91a1a}-6.53\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 7.4097ms 1.3520ms 739.6593 Ops/s 753.6408 Ops/s $\color{#d91a1a}-1.86\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 5.8234ms 4.3639ms 229.1552 Ops/s 249.5571 Ops/s $\textbf{\color{#d91a1a}-8.18\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 6.5744ms 2.4210ms 413.0495 Ops/s 384.8982 Ops/s $\textbf{\color{#35bf28}+7.31\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 6.2463ms 1.4762ms 677.4254 Ops/s 649.7357 Ops/s $\color{#35bf28}+4.26\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 12.3681ms 11.2995ms 88.4998 Ops/s 88.3308 Ops/s $\color{#35bf28}+0.19\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 15.1740ms 14.5217ms 68.8626 Ops/s 67.6868 Ops/s $\color{#35bf28}+1.74\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 21.9564ms 19.9422ms 50.1450 Ops/s 49.4223 Ops/s $\color{#35bf28}+1.46\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 14.7704ms 14.5446ms 68.7539 Ops/s 66.9007 Ops/s $\color{#35bf28}+2.77\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 20.4570ms 19.9776ms 50.0561 Ops/s 48.5561 Ops/s $\color{#35bf28}+3.09\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 19.1952ms 16.1831ms 61.7928 Ops/s 60.9840 Ops/s $\color{#35bf28}+1.33\%$
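For readers checking the Change column, the values in the table are consistent with a plain relative difference between the Ops column and the Ops on Repo HEAD column, after converting both to the same unit. The sketch below reproduces three rows from the CPU table. The function names are hypothetical, and the 5% cut-off for counting a benchmark as improved or worsened is an assumption inferred from the bolded rows matching the summary counts; it is not taken from the benchmark tooling itself.

```python
# Hypothetical helpers for re-deriving the "Change" column of the tables above.
UNIT_SCALE = {"Ops/s": 1.0, "KOps/s": 1e3}


def to_ops_per_second(value: float, unit: str) -> float:
    """Convert a table entry such as (1.0998, 'KOps/s') to plain Ops/s."""
    return value * UNIT_SCALE[unit]


def relative_change(ops_pr: float, ops_head: float) -> float:
    """Percentage change of the PR throughput relative to repo HEAD."""
    return (ops_pr - ops_head) / ops_head * 100.0


def classify(change_pct: float, threshold_pct: float = 5.0) -> str:
    """Label a change; the 5% threshold is an assumption, not a documented value."""
    if change_pct >= threshold_pct:
        return "improved"
    if change_pct <= -threshold_pct:
        return "worsened"
    return "unchanged"


# Rows copied from the CPU table: (name, ops, unit, ops_head, unit_head)
rows = [
    ("test_simple", 2.3330, "Ops/s", 2.2371, "Ops/s"),
    ("test_dqn_speed[True-backward]", 1.0998, "KOps/s", 858.7738, "Ops/s"),
    (
        "test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-"
        "SamplerWithoutReplacement-10000]",
        873.3453, "Ops/s", 1.8944, "KOps/s",
    ),
]

results = {}
for name, ops, unit, ops_head, unit_head in rows:
    change = relative_change(
        to_ops_per_second(ops, unit), to_ops_per_second(ops_head, unit_head)
    )
    results[name] = (change, classify(change))
    print(f"{name}: {change:+.2f}% ({classify(change)})")
```

Note that the second and third rows mix Ops/s and KOps/s between the two columns, so the unit conversion has to happen before the division.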


$\color{#D29922}\textsf{\Large⚠\kern{0.2cm}\normalsize Warning}$ Result of GPU Benchmark Tests

Total Benchmarks: 149. Improved: $\large\color{#35bf28}27$. Worsened: $\large\color{#d91a1a}8$.

Name Max Mean Ops Ops on Repo HEAD Change
test_simple 0.7433s 0.7429s 1.3461 Ops/s 1.3188 Ops/s $\color{#35bf28}+2.07\%$
test_transformed 0.9902s 0.9894s 1.0107 Ops/s 0.9923 Ops/s $\color{#35bf28}+1.85\%$
test_serial 2.1436s 2.1423s 0.4668 Ops/s 0.4600 Ops/s $\color{#35bf28}+1.48\%$
test_parallel 2.0924s 1.9950s 0.5012 Ops/s 0.5136 Ops/s $\color{#d91a1a}-2.40\%$
test_step_mdp_speed[True-True-True-True-True] 0.1928ms 35.6011μs 28.0890 KOps/s 28.3398 KOps/s $\color{#d91a1a}-0.89\%$
test_step_mdp_speed[True-True-True-True-False] 54.0310μs 20.5937μs 48.5585 KOps/s 49.4883 KOps/s $\color{#d91a1a}-1.88\%$
test_step_mdp_speed[True-True-True-False-True] 59.3810μs 20.1561μs 49.6129 KOps/s 51.2085 KOps/s $\color{#d91a1a}-3.12\%$
test_step_mdp_speed[True-True-True-False-False] 42.0310μs 11.6992μs 85.4763 KOps/s 86.4855 KOps/s $\color{#d91a1a}-1.17\%$
test_step_mdp_speed[True-True-False-True-True] 69.7420μs 38.2469μs 26.1459 KOps/s 26.5723 KOps/s $\color{#d91a1a}-1.60\%$
test_step_mdp_speed[True-True-False-True-False] 67.9110μs 22.8188μs 43.8236 KOps/s 44.5658 KOps/s $\color{#d91a1a}-1.67\%$
test_step_mdp_speed[True-True-False-False-True] 48.4310μs 22.0738μs 45.3025 KOps/s 45.0677 KOps/s $\color{#35bf28}+0.52\%$
test_step_mdp_speed[True-True-False-False-False] 42.8610μs 13.8018μs 72.4543 KOps/s 74.4176 KOps/s $\color{#d91a1a}-2.64\%$
test_step_mdp_speed[True-False-True-True-True] 74.1210μs 40.7918μs 24.5147 KOps/s 24.7840 KOps/s $\color{#d91a1a}-1.09\%$
test_step_mdp_speed[True-False-True-True-False] 62.1810μs 25.0005μs 39.9992 KOps/s 40.9447 KOps/s $\color{#d91a1a}-2.31\%$
test_step_mdp_speed[True-False-True-False-True] 65.9010μs 22.3665μs 44.7097 KOps/s 45.1634 KOps/s $\color{#d91a1a}-1.00\%$
test_step_mdp_speed[True-False-True-False-False] 54.0410μs 13.9126μs 71.8772 KOps/s 73.1908 KOps/s $\color{#d91a1a}-1.79\%$
test_step_mdp_speed[True-False-False-True-True] 87.7510μs 42.6531μs 23.4450 KOps/s 23.4685 KOps/s $\color{#d91a1a}-0.10\%$
test_step_mdp_speed[True-False-False-True-False] 57.7410μs 27.0197μs 37.0100 KOps/s 37.5769 KOps/s $\color{#d91a1a}-1.51\%$
test_step_mdp_speed[True-False-False-False-True] 62.8910μs 24.0825μs 41.5240 KOps/s 43.0738 KOps/s $\color{#d91a1a}-3.60\%$
test_step_mdp_speed[True-False-False-False-False] 42.5810μs 15.8712μs 63.0073 KOps/s 63.8210 KOps/s $\color{#d91a1a}-1.27\%$
test_step_mdp_speed[False-True-True-True-True] 71.0410μs 40.3971μs 24.7543 KOps/s 24.5607 KOps/s $\color{#35bf28}+0.79\%$
test_step_mdp_speed[False-True-True-True-False] 57.1310μs 24.8680μs 40.2122 KOps/s 40.9849 KOps/s $\color{#d91a1a}-1.89\%$
test_step_mdp_speed[False-True-True-False-True] 56.9810μs 26.0908μs 38.3277 KOps/s 38.6806 KOps/s $\color{#d91a1a}-0.91\%$
test_step_mdp_speed[False-True-True-False-False] 46.1310μs 15.4754μs 64.6187 KOps/s 65.8252 KOps/s $\color{#d91a1a}-1.83\%$
test_step_mdp_speed[False-True-False-True-True] 75.9610μs 42.1374μs 23.7319 KOps/s 23.3617 KOps/s $\color{#35bf28}+1.58\%$
test_step_mdp_speed[False-True-False-True-False] 60.9110μs 26.8103μs 37.2990 KOps/s 38.1932 KOps/s $\color{#d91a1a}-2.34\%$
test_step_mdp_speed[False-True-False-False-True] 3.5004ms 27.9716μs 35.7506 KOps/s 36.5095 KOps/s $\color{#d91a1a}-2.08\%$
test_step_mdp_speed[False-True-False-False-False] 48.5210μs 17.5432μs 57.0021 KOps/s 58.2748 KOps/s $\color{#d91a1a}-2.18\%$
test_step_mdp_speed[False-False-True-True-True] 77.0110μs 44.5849μs 22.4291 KOps/s 22.8836 KOps/s $\color{#d91a1a}-1.99\%$
test_step_mdp_speed[False-False-True-True-False] 66.0310μs 29.0954μs 34.3697 KOps/s 34.9149 KOps/s $\color{#d91a1a}-1.56\%$
test_step_mdp_speed[False-False-True-False-True] 61.9010μs 28.1075μs 35.5778 KOps/s 36.5993 KOps/s $\color{#d91a1a}-2.79\%$
test_step_mdp_speed[False-False-True-False-False] 49.8610μs 17.7221μs 56.4267 KOps/s 58.7342 KOps/s $\color{#d91a1a}-3.93\%$
test_step_mdp_speed[False-False-False-True-True] 85.7520μs 46.2342μs 21.6290 KOps/s 21.9876 KOps/s $\color{#d91a1a}-1.63\%$
test_step_mdp_speed[False-False-False-True-False] 69.6210μs 30.9642μs 32.2954 KOps/s 33.3576 KOps/s $\color{#d91a1a}-3.18\%$
test_step_mdp_speed[False-False-False-False-True] 76.1810μs 29.8184μs 33.5364 KOps/s 35.4413 KOps/s $\textbf{\color{#d91a1a}-5.37\%}$
test_step_mdp_speed[False-False-False-False-False] 47.4110μs 19.3915μs 51.5689 KOps/s 53.6245 KOps/s $\color{#d91a1a}-3.83\%$
test_values[generalized_advantage_estimate-True-True] 25.7622ms 25.2539ms 39.5979 Ops/s 38.2661 Ops/s $\color{#35bf28}+3.48\%$
test_values[vec_generalized_advantage_estimate-True-True] 0.1052s 3.0070ms 332.5578 Ops/s 339.2700 Ops/s $\color{#d91a1a}-1.98\%$
test_values[td0_return_estimate-False-False] 0.1063ms 80.1120μs 12.4825 KOps/s 12.2985 KOps/s $\color{#35bf28}+1.50\%$
test_values[td1_return_estimate-False-False] 56.3214ms 56.0174ms 17.8516 Ops/s 16.6520 Ops/s $\textbf{\color{#35bf28}+7.20\%}$
test_values[vec_td1_return_estimate-False-False] 1.3717ms 1.0933ms 914.6572 Ops/s 905.8207 Ops/s $\color{#35bf28}+0.98\%$
test_values[td_lambda_return_estimate-True-False] 88.7552ms 88.3644ms 11.3168 Ops/s 10.3255 Ops/s $\textbf{\color{#35bf28}+9.60\%}$
test_values[vec_td_lambda_return_estimate-True-False] 1.4169ms 1.0897ms 917.7147 Ops/s 905.0033 Ops/s $\color{#35bf28}+1.40\%$
test_gae_speed[generalized_advantage_estimate-False-1-512] 25.2303ms 25.1100ms 39.8247 Ops/s 38.5693 Ops/s $\color{#35bf28}+3.25\%$
test_gae_speed[vec_generalized_advantage_estimate-True-1-512] 1.0414ms 0.7545ms 1.3254 KOps/s 1.3072 KOps/s $\color{#35bf28}+1.39\%$
test_gae_speed[vec_generalized_advantage_estimate-False-1-512] 0.7644ms 0.6724ms 1.4872 KOps/s 1.4588 KOps/s $\color{#35bf28}+1.95\%$
test_gae_speed[vec_generalized_advantage_estimate-True-32-512] 1.5195ms 1.4810ms 675.1996 Ops/s 670.3008 Ops/s $\color{#35bf28}+0.73\%$
test_gae_speed[vec_generalized_advantage_estimate-False-32-512] 0.7315ms 0.6890ms 1.4513 KOps/s 1.4272 KOps/s $\color{#35bf28}+1.69\%$
test_dqn_speed[False-None] 7.2628ms 1.4461ms 691.5142 Ops/s 692.5934 Ops/s $\color{#d91a1a}-0.16\%$
test_dqn_speed[False-backward] 2.1432ms 2.0387ms 490.5117 Ops/s 486.3877 Ops/s $\color{#35bf28}+0.85\%$
test_dqn_speed[True-None] 0.7344ms 0.5339ms 1.8731 KOps/s 1.8542 KOps/s $\color{#35bf28}+1.02\%$
test_dqn_speed[True-backward] 1.2545ms 1.1990ms 834.0550 Ops/s 897.4166 Ops/s $\textbf{\color{#d91a1a}-7.06\%}$
test_dqn_speed[reduce-overhead-None] 0.6216ms 0.5477ms 1.8257 KOps/s 1.7423 KOps/s $\color{#35bf28}+4.78\%$
test_dqn_speed[reduce-overhead-backward] 1.1109ms 1.0636ms 940.2341 Ops/s 1.0389 KOps/s $\textbf{\color{#d91a1a}-9.49\%}$
test_ddpg_speed[False-None] 3.0286ms 2.6944ms 371.1354 Ops/s 366.6090 Ops/s $\color{#35bf28}+1.23\%$
test_ddpg_speed[False-backward] 4.4527ms 4.0221ms 248.6287 Ops/s 252.9814 Ops/s $\color{#d91a1a}-1.72\%$
test_ddpg_speed[True-None] 1.1320ms 1.0849ms 921.7561 Ops/s 915.9997 Ops/s $\color{#35bf28}+0.63\%$
test_ddpg_speed[True-backward] 2.3578ms 2.3075ms 433.3656 Ops/s 457.6015 Ops/s $\textbf{\color{#d91a1a}-5.30\%}$
test_ddpg_speed[reduce-overhead-None] 1.1355ms 1.0832ms 923.1709 Ops/s 877.2352 Ops/s $\textbf{\color{#35bf28}+5.24\%}$
test_ddpg_speed[reduce-overhead-backward] 1.8325ms 1.7748ms 563.4510 Ops/s 607.7221 Ops/s $\textbf{\color{#d91a1a}-7.28\%}$
test_sac_speed[False-None] 8.3551ms 7.7767ms 128.5897 Ops/s 127.0418 Ops/s $\color{#35bf28}+1.22\%$
test_sac_speed[False-backward] 11.3981ms 10.9184ms 91.5887 Ops/s 92.2933 Ops/s $\color{#d91a1a}-0.76\%$
test_sac_speed[True-None] 1.5956ms 1.5397ms 649.4933 Ops/s 631.6368 Ops/s $\color{#35bf28}+2.83\%$
test_sac_speed[True-backward] 3.2460ms 3.1989ms 312.6054 Ops/s 308.2435 Ops/s $\color{#35bf28}+1.42\%$
test_sac_speed[reduce-overhead-None] 22.4889ms 12.4542ms 80.2940 Ops/s 80.2632 Ops/s $\color{#35bf28}+0.04\%$
test_sac_speed[reduce-overhead-backward] 1.4066ms 1.3262ms 754.0308 Ops/s 753.4352 Ops/s $\color{#35bf28}+0.08\%$
test_redq_speed[False-None] 8.1420ms 7.3167ms 136.6742 Ops/s 133.2472 Ops/s $\color{#35bf28}+2.57\%$
test_redq_speed[False-backward] 12.0552ms 11.0253ms 90.7004 Ops/s 88.6398 Ops/s $\color{#35bf28}+2.32\%$
test_redq_speed[True-None] 2.0419ms 1.9934ms 501.6448 Ops/s 497.1544 Ops/s $\color{#35bf28}+0.90\%$
test_redq_speed[True-backward] 3.7712ms 3.6272ms 275.6959 Ops/s 258.0235 Ops/s $\textbf{\color{#35bf28}+6.85\%}$
test_redq_speed[reduce-overhead-None] 2.3191ms 2.0020ms 499.5084 Ops/s 495.2383 Ops/s $\color{#35bf28}+0.86\%$
test_redq_speed[reduce-overhead-backward] 3.6582ms 3.6217ms 276.1172 Ops/s 255.7163 Ops/s $\textbf{\color{#35bf28}+7.98\%}$
test_redq_deprec_speed[False-None] 9.2142ms 8.7375ms 114.4494 Ops/s 112.3524 Ops/s $\color{#35bf28}+1.87\%$
test_redq_deprec_speed[False-backward] 12.0469ms 11.6207ms 86.0535 Ops/s 82.2731 Ops/s $\color{#35bf28}+4.59\%$
test_redq_deprec_speed[True-None] 2.3946ms 2.3149ms 431.9891 Ops/s 428.6947 Ops/s $\color{#35bf28}+0.77\%$
test_redq_deprec_speed[True-backward] 3.9968ms 3.9499ms 253.1729 Ops/s 237.4764 Ops/s $\textbf{\color{#35bf28}+6.61\%}$
test_redq_deprec_speed[reduce-overhead-None] 2.4709ms 2.3605ms 423.6399 Ops/s 414.9989 Ops/s $\color{#35bf28}+2.08\%$
test_redq_deprec_speed[reduce-overhead-backward] 4.0784ms 3.9411ms 253.7343 Ops/s 239.7391 Ops/s $\textbf{\color{#35bf28}+5.84\%}$
test_td3_speed[False-None] 7.8531ms 7.5779ms 131.9634 Ops/s 129.8520 Ops/s $\color{#35bf28}+1.63\%$
test_td3_speed[False-backward] 10.3239ms 9.8252ms 101.7796 Ops/s 98.0814 Ops/s $\color{#35bf28}+3.77\%$
test_td3_speed[True-None] 1.6264ms 1.5776ms 633.8803 Ops/s 639.3638 Ops/s $\color{#d91a1a}-0.86\%$
test_td3_speed[True-backward] 3.2654ms 3.0824ms 324.4190 Ops/s 303.1164 Ops/s $\textbf{\color{#35bf28}+7.03\%}$
test_td3_speed[reduce-overhead-None] 80.1571ms 25.7003ms 38.9101 Ops/s 37.6738 Ops/s $\color{#35bf28}+3.28\%$
test_td3_speed[reduce-overhead-backward] 1.3346ms 1.2876ms 776.6446 Ops/s 693.5688 Ops/s $\textbf{\color{#35bf28}+11.98\%}$
test_cql_speed[False-None] 16.0594ms 15.5748ms 64.2063 Ops/s 62.2276 Ops/s $\color{#35bf28}+3.18\%$
test_cql_speed[False-backward] 21.3814ms 20.6675ms 48.3852 Ops/s 46.2247 Ops/s $\color{#35bf28}+4.67\%$
test_cql_speed[True-None] 2.9648ms 2.9158ms 342.9643 Ops/s 339.4590 Ops/s $\color{#35bf28}+1.03\%$
test_cql_speed[True-backward] 5.5929ms 5.0643ms 197.4594 Ops/s 189.6835 Ops/s $\color{#35bf28}+4.10\%$
test_cql_speed[reduce-overhead-None] 22.9544ms 13.0333ms 76.7266 Ops/s 76.1663 Ops/s $\color{#35bf28}+0.74\%$
test_cql_speed[reduce-overhead-backward] 1.5387ms 1.4609ms 684.5153 Ops/s 611.0486 Ops/s $\textbf{\color{#35bf28}+12.02\%}$
test_a2c_speed[False-None] 3.3029ms 3.0880ms 323.8308 Ops/s 316.2381 Ops/s $\color{#35bf28}+2.40\%$
test_a2c_speed[False-backward] 6.5935ms 5.9218ms 168.8676 Ops/s 157.4708 Ops/s $\textbf{\color{#35bf28}+7.24\%}$
test_a2c_speed[True-None] 1.0498ms 0.9948ms 1.0052 KOps/s 991.6300 Ops/s $\color{#35bf28}+1.37\%$
test_a2c_speed[True-backward] 2.6898ms 2.6218ms 381.4118 Ops/s 357.4816 Ops/s $\textbf{\color{#35bf28}+6.69\%}$
test_a2c_speed[reduce-overhead-None] 0.3833s 12.4015ms 80.6354 Ops/s 86.3825 Ops/s $\textbf{\color{#d91a1a}-6.65\%}$
test_a2c_speed[reduce-overhead-backward] 1.0430ms 0.9921ms 1.0079 KOps/s 1.0199 KOps/s $\color{#d91a1a}-1.17\%$
test_ppo_speed[False-None] 3.6422ms 3.5536ms 281.4087 Ops/s 275.4021 Ops/s $\color{#35bf28}+2.18\%$
test_ppo_speed[False-backward] 7.0083ms 6.5764ms 152.0590 Ops/s 149.0190 Ops/s $\color{#35bf28}+2.04\%$
test_ppo_speed[True-None] 1.0731ms 0.9487ms 1.0540 KOps/s 1.0475 KOps/s $\color{#35bf28}+0.62\%$
test_ppo_speed[True-backward] 2.6108ms 2.5675ms 389.4777 Ops/s 363.4696 Ops/s $\textbf{\color{#35bf28}+7.16\%}$
test_ppo_speed[reduce-overhead-None] 0.5733ms 0.4989ms 2.0045 KOps/s 1.9069 KOps/s $\textbf{\color{#35bf28}+5.12\%}$
test_ppo_speed[reduce-overhead-backward] 1.0155ms 0.9693ms 1.0317 KOps/s 1.0111 KOps/s $\color{#35bf28}+2.03\%$
test_reinforce_speed[False-None] 2.2244ms 2.1385ms 467.6081 Ops/s 458.6622 Ops/s $\color{#35bf28}+1.95\%$
test_reinforce_speed[False-backward] 3.5581ms 3.1208ms 320.4284 Ops/s 313.3217 Ops/s $\color{#35bf28}+2.27\%$
test_reinforce_speed[True-None] 0.8993ms 0.8288ms 1.2065 KOps/s 1.2002 KOps/s $\color{#35bf28}+0.52\%$
test_reinforce_speed[True-backward] 2.4559ms 2.4206ms 413.1130 Ops/s 382.3106 Ops/s $\textbf{\color{#35bf28}+8.06\%}$
test_reinforce_speed[reduce-overhead-None] 22.6284ms 11.8302ms 84.5296 Ops/s 85.8946 Ops/s $\color{#d91a1a}-1.59\%$
test_reinforce_speed[reduce-overhead-backward] 1.1108ms 1.0534ms 949.2820 Ops/s 949.0889 Ops/s $\color{#35bf28}+0.02\%$
test_iql_speed[False-None] 9.3458ms 8.8609ms 112.8557 Ops/s 111.3480 Ops/s $\color{#35bf28}+1.35\%$
test_iql_speed[False-backward] 12.9631ms 12.5090ms 79.9422 Ops/s 79.0567 Ops/s $\color{#35bf28}+1.12\%$
test_iql_speed[True-None] 1.8500ms 1.7559ms 569.5098 Ops/s 573.3743 Ops/s $\color{#d91a1a}-0.67\%$
test_iql_speed[True-backward] 4.6512ms 4.2149ms 237.2537 Ops/s 224.7342 Ops/s $\textbf{\color{#35bf28}+5.57\%}$
test_iql_speed[reduce-overhead-None] 20.1908ms 11.5721ms 86.4145 Ops/s 88.1361 Ops/s $\color{#d91a1a}-1.95\%$
test_iql_speed[reduce-overhead-backward] 1.5234ms 1.4203ms 704.0773 Ops/s 707.8258 Ops/s $\color{#d91a1a}-0.53\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 7.9096ms 6.2692ms 159.5087 Ops/s 156.5586 Ops/s $\color{#35bf28}+1.88\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.6508ms 0.3106ms 3.2199 KOps/s 3.0489 KOps/s $\textbf{\color{#35bf28}+5.61\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.6902ms 0.3278ms 3.0507 KOps/s 3.1977 KOps/s $\color{#d91a1a}-4.60\%$
test_rb_sample[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3204ms 6.0870ms 164.2856 Ops/s 163.0440 Ops/s $\color{#35bf28}+0.76\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.8088ms 0.2630ms 3.8023 KOps/s 3.5827 KOps/s $\textbf{\color{#35bf28}+6.13\%}$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 0.5682ms 0.2458ms 4.0676 KOps/s 4.0294 KOps/s $\color{#35bf28}+0.95\%$
test_rb_sample[TensorDictReplayBuffer-LazyMemmapStorage-sampler6-10000] 1.5130ms 1.2480ms 801.2602 Ops/s 781.3254 Ops/s $\color{#35bf28}+2.55\%$
test_rb_sample[TensorDictReplayBuffer-LazyTensorStorage-sampler7-10000] 1.3988ms 1.1898ms 840.4431 Ops/s 815.9808 Ops/s $\color{#35bf28}+3.00\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.4447ms 6.2607ms 159.7257 Ops/s 158.5781 Ops/s $\color{#35bf28}+0.72\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 0.8430ms 0.4305ms 2.3227 KOps/s 2.2362 KOps/s $\color{#35bf28}+3.87\%$
test_rb_sample[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6011ms 0.4155ms 2.4067 KOps/s 2.4703 KOps/s $\color{#d91a1a}-2.57\%$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-RandomSampler-4000] 6.3002ms 6.1197ms 163.4055 Ops/s 162.8468 Ops/s $\color{#35bf28}+0.34\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-10000] 0.8187ms 0.3180ms 3.1444 KOps/s 2.7192 KOps/s $\textbf{\color{#35bf28}+15.64\%}$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-10000] 0.5373ms 0.3050ms 3.2783 KOps/s 3.0735 KOps/s $\textbf{\color{#35bf28}+6.66\%}$
test_rb_iterate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-4000] 6.3412ms 6.0381ms 165.6150 Ops/s 164.0526 Ops/s $\color{#35bf28}+0.95\%$
test_rb_iterate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-10000] 0.5463ms 0.2773ms 3.6064 KOps/s 3.5450 KOps/s $\color{#35bf28}+1.73\%$
test_rb_iterate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-10000] 6.7330ms 0.2495ms 4.0078 KOps/s 3.1812 KOps/s $\textbf{\color{#35bf28}+25.98\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-ListStorage-None-4000] 6.3864ms 6.2939ms 158.8852 Ops/s 159.6100 Ops/s $\color{#d91a1a}-0.45\%$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-10000] 1.9099ms 0.4292ms 2.3298 KOps/s 2.1609 KOps/s $\textbf{\color{#35bf28}+7.82\%}$
test_rb_iterate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-10000] 0.6294ms 0.4250ms 2.3529 KOps/s 2.1387 KOps/s $\textbf{\color{#35bf28}+10.02\%}$
test_rb_populate[TensorDictReplayBuffer-ListStorage-RandomSampler-400] 6.9775ms 5.3011ms 188.6408 Ops/s 183.4129 Ops/s $\color{#35bf28}+2.85\%$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-RandomSampler-400] 9.5013ms 2.0358ms 491.2122 Ops/s 426.3824 Ops/s $\textbf{\color{#35bf28}+15.20\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-RandomSampler-400] 7.1430ms 1.1821ms 845.9833 Ops/s 882.5436 Ops/s $\color{#d91a1a}-4.14\%$
test_rb_populate[TensorDictReplayBuffer-ListStorage-SamplerWithoutReplacement-400] 0.4784s 14.8681ms 67.2580 Ops/s 187.8401 Ops/s $\textbf{\color{#d91a1a}-64.19\%}$
test_rb_populate[TensorDictReplayBuffer-LazyMemmapStorage-SamplerWithoutReplacement-400] 10.5634ms 1.9954ms 501.1470 Ops/s 464.5345 Ops/s $\textbf{\color{#35bf28}+7.88\%}$
test_rb_populate[TensorDictReplayBuffer-LazyTensorStorage-SamplerWithoutReplacement-400] 6.6398ms 1.1896ms 840.6311 Ops/s 792.0464 Ops/s $\textbf{\color{#35bf28}+6.13\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-ListStorage-None-400] 7.1642ms 5.5750ms 179.3707 Ops/s 33.2309 Ops/s $\textbf{\color{#35bf28}+439.77\%}$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyMemmapStorage-None-400] 10.7028ms 2.2262ms 449.2025 Ops/s 453.5397 Ops/s $\color{#d91a1a}-0.96\%$
test_rb_populate[TensorDictPrioritizedReplayBuffer-LazyTensorStorage-None-400] 3.6458ms 1.2934ms 773.1549 Ops/s 850.9063 Ops/s $\textbf{\color{#d91a1a}-9.14\%}$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-True] 13.1307ms 12.9451ms 77.2496 Ops/s 76.1509 Ops/s $\color{#35bf28}+1.44\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-10000-10000-100-False] 18.6687ms 17.1962ms 58.1524 Ops/s 59.8760 Ops/s $\color{#d91a1a}-2.88\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-True] 18.0152ms 17.5888ms 56.8545 Ops/s 55.3634 Ops/s $\color{#35bf28}+2.69\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-100000-10000-100-False] 17.5618ms 17.3427ms 57.6611 Ops/s 58.8247 Ops/s $\color{#d91a1a}-1.98\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-True] 17.6414ms 17.3866ms 57.5156 Ops/s 55.8545 Ops/s $\color{#35bf28}+2.97\%$
test_rb_extend_sample[ReplayBuffer-LazyTensorStorage-RandomSampler-1000000-10000-100-False] 18.9671ms 18.5875ms 53.7997 Ops/s 55.0177 Ops/s $\color{#d91a1a}-2.21\%$
