vincekurtz / rddp

Reward-Driven Diffusion Policy
1 stars 0 forks source link

Big refactor for speed #22

Closed vincekurtz closed 1 month ago

vincekurtz commented 2 months ago

A big reorganization to make the dataset generation much more efficient with brax envs. The upshot: previously dataset generation for the cart-pole took >70s. Now it takes <10s.

Also reorganizes how we generate data in a more sane way: