Trajectory Optimization API

Caltech-AMBER / ambersim

In-house tools built on GPU-accelerated simulation

MIT License

7 stars 2 forks source link

Agreed on the first two requested tests - those should be trivial to spin up.

Sanity check of choice: randomly sample a batch of initial policies and also shoot them forward. pass those guesses to vanilla predictive sampling, which returns new trajectories that should be no worse than the guesses. verify this property in a test.

TODO list:

[x] refactor q and v to x everywhere
[x] test cost function and its derivatives (thanks for suggesting this, literally all my implementations were wrong)
[x] smoke test for VanillaPredictiveSampler + optimize + jit
[x] sanity check for VanillaPredictiveSampler
[x] drop example file from PR after all other tests are implemented

Caltech-AMBER / ambersim

Trajectory Optimization API #46