Trained baseline models perform poorly

rr-learning / CausalWorld

CausalWorld: A Robotic Manipulation Benchmark for Causal Structure and Transfer Learning

MIT License

212 stars 26 forks source link

Hi zrjnz,

Thanks for trying out the package and sorry for the late reply.

You should be able to reproduce the baseline results reported in the paper as shown here https://github.com/rr-learning/CausalWorld/tree/master/scripts We trained the baseline policies on curriculum 0 and 1 to verify this and as expected we were able to get the same results with slight variance due to the random seeds. After training the policies, the policies should behave similar to the ones here https://github.com/rr-learning/CausalWorld/tree/master/causal_world/actors.

Can you post the evaluation results using the evaluation protocols as well as the videos, so we can help you better.

rr-learning / CausalWorld