Default Parameters Paper <-> Repository

katerakelly / oyster

Implementation of Efficient Off-policy Meta-learning via Probabilistic Context Variables (PEARL)

MIT License

472 stars 125 forks source link

Hello, First, thank you for your research! I am currently trying (and struggling a little) to reproduce your results on the HalfCheetahVel environment. I noticed some differences between the algorithm in your paper (https://arxiv.org/pdf/1903.08254.pdf, page 5) and the default settings in this repository, i.e.

the KL loss is scaled by .1 in the default settings
the next observation is not included in the sampled context by default, whereas in the paper I got the impression the next observation would always be included in the context.

I would highly appreciate if you can make some statement about the effect of those differences. Is my assumption correct, that your results were produced with the default parameters from the repo?

Thank you in advance!

katerakelly / oyster

Default Parameters Paper <-> Repository #24