jannerm / trajectory-transformer

Code for the paper "Offline Reinforcement Learning as One Big Sequence Modeling Problem"
https://trajectory-transformer.github.io
MIT License
455 stars 63 forks source link

imitation learning results on HalfCheetah env #10

Open Howuhh opened 2 years ago

Howuhh commented 2 years ago

Hi! I noticed that I can't get good results on the HalfCheetah environment with imitation learning (with plain beam search decoding by logprob) even after long training and without overfitting (but can on Hopper). I also noticed that in the paper only results on Hopper and Walker2d are presented for imitation learning section.

Have you encountered the same difficulties? Or haven't considered testing in this environment? If so, where there any particular reasons for this?