ezhang7423 closed this issue 1 year ago
Hi, would it be possible to release the checkpoints for this implementation? Would be very grateful for this.
Unfortunately I don't have saved checkpoints for this. However, it takes less than a day to train one from scratch on a single GPU. If that is still too long, you might want to check out my JAX implementation of CQL, which can train a CQL agent on a single GPU in around 2 hours.
Thank you for the response!