Open XiudingCai opened 1 month ago
@XiudingCai Hi, sorry for the late response. This is a little tricky issue because Q-learning and Decision Transformer are completely different algorithms and it's difficult to share some interface. One possible workaround is to use callback
option at fit
method. You can add arbitrary logics at every steps.
I have encountered an issue while trying to evaluate the performance of the Decision Transformer (DT) using the
d3rlpy
library. Unlike other methods such as CQL, it seems that DT does not support passing evaluators like:This limitation is problematic, especially in settings where an environment is not available for evaluation. It hinders the ability to compare the performance of DT with other methods under these conditions.
Is there a workaround or a recommended approach for this situation?
Thanks so much! :>