takuseno / d3rlpy

An offline deep reinforcement learning library
https://takuseno.github.io/d3rlpy
MIT License

[QUESTION] Issue with Evaluating Decision Transformer Using Evaluators in d3rlpy #406


XiudingCai commented 1 month ago

I have encountered an issue while trying to evaluate the performance of the Decision Transformer (DT) using the d3rlpy library. Unlike other algorithms such as CQL, DT does not seem to support passing evaluators like:

evaluators={
    'action_diff': d3rlpy.metrics.ContinuousActionDiffEvaluator(test_episodes),
}

This limitation is problematic, especially in settings where an environment is not available for evaluation. It hinders the ability to compare the performance of DT with other methods under these conditions.
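For context, what an evaluator like `ContinuousActionDiffEvaluator` measures is, in spirit, the mean squared difference between the policy's actions and the actions stored in the held-out episodes. A standalone numpy sketch of that metric (an illustration, not the library's actual implementation):

```python
import numpy as np

# Mean squared difference between the policy's predicted actions and the
# actions recorded in the dataset -- the quantity an action-diff evaluator
# tracks. This is an illustrative re-implementation, not d3rlpy's code.
def action_diff(predicted_actions, dataset_actions):
    predicted = np.asarray(predicted_actions, dtype=float)
    reference = np.asarray(dataset_actions, dtype=float)
    return float(np.mean((predicted - reference) ** 2))

print(action_diff([[0.5, 0.5]], [[1.0, 0.0]]))  # → 0.25
```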

Is there a workaround or a recommended approach for this situation?

Thanks so much! :>

takuseno commented 1 month ago

@XiudingCai Hi, sorry for the late response. This is a tricky issue because Q-learning and Decision Transformer are completely different algorithms, so it's difficult for them to share a common interface. One possible workaround is to use the callback option of the fit method, which lets you run arbitrary logic at every step.
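As a rough sketch of that workaround: build a callback that periodically computes the action-diff metric over held-out data. This assumes the callback receives `(algo, epoch, total_step)`; `predict_fn` stands in for however you query the DT policy (e.g. via a stateful wrapper), and that wiring is an assumption, not verified against the library.

```python
import numpy as np

# Hypothetical helper: a fit() callback that logs mean squared action
# difference every `eval_interval` steps. The (algo, epoch, total_step)
# signature and the `predict_fn` hook are assumptions for illustration.
def make_action_diff_callback(predict_fn, test_observations, test_actions,
                              eval_interval=1000):
    results = []  # list of (total_step, action_diff) pairs

    def callback(algo, epoch, total_step):
        if total_step % eval_interval != 0:
            return
        preds = np.asarray([predict_fn(obs) for obs in test_observations])
        diff = float(np.mean((preds - np.asarray(test_actions)) ** 2))
        results.append((total_step, diff))

    return callback, results

# Dummy usage: a "policy" that always outputs zeros against unit actions.
cb, log = make_action_diff_callback(
    predict_fn=lambda obs: np.zeros(2),
    test_observations=[np.ones(3)] * 4,
    test_actions=[np.ones(2)] * 4,
    eval_interval=1000,
)
cb(None, epoch=0, total_step=1000)  # algo is unused in this dummy sketch
print(log)  # → [(1000, 1.0)]
```

You would then pass the returned `callback` to `fit(..., callback=callback)` and read metrics out of `results` after (or during) training.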

https://github.com/takuseno/d3rlpy/blob/3433de5d7c9d98c03549f8bdae5552b88d37e756/d3rlpy/algos/transformer/base.py#L391