Closed DT6A closed 1 year ago
@odelalleau thanks for your comments.
Yes, you are right I missed that TwinQ returns minimum of two Qs and this changes can be reverted.
Talking about LOG_STD_MIN
I just decided to check https://github.com/typoverflow/OfflineRL-Lib implementation and took this value from there and as far as I know this is more common to use -20.
set_to_none=True
is removed to eliminate confusion as it is used rarely.
@DT6A Here is the finished sweep for the fixed IQL. Can you please update the corresponding report? And then I believe we are good to merge.
@vkurenkov report is updated
@DT6A we also need to update the readme Ok, separate PR for readme
Fixing IQL.
One of the problems was pointed here: https://github.com/tinkoff-ai/CORL/pull/41
Another problem is a usage of only one of the critics during updates while two must be used
Reruns on all datasets are needed