I ran the AWAC code and plotted the resulting 'progress.csv' file for HalfCheetah, as shown below.
The return matches only the online training part of the curve shown in the paper; it does not reproduce the return obtained from offline training. Could you suggest how to reproduce the plot from the paper? Thanks.