Closed Xijieupenn closed 2 years ago
Hi, thanks for pointing this out. The main reason is the set error of argument _policydict in the policy_test function. If the policy name is with _"checkpoint", the _policydict should be True. Otherwise, it should be False.
I have fixed this bug, and you can try to train with the letest version. Thanks
Thank you so much. I will try to retrain it. However, I just met a new problem when trying to test my newly trained model.
I do not know why the model cannot be pickled. I believe the path to the model is correct.
Hi, I have tested the policy test function in the policy_test.py, and there is no error like yours.
The pickle.load() function is to load the argments that saved in the pickle.dump() in train_process.py, and is only related to the python module pickle. You can test the pickle.dump() and pickle.load() solely to find the main reason. If there are still this error, you may try the other version of pickle
The new version works fine. Thank you so much.
When training to 250 epochs, I get the following error.
Could you help me see what the mistake might be?