Open: ratthachat opened 9 months ago
I have the same problem as you. The agent always dies at {'coins': 0, 'flag_get': False, 'life': 2, 'score': 200, 'stage': 1, 'status': 'small', 'time': 368, 'world': 1, 'x_pos': 898, 'y_pos': 79}
Additional: I am not using an exploration_rate, so my policy is fully deterministic. My code is `mario.net.load_state_dict(torch.load('trained_mario.chkpt')['model'])`
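For anyone comparing setups, here is a minimal sketch of why the exploration rate matters at play time, assuming the agent picks actions epsilon-greedily. The helper `select_action` and the Q-values are hypothetical and not taken from the repo; it only illustrates that a rate of 0 gives a fully deterministic policy, while a rate of 1 ignores the network entirely.

```python
import random


def select_action(q_values, exploration_rate, rng=random):
    """Epsilon-greedy choice over a list of Q-values (hypothetical helper)."""
    # With probability exploration_rate, ignore the Q-values and act randomly.
    if rng.random() < exploration_rate:
        return rng.randrange(len(q_values))
    # Otherwise take the action with the highest Q-value.
    return max(range(len(q_values)), key=lambda i: q_values[i])


rng = random.Random(0)  # seeded so repeated runs behave the same
q = [0.1, 0.9]          # made-up Q-values for two actions

# exploration_rate = 0.0: always the greedy action (index 1 here).
greedy = {select_action(q, 0.0, rng) for _ in range(100)}

# exploration_rate = 1.0: every action is drawn at random.
noisy = {select_action(q, 1.0, rng) for _ in range(100)}

print("greedy actions:", greedy)
print("random actions:", noisy)
```

If the checkpoint was saved with a nonzero exploration rate and the agent restores it, play will still include random actions, so it may be worth checking which value the agent actually uses when replaying.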
Hi, thanks for the amazing repo!
I downloaded the trained weights from https://drive.google.com/file/d/1RRwhSMUrpBBRyAsfHLPGt1rlYFoiuus2/view?usp=sharing, which is linked in the README.
I then loaded the state dict into the Mario network successfully.
However, when I try to play using this trained model, Mario always dies very quickly at the beginning (e.g. within 40 frames). Is the link above still the correct path to the pretrained weights?