Closed bardia01 closed 5 months ago
Can you please follow the template for filing an issue?
Additionally, the proposal presented here doesn't seem feasible to me at the moment. Perhaps exploring the option --accelerator cpu
might yield something different for you? However, troubleshooting would be more straightforward if you included your command, as exemplified in the provided template.
Hi, I changed the format - using the accelerator flag doesn't seem to help. Additionally, I made the change I suggested locally and it does fix the issue
Hi bardia, It seems it not follows the mase coding consistency. You can try the modification in commit: 6947cc3f50f7cd1e71414ec2fbc1024421a1c7a3 to check whether it can solve your problem or not.
Question: When loading a trained model onto a cpu machine, an error occurs in mase/machop/chop/tools/checkpoint_load.py due to not having a CPU
Commit hash: https://github.com/DeepWok/mase/commit/e98079f83827c3457be56e7f2b83d22d62fe780f
Command to reproduce:
./ch search --accelerator cpu --config configs/examples/jsc_bardia_by_type.toml --load /content/mase/mase_output/jsc_bardia_e_50_b_128_l_001/software/training_ckpts/best.ckpt
Error log:
Comments: Please would you consider changing "src_state_dict = torch.load(checkpoint)["state_dict"]" to something like:
so that this doesn't break when using CPU
The accelerator flag doesn't seem to help - the print below shows that the accelerator is correctly overridden to cpu