CASIA-IVA-Lab / AnomalyGPT

[AAAI 2024 Oral] AnomalyGPT: Detecting Industrial Anomalies Using Large Vision-Language Models
https://anomalygpt.github.io
Other
773 stars 94 forks source link

在使用其他数据集训练时出现无法保存模型的错误。 #67

Closed Fritzyuan closed 7 months ago

Fritzyuan commented 8 months ago

作者你好,我在使用AnomalyGPT进行其他数据集训练时出现了这样的错误,无法保存训练的model.pt文件: Traceback (most recent call last): File "/workspace/AnomalyGPT/code/train_mvtec.py", line 115, in main(**args) File "/workspace/AnomalyGPT/code/train_mvtec.py", line 109, in main agent.save_model(args['save_path'], 0) File "/workspace/AnomalyGPT/code/model/agent.py", line 75, in save_model torch.save(checkpoint, f'{path}/pytorch_model.pt') File "/opt/conda/lib/python3.10/site-packages/torch/serialization.py", line 441, in save _save(obj, opened_zipfile, pickle_module, pickle_protocol) File "/opt/conda/lib/python3.10/site-packages/torch/serialization.py", line 653, in _save pickler.dump(obj) TypeError: cannot pickle 'torch._C._distributed_c10d.ProcessGroup' object

其中对./code/datasets/mvtec.py的修改在: image image

训练用到的数据集的格式是和MVTec数据集一致的。想请教一下这个TypeError该如何解决?

Grace-yafei commented 6 months ago

您好,看到您关闭了问题,请问是怎么解决的

Fritzyuan commented 6 months ago

您好,看到您关闭了问题,请问是怎么解决的

使用requirement中对应版本的Pytorch。 使用2.0以上的pytorch会出现这个问题

F-Yuan303 commented 4 months ago

您好,可以请问一下您使用的数据集是什么吗?非常感谢