Open zhangyuanyang21 opened 1 year ago
Hello. Those two lines mean the program only got as far as loading the dataset. After that, did the program exit immediately, or did it hang?
After that it prints "Process finished with exit code 0", so it seems to have run to completion.
Check whether args.start_epoch is set correctly.
seed: 42
workers: 8
dataset_dir: "./datasets"
batch_size: 128
image_size: 224
start_epoch: 1000
epochs: 1000
dataset: "CIFAR-10" # CIFAR-10 / CIFAR-100 / STL-10 / ImageNet-10 / ImageNet-dogs / tiny-ImageNet
resnet: "ResNet34" # ResNet18 / ResNet34 / ResNet50
feature_dim: 128
model_path: "save/CIFAR-10"
reload: False
learning_rate: 0.0003
weight_decay: 0.
instance_temperature: 0.5
cluster_temperature: 1.0

I have not changed this config; these are all still the default values.
start_epoch should be 0. I probably had not fixed that yet when you downloaded the code.
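To illustrate why the default config exits silently with code 0, here is a minimal sketch. It assumes that train.py iterates over epochs with something like `for epoch in range(args.start_epoch, args.epochs)`; that loop shape is an assumption and is not quoted from the repository, and `epochs_to_run` is a hypothetical helper used only for this illustration.

```python
def epochs_to_run(start_epoch, epochs):
    """Return the epoch indices that a `range(start_epoch, epochs)` loop would visit.

    Assumption: the training script loops this way; the helper is hypothetical.
    """
    return list(range(start_epoch, epochs))


# Values taken from the config posted in this thread.
default_cfg = {"start_epoch": 1000, "epochs": 1000}
# The maintainer's suggested fix.
fixed_cfg = {"start_epoch": 0, "epochs": 1000}

default_epochs = epochs_to_run(**default_cfg)
fixed_epochs = epochs_to_run(**fixed_cfg)

# With start_epoch == epochs the range is empty, so the loop body never runs
# and the script ends normally right after the dataset is loaded.
print("epochs with default config:", len(default_epochs))
print("epochs with start_epoch = 0:", len(fixed_epochs))
```

Under that assumption, the default config trains for zero epochs, which would match the symptom of seeing only the dataset messages followed by a clean exit.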
Hello. After setting start_epoch to 0, I get this error: RuntimeError: CUDA out of memory. Tried to allocate 26.00 MiB (GPU 0; 8.00 GiB total capacity; 7.13 GiB already allocated; 0 bytes free; 7.31 GiB reserved in total by PyTorch) If reserved memory is >> allocated memory try setting max_split_size_mb to avoid fragmentation. See documentation for Memory Management and PYTORCH_CUDA_ALLOC_CONF. But my GPU should actually be sufficient.
8 GB is probably not enough. As far as I remember, it needs somewhere between 10 and 20 GB of GPU memory.
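The numbers in the error message can be checked directly. The sketch below parses the message quoted in this thread using only the standard library; `parse_gib` and `summarize_oom` are hypothetical helper names introduced for illustration and are not part of PyTorch. It compares reserved against allocated memory, since the message itself only suggests max_split_size_mb when reserved is much larger than allocated.

```python
import re

# Error text as posted in this thread.
OOM_MESSAGE = (
    "CUDA out of memory. Tried to allocate 26.00 MiB (GPU 0; 8.00 GiB total capacity; "
    "7.13 GiB already allocated; 0 bytes free; 7.31 GiB reserved in total by PyTorch)"
)


def parse_gib(message, label):
    """Extract the GiB figure that precedes `label` in the error message."""
    match = re.search(r"([\d.]+) GiB " + re.escape(label), message)
    if match is None:
        raise ValueError("label not found in message: " + label)
    return float(match.group(1))


def summarize_oom(message):
    """Return the memory figures from the message plus two derived values."""
    total = parse_gib(message, "total capacity")
    allocated = parse_gib(message, "already allocated")
    reserved = parse_gib(message, "reserved in total")
    return {
        "total_gib": total,
        "allocated_gib": allocated,
        "reserved_gib": reserved,
        # Memory held by the caching allocator but not backing live tensors.
        "reserved_minus_allocated_gib": reserved - allocated,
        # Share of the whole card already occupied by live tensors.
        "allocated_fraction": allocated / total,
    }


summary = summarize_oom(OOM_MESSAGE)
for key, value in summary.items():
    print(key, round(value, 3))
```

Here reserved exceeds allocated by only about 0.18 GiB, so fragmentation does not look like the cause, and live tensors already fill roughly 89% of the 8 GiB card. That is consistent with the reply above that the card is simply too small for this configuration.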
OK, thank you very much for your reply!
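Hello, I have a question. I am on Windows and run train.py from PyCharm, but the only output is "Files already downloaded and verified", printed twice. What is going on? I would like to run several datasets and compare against your method in my experiments. Could you offer some help?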