henryzhongsc / adv_robust_gkp

Official implementation for Zhong et al., One Less Reason for Filter Pruning: Gaining Free Adversarial Robustness with Structured Grouped Kernel Pruning. NeurIPS 2023

Training a network from scratch: not doable #1

Open giorgiopiras opened 7 months ago

giorgiopiras commented 7 months ago

Hi @henryzhongsc, thanks for your work on this repo.

I was wondering whether it is possible to train a model other than ResNet20 with the current state of the code. I am trying to prune a ResNet18, but it looks like I cannot perform the standard pretraining with the code as it stands: 2024-02-19 15:39:53,474 | ERROR : Input task <train> is not supported. Likewise, it is a bit hard to load the state dict of an externally pretrained model into your ModifiedResNet class in prune_cifar.py (line 33).

Any chance you are planning to address these issues in the code, or to release some pretrained checkpoints? Thanks, Giorgio

henryzhongsc commented 7 months ago

Hi Giorgio,

Thank you for checking out our work. It is possible to train/prune a model other than ResNet20. The current prune_cifar.py supports all BasicBlock CifarResNets (namely ResNet20/32/56/110, as reported in the paper). The case of ResNet18 is a bit tricky, as it is normally viewed as an ImageNet ResNet, for which we opted to support the BottleNeck variants of the family (ResNet50 and ResNet101) in the prune_imagenet.py series of pipelines for ImageNet-like evaluations. Given that ResNet18 is built from BasicBlocks rather than BottleNecks, prune_imagenet.py won't be directly applicable either.
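For a quick sanity check of this block-type split, here is a small illustration using the torchvision model definitions (this snippet is not part of our pipeline, just a way to see which block family each model belongs to):

from torchvision.models import resnet18, resnet50
from torchvision.models.resnet import BasicBlock, Bottleneck

# ResNet18 is assembled from BasicBlocks, the same block type as the CifarResNets.
print(isinstance(resnet18().layer1[0], BasicBlock))   # True
# ResNet50 is assembled from Bottlenecks, the block type prune_imagenet.py targets.
print(isinstance(resnet50().layer1[0], Bottleneck))   # True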

However, this should not affect the training of ResNet18, since our method is applied post-training. The supplied training code here is vanilla and model-agnostic. Suppose you supply a ResNet18 model definition, randomly initialize it, save the randomly initialized model as a checkpoint (a rough sketch of this step follows the command below), and then hit it with a script like:

!python main.py \
--exp_desc resnet18_ba_train \
--setting_dir /content/drive/MyDrive/adv_robust_gkp/settings/cifar_ba_train_setting.json \
--dataset cifar10 \
--model_dir /content/drive/MyDrive/adv_robust_gkp/ckpts/resnet18_init.pt \
--output_folder_dir /content/drive/MyDrive/adv_robust_gkp/output/resnet18_ba_trained/ \
--task train \
--adv_attack no_attack

(The cifar_ba_train_setting.json here is just gkp_cifar10_finetune.json with the initial "lr" set to 0.1, if you want to be consistent with our CIFAR baseline training setting.)
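To make the checkpoint step concrete, producing the resnet18_init.pt referenced above could look roughly like the following. The torchvision model definition, the CIFAR-style stem tweaks, and the state_dict format are all assumptions on my end; adapt them to whatever model definition and loading convention you end up using:

import torch
import torch.nn as nn
from torchvision.models import resnet18

model = resnet18(num_classes=10)  # 10 classes for CIFAR-10
# Common CIFAR adaptation (assumption): 3x3 stem conv and no initial max-pool.
model.conv1 = nn.Conv2d(3, 64, kernel_size=3, stride=1, padding=1, bias=False)
model.maxpool = nn.Identity()

# Save the untrained weights; switch to torch.save(model, ...) if the pipeline
# expects a full pickled model rather than a state_dict.
torch.save(model.state_dict(), 'resnet18_init.pt')

Deriving the settings file is then just a matter of copying gkp_cifar10_finetune.json and bumping the learning rate, something like this (assuming a flat top-level "lr" key, as quoted above):

import json

with open('settings/gkp_cifar10_finetune.json') as f:
    cfg = json.load(f)
cfg['lr'] = 0.1  # match our CIFAR baseline training setting
with open('settings/cifar_ba_train_setting.json', 'w') as f:
    json.dump(cfg, f, indent=4)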

It should totally work. Based on the error you are showing, my guess is that you ran the gkp_main.py file, which is dedicated to pruning, not training/finetuning. This is my bad; I will update the doc/demo to highlight that.

For now, you would need to modify the prune_imagenet code yourself to make it prune ResNet18 (whether trained on CIFAR or not). We do plan to support more models, but not in the hard-coded way we are doing here: we are working on an index-based implementation for some typical pruning granularities (basically filter and grouped kernel), under which we can easily extend our pruning implementation to ResNet18. It might take a while, though, as that implementation is still in progress. My plan is to get the model checkpoints and the (hard-coded) implementations of other pruning methods out first.