Closed: Pandint closed this issue 1 week ago
Do all results appear corrupted, or only a handful? Because the approach relies on activation editing, the outputs can sometimes look corrupted. You can try extracting the mean activations from a larger number of images, and using more than 10 examples in the reinforce_train step.
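To illustrate why more images can help, here is a minimal sketch. It is not the repository's code: it uses synthetic Gaussian "activations" (an assumption) to show that an averaged estimate moves closer to the underlying mean as the number of images grows. The function `mean_activation_error` and all of its parameters are hypothetical. In the collection command, the `--num_collections` flag presumably controls the image count, but that reading is also an assumption.

```python
import math
import random


def mean_activation_error(num_images, dim=16, noise=1.0, trials=20, seed=0):
    """Average L2 distance between an estimated mean and the true mean.

    Synthetic stand-in only: each "activation" is the true mean plus
    independent Gaussian noise, which is an assumption for illustration.
    """
    rng = random.Random(seed)
    # Hypothetical "true" mean activation vector.
    true_mean = [rng.gauss(0.0, 1.0) for _ in range(dim)]
    total = 0.0
    for _ in range(trials):
        sums = [0.0] * dim
        for _ in range(num_images):
            for d in range(dim):
                sums[d] += true_mean[d] + rng.gauss(0.0, noise)
        estimate = [s / num_images for s in sums]
        total += math.sqrt(sum((e - t) ** 2 for e, t in zip(estimate, true_mean)))
    return total / trials


if __name__ == "__main__":
    for n in (10, 100, 1000):
        print(n, round(mean_activation_error(n), 3))
```

Under these assumptions the error shrinks roughly with the square root of the image count, so going from 100 to 1000 images should give a noticeably more stable mean than 100 alone.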
Thank you for your response! I increased the number of images and used more examples, and some results have improved!
Hi! I followed your instructions in the README file and tried to reproduce the segmentation results. However, the visual results seem to be corrupted like this:
Here are my steps to obtain the results and the training logs.
python collect_attention_heads.py --model mae_vit_large_patch16 --base_dir path/to/pascal-5i --output_dir ./output_dir_official --ckpt ./checkpoint-3400.pth --device cuda --num_collections 100
python reinforce_train.py --model mae_vit_large_patch16 --base_dir path/to/pascal-5i --output_dir ./output_dir_official --ckpt ./checkpoint-3400.pth --split 0 --device cuda --task 0
python reinforce_evaluate.py --model mae_vit_large_patch16 --base_dir path/to/pascal-5i --ckpt ./checkpoint-3400.pth --split 0 --device cuda --setup official --output_dir ./output_dir_official --task 0 --load_model path/to/bernoullis_0_1_0_0_10_0.1_-1.0_best.pkl --save_images 1