Open nabeel3133 opened 2 years ago
If you are running 3D model, the accuracy might be lower since the lesioned regions/sub-volumes tend to be in a very small proportion in comparison with the whole 3D volume. This is why 2D models are better. To improve it, combine both 2D and 3D together. Have not tried on this data set yet, but it worked for 2D/3D brain images.
So, the reported 76.6% accuracy for ViT in the paper is for 2D model?
I managed to run your code and start the training on the pre-trained model however, I am getting the same results (about 50% accuracy) as shown in the jupyter notebook
Can you let me know what changes are required to be done to achieve 76.6% accuracy as mentioned in the paper?