Open jedrzejwalega opened 3 days ago
First, short run with learning_rate = 0.0004
, same as the one we use for ConvNext.
Doesn't work so well. Discussed with @Peterdes. I'm going to apply a few enhancements to my code:
10 epoch run seemed too short to conclude, I'm letting it run for 50.
Now that we've recreated the authors' ResNet18 results (mostly to confirm the lightning and neptune framework works same way as their skorch) we want to try more recent models than ResNet.
The goal is to fine-tune ConvNext (pretrained can be taken from timm) on our dataset).
We are benchmarking against the authors' val F1 = 0.91.
We would like to use LoRA for this. The choice of ViT is not the most important part in this experiment (we could try it with some other nn), but we want to check whether LoRA would be suitable for our modest (2.9k train) set of images available.
I'm going to learn
perf
library to apply LoRA onto ViT.