First trial run for ConvNext here.
Lessons learned:

- `learning_rate` is way too big: around step = 15k we can see the moment ConvNext starts to overfit harshly, in a giant drop of `val_accuracy`.
- `epochs` is set far too high. To launch a lot of experiments I'll go with 5-10 epochs from now on. To find a more reasonable learning rate I'll launch a training run with the model's recommended lr and, while the model trains, implement the fastai `lr_finder` for future use (see the sketch after this list).
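For reference, the idea behind fastai's `lr_finder` is an exponential LR range test: sweep the learning rate upward over a few hundred batches, record the loss, and pick a value comfortably below the point where the loss diverges. A minimal sketch of that technique (function name, defaults, and the divergence heuristic are my own, not the linked implementation):

```python
import math

import torch


def lr_range_test(model, optimizer, criterion, loader, device="cpu",
                  start_lr=1e-7, end_lr=10.0, num_steps=100, diverge_factor=4.0):
    """Exponentially sweep the learning rate and record the loss at each step.

    The lr at which the loss starts to explode bounds the usable range;
    a common heuristic is to pick ~1/10 of that value.
    """
    gamma = (end_lr / start_lr) ** (1.0 / num_steps)
    for group in optimizer.param_groups:
        group["lr"] = start_lr

    lrs, losses = [], []
    best_loss = float("inf")
    model.train()
    data_iter = iter(loader)

    for _ in range(num_steps):
        try:
            inputs, targets = next(data_iter)
        except StopIteration:
            data_iter = iter(loader)
            inputs, targets = next(data_iter)
        inputs, targets = inputs.to(device), targets.to(device)

        optimizer.zero_grad()
        loss = criterion(model(inputs), targets)
        loss.backward()
        optimizer.step()

        lrs.append(optimizer.param_groups[0]["lr"])
        losses.append(loss.item())
        best_loss = min(best_loss, loss.item())

        # Stop once the loss clearly diverges.
        if math.isnan(loss.item()) or loss.item() > diverge_factor * best_loss:
            break

        # Exponentially increase the lr for the next batch.
        for group in optimizer.param_groups:
            group["lr"] *= gamma

    return lrs, losses
```

Plotting `losses` against `lrs` on a log-x axis gives the usual lr-finder curve.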
Learning rate finder implementation here.
Picked `learning_rate = 0.001` for a shorter training run (I actually went for more epochs than assumed above, since the training launched before the decision was made). Results here.
The `learning_rate = 0.0004` it started with seemed to work well for the model, so I've followed up with another run at this particular learning rate here.
Major observation: the scheduler seems to be broken. It's supposed to be a OneCycleLR scheduler, yet it does not complete full cycles. Need to fix this before moving forward.
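For context, the most common way OneCycleLR ends up not cycling is being stepped once per epoch instead of once per batch, or having `total_steps` that doesn't match the actual run length. A sketch of the Lightning wiring that avoids both (this is an assumption about the likely bug, not a confirmed diagnosis, and the optimizer/lr values are illustrative):

```python
import pytorch_lightning as pl
import torch
from torch.optim.lr_scheduler import OneCycleLR


class FineTuner(pl.LightningModule):
    # training_step / validation_step omitted; only the scheduler wiring shown.

    def configure_optimizers(self):
        optimizer = torch.optim.AdamW(self.parameters(), lr=4e-4)
        # OneCycleLR must know the total number of optimizer steps so that
        # exactly one cycle spans the whole run; Lightning exposes that count.
        scheduler = OneCycleLR(
            optimizer,
            max_lr=4e-4,
            total_steps=self.trainer.estimated_stepping_batches,
        )
        return {
            "optimizer": optimizer,
            "lr_scheduler": {
                "scheduler": scheduler,
                # Step every batch; Lightning's default interval is "epoch",
                # which leaves OneCycleLR stuck near the start of its cycle.
                "interval": "step",
            },
        }
```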
Scheduler fixed, can progress with the fine-tuning.
Now that we've reproduced the authors' ResNet18 results (mostly to confirm that the `lightning` and `neptune` framework works the same way as their `skorch` setup), we want to try more recent models than ResNet. The goal is to fine-tune ConvNext (a pretrained model can be taken from `timm`) on our dataset. We are benchmarking against the authors' val F1 = 0.91.