Open robertdvdk opened 5 months ago
AIM uses a batch size of 1024 for finetuning their probe, we use 512. We should check whether 1024 improves performance.
AIM uses a batch size of 1024 for finetuning their probe, we use 512. We should check whether 1024 improves performance.