Open Delaunay opened 3 hours ago
The recipe full_finetune_distributed Appear to be much slower in v0.3 than v0.2.1
Everything seems to work as usual, but my job that used to work in v0.2.1 time out in v0.3.0.
I don't have much detail yet, but maybe as you are more familiar with the code base you could have an idea already based on what changed recently!
Can you share a few more details around which models you're using, size of dataset, machine type?
Off the very top of my head, not sure what would be going on.
The recipe full_finetune_distributed Appear to be much slower in v0.3 than v0.2.1
Everything seems to work as usual, but my job that used to work in v0.2.1 time out in v0.3.0.
I don't have much detail yet, but maybe as you are more familiar with the code base you could have an idea already based on what changed recently!