Open Samoed opened 2 months ago
Oh interesting I'll check this and get back to you - sorry!
The problem still exists. It seems there is a 2048 token limit for the Phi-3 mini/medium model, but not for other models in Unsloth
Apologies I'll escalate this to higher priority - will try getting a fix for this
Hi! I'm encountering an issue while tuning phi-3 on long sequences with batch sizes greater than 1. Below is the code to reproduce the problem:
Working Code:
Code with Error:
Notebook with example.
Any insights on how to resolve this issue would be greatly appreciated!