Closed ByronHsu closed 1 week ago
Can you elaborate on this note?
> qwen2 vl flce is broken. we should fix later
What specifically is broken?
Sorry, the note is quite scrappy. If you uncomment https://github.com/linkedin/Liger-Kernel/blob/85d34efbd423cd97d3e97525af419193fbb07354/test/convergence/test_mini_models.py#L525 and run the convergence test, you will see the errors. The root cause is that transformers keeps updating the qwen2_vl forward, so the existing patched one is now outdated. Would you help fix it? Thanks in advance.
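For anyone following along, one way to spot this kind of drift early is to compare the argument lists of the upstream forward and the patched copy. This is only a minimal sketch with stdlib `inspect`; `upstream_forward`, `patched_forward`, `new_kwarg`, and `missing_parameters` are hypothetical stand-ins, not the actual qwen2_vl code or anything in the repo.

```python
import inspect


def missing_parameters(upstream_fn, patched_fn):
    """Return upstream parameter names that the patched function does not accept."""
    upstream = inspect.signature(upstream_fn).parameters
    patched = inspect.signature(patched_fn).parameters
    return [name for name in upstream if name not in patched]


# Hypothetical stand-ins: the real forward methods take many more arguments.
def upstream_forward(input_ids, attention_mask=None, new_kwarg=None):
    return input_ids


def patched_forward(input_ids, attention_mask=None):
    return input_ids


# Names the upstream forward accepts but the patched copy has not picked up yet.
drift = missing_parameters(upstream_forward, patched_forward)
print(drift)
```

A check like this only catches signature changes. Changes inside the body of the forward would still need the convergence test to surface them.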
Also, do you know why qwen2 vl and mllama have to exist in both the LM and the multimodal convergence tests?
Yeah I'll look into fixing!
And my knowledge is a bit out of date here, but there used to be the two different convergence test files for LLMs: the one that tested with FLCE and the one that tested with just CE.
So in order to get complete coverage of the patches, I implemented the multimodal convergence tests to mirror the ones that tested with just CE, and also added these VLMs to the convergence test file for FLCE (but only tested with text inputs there).
It looks like we've changed to only testing FLCE for the plain LLMs. So in order to mirror that change, the VLMs could be removed from the LM convergence test file and the multimodal test file should be changed to test FLCE.
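To make the CE vs. FLCE comparison concrete, here is a rough sketch of what a convergence check boils down to, assuming it compares the per-step losses of two training runs within a tolerance. The helper name `curves_match`, the tolerances, and the loss values are all made up for illustration and are not taken from the repo's test utilities.

```python
import math


def curves_match(reference, candidate, rel_tol=1e-5, abs_tol=1e-8):
    """Return True if two loss curves have equal length and agree step by step."""
    if len(reference) != len(candidate):
        return False
    return all(
        math.isclose(r, c, rel_tol=rel_tol, abs_tol=abs_tol)
        for r, c in zip(reference, candidate)
    )


# Made-up per-step losses, for illustration only.
ce_losses = [10.5, 8.2, 6.9, 6.1]          # run using plain cross entropy
flce_losses = [10.5, 8.2000001, 6.9, 6.1]  # run using fused linear cross entropy
diverged_losses = [10.5, 8.9, 7.8, 7.5]    # run where the patch is broken

print(curves_match(ce_losses, flce_losses))
print(curves_match(ce_losses, diverged_losses))
```

The same check applies whether the inputs are text only or multimodal, which is why moving the VLMs into a single FLCE-based multimodal file would not lose coverage of the comparison itself.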
Do you remember why the convergence tests covering CE were removed?
@tyler-romero I removed the CE conv test because
By the way, I have some thoughts on improving the monkey patch + conv test. We can schedule a meeting to discuss offline. I will ping you on Discord.
🐛 Describe the bug
https://github.com/linkedin/Liger-Kernel/pull/354
Reproduce
No response
Versions
na