Open octavflorescu opened 1 month ago
They are different. The demo of https://huggingface.co/spaces/Qwen/Qwen2-VL uses the API of Qwen2-VL-Max, but it's the same model behind the scenes.
Thank you for your answer! So is it the same model and same configuration as per the demo in this repo, so Max==7b (not 72b) ?
Max==72b
the hf demo code is here. https://huggingface.co/spaces/Qwen/Qwen2-VL/blob/main/app.py the same setup for the demo is the default setup for api: top_p=0.1, rp=1.1
What is the Qwen2-VL Max HF Demo config? https://huggingface.co/spaces/Qwen/Qwen2-VL
In the demo from this repo, i found the setup for 7B, but is Qwen2-VL-Max the same? Could someone please provide the same setup as for the demo, but in 'tutorial' mode? (not hf demo/worker mode)
Thank you!
e.g., but such that all results will be the same as being run in the hf demo...