codefuse-ai / CodeFuse-MFT-VLM

Apache License 2.0
33 stars 8 forks

The model you provide on modelscope and huggingface is not complete #3

Open lynquantumman opened 9 months ago

lynquantumman commented 9 months ago

The model you provide on modelscope and huggingface only includes the LLM. The cross-attention part and the visual part are missing, so we cannot reproduce your experiments from the checkpoint. I hope you can make the release complete. Also, if the model performs as shown in the PNG image, it would be great work.

Zhazhan commented 8 months ago

Thank you for the reminder! We have added the cross-attention part and the vision encoder part on both modelscope and huggingface. We have also added a script to the code that merges them into a single model, and documented it in the README. We greatly appreciate your support for CodeFuse-VLM!
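
For readers unfamiliar with this kind of merge, here is a minimal sketch of the general idea, not the repository's actual script. It assumes each component (LLM, cross-attention, vision encoder) is available as a flat mapping from parameter name to weights, and it namespaces each component's keys under a prefix. The function name `merge_state_dicts`, the prefixes, and the toy parameter names are all hypothetical; see the README for the real procedure.

```python
from typing import Any, Dict, Mapping


def merge_state_dicts(parts: Mapping[str, Mapping[str, Any]]) -> Dict[str, Any]:
    """Combine component state dicts into one, prefixing keys by component name."""
    merged: Dict[str, Any] = {}
    for prefix, state in parts.items():
        for name, weights in state.items():
            key = f"{prefix}.{name}"
            # Refuse to silently overwrite a parameter that already exists.
            if key in merged:
                raise KeyError(f"duplicate parameter name: {key}")
            merged[key] = weights
    return merged


# Toy stand-ins for the three checkpoints (hypothetical names and values).
llm = {"embed.weight": [0.1, 0.2]}
cross_attn = {"q_proj.weight": [0.3]}
vision = {"patch_embed.weight": [0.4]}

merged = merge_state_dicts(
    {"llm": llm, "cross_attn": cross_attn, "vision_encoder": vision}
)
print(sorted(merged))
```

With real PyTorch checkpoints, the three inputs would typically come from `torch.load` and the merged result would be written with `torch.save`; the key layout the actual model expects may differ from the prefixes assumed here.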