OFA-Sys / OFA-Compress

OFA-Compress is a unified framework that provides finetuning, distillation, and inference for the Hugging Face version of the OFA model, and is committed to making large models more lightweight.
Apache License 2.0

About OFA-tiny zero-shot performance. #3

Open ZhangYuanhan-AI opened 1 year ago

ZhangYuanhan-AI commented 1 year ago

Hello,

Thank you for providing a clear version of the OFA model.

[image]

I have a question regarding the model displayed in the image above. Is this the OFA-tiny model, or is it the OFA-tiny model fine-tuned on Refcoco?

I assume it is the latter, as the OFA-tiny model exhibits poor performance during the zero-shot evaluation on Refcoco.

Thanks for your reply.

faychu commented 1 year ago

Yes, this is the result after finetuning the OFA-tiny model on Refcoco.

ZhangYuanhan-AI commented 1 year ago

Hi,

Thanks for your answer.

Another question: what do the "image_bucket_size" and "token_bucket_size" parameters mean?

https://github.com/OFA-Sys/OFA-Compress/blob/a8618ab5e4ed7613d69af80acec50eb0da9aa34d/ofa/configuration_ofa.py#L130

ZhangYuanhan-AI commented 1 year ago

Could you explain the rationale for using the "sum" of the loss during the pre-training phase?

https://github.com/OFA-Sys/OFA-Compress/blob/a8618ab5e4ed7613d69af80acec50eb0da9aa34d/adaptors.py#L51