Closed MinhDang685 closed 2 years ago
Hi @MinhDang685 , for MLM pretraining we used mesh-tensorflow. The models on HuggingFace are ready for finetuning only.
You don't need to add prefix in finetunning.
Hi @justinphan3110, thanks for your quick reply.
Thanks
@MinhDang685
We have just published an example code for finetunning with huggingface
Can you double-check again if there is still a generated unchanged sequence issue?
Hi @justinphan3110, thanks for your help, I try to generate with the model again and it works now, the output sequences now changes base on the input
I notice that you have updated the model config.json file by removing task specific prefixes, is it the cause of the issue (that I miss the "summarization" prefix before the input to indicate I want the model to perform summarization task)?
@MinhDang685 ,
You need prefix vietnews:
for VietAI/vit5-large-vietnews-summarization
.
For VietAI/vit5-base-vietnews-summarization
you don't need any prefix.
You can have a look over the eval scripts with HuggingFace
thank you @justinphan3110 for pointing that out
Hello VietAI team,
Thanks for sharing the pretrained models in your research paper. I am interested on fine tuning the VietAI/vit5-base language model for the abstractive summarization task. I have some questions:
Thanks a lot