First of all, thanks for this great work !
I'm using ms-swift(2.6.0.post1) and transformers(4.46.3) to LORA-finetune the qwen-2.5-coder-32B model.
The SFT and export process was processed fine but when I try to load the LORA-merged model, the tokenizer will raise "Data did not match any variant of untagged enum" error.
I looked into the merged tokenizer.json file and I found out that the format of "merges" part was changed as follows:
Before SFT:
First of all, thanks for this great work ! I'm using ms-swift(2.6.0.post1) and transformers(4.46.3) to LORA-finetune the qwen-2.5-coder-32B model.
The SFT and export process was processed fine but when I try to load the LORA-merged model, the tokenizer will raise "Data did not match any variant of untagged enum" error. I looked into the merged tokenizer.json file and I found out that the format of "merges" part was changed as follows: Before SFT:
After SFT:
Here are some discussions that may help. https://github.com/huggingface/transformers/issues/30324 https://github.com/unslothai/unsloth/issues/1059