Closed IzzyHibbert closed 1 week ago
Interesting, I am not aware of Phi3 supporting "sysmte" in the prompt. It looks like there is something wrong with the dataset preprocessing. I have done some Phi3 fine-tuning and it works fine for me. FYI https://huggingface.co/mzbac/Phi-3-mini-4k-instruct-function-calling
Interesting, I am not aware of Phi3 supporting "sysmte" in the prompt. It looks like there is something wrong with the dataset preprocessing. I have done some Phi3 fine-tuning and it works fine for me. FYI https://huggingface.co/mzbac/Phi-3-mini-4k-instruct-function-calling
Good point. Time to retry...
Describe the bug Using mlx_lm I made a fine tuned model from
microsoft/Phi-3-mini-128k-instruct
. Training was all fine. AFter it, when I test the tuned model withgenerate
I noticed one issue in the format of the prompt reply, which comes with multiple repetition of a<unk>
like :To Reproduce
My code snippet
The
system_prompt
I am using is the specific one coming frommicrosoft/Phi-3-mini-128k-instruct
, so it's like :The dataset used is
vibhorag101/phr-mental-therapy-dataset-conversational-format-1024-tokens
Expected behavior I made the same fine-tuning for other models, and using the same dataset. I haven't found the "additional tokens" issue.. The expectation is to have the answer only (no other repetition of tokens)
Desktop (please complete the following information):