Nero10578 closed this issue 2 weeks ago
Btw Phi 3.5 trains just fine on axolotl otherwise as long as you upgrade to latest transformers.
Yeah, I can't figure out why the newline doesn't appear after <|end|>. Also, if I don't set eos_token in the config to <|end|>, it keeps putting <|endoftext|> on every turn:
it(372, 372) .(29889, 29889) <|end|>(32007, 32007) <|endoftext|>(32000, 32000) <|user|>(-100, 32010) User(-100, 4911) :(-100, 29901) *(-100, 334)
Hey, the former sounds like a weird bug.
Regarding your double EOS issue: axolotl checks whether the last token is the EOS token and appends it if it isn't found. By setting eos_token in the config to <|end|>, you satisfied that criterion. That doesn't mean the checker is wrong, though. I think the issue arises because fastchat hardcodes the EOS to <|end|>.
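To make the check concrete, here is a rough sketch of the kind of logic described above. It is not axolotl's actual code: the function name `append_eos_if_missing` is made up for illustration, and the token ids are the ones from the debug output earlier in this thread.

```python
# Illustrative sketch only; this is NOT axolotl's actual implementation.
END = 32007        # <|end|>, id taken from the debug output in this thread
ENDOFTEXT = 32000  # <|endoftext|>, id taken from the debug output in this thread


def append_eos_if_missing(input_ids, eos_token_id):
    """Return a copy of input_ids that is guaranteed to end with eos_token_id.

    Hypothetical helper: it mirrors the "check the last token, append EOS
    if it is not there" behaviour described in the comment above.
    """
    if input_ids and input_ids[-1] == eos_token_id:
        return list(input_ids)
    return list(input_ids) + [eos_token_id]


# "it", ".", "<|end|>": the turn as the template already terminates it.
turn = [372, 29889, END]

# EOS left as <|endoftext|>: the check fails, so a second terminator is added.
print(append_eos_if_missing(turn, ENDOFTEXT))  # [372, 29889, 32007, 32000]

# eos_token set to <|end|>: the check passes and nothing is appended.
print(append_eos_if_missing(turn, END))        # [372, 29889, 32007]
```

The first call reproduces the `<|end|> <|endoftext|>` pair seen in the debug output, and the second shows why setting eos_token to <|end|> makes it go away.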
Since we deprecated fastchat, could you try this dataset config instead?
type: chat_template
chat_template: phi_35
Thanks for the reply. Yes chat_templates seems to be much simpler to use and it just works! Thanks!
Expected Behavior
I am training Phi 3.5 and modified FastChat to follow the Phi 3.5 chat template:
Expected behaviour is for the tokenization to show newline tokens after <|end|>.
Current Behaviour
There is no newline after <|end|>
<|end|>(-100, 32007) <|assistant|>(-100, 32001) Bol(8922, 8922)
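For longer dumps, a small script can list what follows each <|end|> instead of checking by eye. This is only a sketch: it assumes the `token(label, input_id)` layout seen in the line above, and the helper names `parse_debug_line` and `tokens_after` are hypothetical, not part of axolotl.

```python
import re

# Matches one "token(label, input_id)" entry, assuming the layout shown above.
ENTRY = re.compile(r"(\S+?)\((-?\d+), (-?\d+)\)")


def parse_debug_line(line):
    """Split a debug line into (token_text, label, input_id) tuples."""
    return [(tok, int(label), int(tid)) for tok, label, tid in ENTRY.findall(line)]


def tokens_after(entries, marker="<|end|>"):
    """Return the token text that directly follows each occurrence of marker."""
    return [
        entries[i + 1][0]
        for i, entry in enumerate(entries[:-1])
        if entry[0] == marker
    ]


line = "<|end|>(-100, 32007) <|assistant|>(-100, 32001) Bol(8922, 8922)"
entries = parse_debug_line(line)

# Shows which token comes right after <|end|>; here it is <|assistant|>,
# so no newline token sits between the two.
print(tokens_after(entries))  # ['<|assistant|>']
```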
Steps to reproduce
pip3 install -e ".[model_worker]" --no-deps
pip3 install -e ".[flash-attn,deepspeed]" --no-deps
python -m axolotl.cli.preprocess lora-sft.yml --debug
on phi-3.5 training dataset.
Config yaml
Possible solution
No response
Python Version
3.11
axolotl branch-commit
0aeb277456f0ed79ab46191a12998fccc257d414