some models fail to prepare tokens: No chat template

ml-explore / mlx-swift-examples

Examples using MLX Swift

MIT License

1.03k stars 110 forks source link

some models fail to prepare tokens: No chat template #150

Open davidkoski opened 3 weeks ago

davidkoski commented 3 weeks ago

Model loaded -> id("mlx-community/phi-2-hf-4bit-mlx")
Error: chatTemplate("No chat template was specified")

For models that have a chat template this is fine, but for those that do not:

I don't see a way to probe for the presence of a chat template
nor a way to provide a default

I think we need to do the config-mutate path and inject a template where needed.

awni commented 3 weeks ago

Base models don't usually have a chat template. Is there something like https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/generate.py#L213C1-L214C1 in Swift Transformers?

davidkoski commented 3 weeks ago

Base models don't usually have a chat template. Is there something like https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/generate.py#L213C1-L214C1 in Swift Transformers?

No, nothing like that: https://github.com/huggingface/swift-transformers/blob/main/Sources/Tokenizers/Tokenizer.swift#L359

Perhaps there should be? @pcuenca @maiqingqiang

johnmai-dev commented 3 weeks ago

Base models don't usually have a chat template. Is there something like https://github.com/ml-explore/mlx-examples/blob/main/llms/mlx_lm/generate.py#L213C1-L214C1 in Swift Transformers?

No, nothing like that: https://github.com/huggingface/swift-transformers/blob/main/Sources/Tokenizers/Tokenizer.swift#L359

Perhaps there should be? @pcuenca @maiqingqiang

Alternatively, it could capture throw TokenizerError.chatTemplate and use tokenizer.encode instead.