Open DreamGenX opened 4 weeks ago
The request is for Mistral Tokenizer V2, similar to your repo for V1 and V3 [1], but based on the V2 tokenizer data: https://github.com/mistralai/mistral-common/blob/main/src/mistral_common/data/mistral_instruct_tokenizer_240216.model.v2
This is tokenizer used by mistral-small-latest, mistral-large-latest.
mistral-small-latest, mistral-large-latest
N/A
I am not familiar with the necessary ocnversion, and it would be great to have the package in the official "Xenova" repo.
Model description
The request is for Mistral Tokenizer V2, similar to your repo for V1 and V3 [1], but based on the V2 tokenizer data: https://github.com/mistralai/mistral-common/blob/main/src/mistral_common/data/mistral_instruct_tokenizer_240216.model.v2
This is tokenizer used by
mistral-small-latest, mistral-large-latest
.Prerequisites
Additional information
N/A
Your contribution
I am not familiar with the necessary ocnversion, and it would be great to have the package in the official "Xenova" repo.