Open JINO-ROHIT opened 2 weeks ago
3.11
-
running the example snippet from tokenization guide throws an error - https://docs.mistral.ai/guides/tokenization/
running this line -
tokenizer = MistralTokenizer.v3(is_tekken=True)
throws an error
UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 32549: character maps to <undefined>
no error
No response
Python -VV
Pip Freeze
Reproduction Steps
running the example snippet from tokenization guide throws an error - https://docs.mistral.ai/guides/tokenization/
running this line -
throws an error
Expected Behavior
no error
Additional Context
No response
Suggested Solutions
No response