Hk669 / bpetokenizer

(Python package) Train your own tokenizer based on the BPE algorithm for LLMs (supports regex patterns and special tokens).
https://pypi.org/project/bpetokenizer/

Updates for the pretrained tokenizers. #11

Closed by Hk669 3 months ago

Hk669 commented 3 months ago

Why are these changes needed?

Related issue number

Checks