Open TenzinGayche opened 3 months ago
Bo_en Tokenizer V1
Description:
Train a bilingual tokenizer with a 32k vocabulary and a low fertility score (few tokens per word on average) using the SentencePiece library.
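
A minimal sketch of how this could look in Python, not the actual training script for this issue. It assumes fertility means the average number of tokens per whitespace-separated word, which may need a different word or syllable segmentation for Tibetan text. The function names, the `bo_en_v1` prefix, the corpus path, `model_type="bpe"`, and `character_coverage=1.0` are all hypothetical choices; only the 32k vocabulary size comes from the description.

```python
from typing import Callable, Iterable, List


def fertility(tokenize: Callable[[str], List[str]], sentences: Iterable[str]) -> float:
    """Average number of tokens produced per whitespace-separated word."""
    n_tokens = 0
    n_words = 0
    for sentence in sentences:
        n_words += len(sentence.split())
        n_tokens += len(tokenize(sentence))
    if n_words == 0:
        raise ValueError("no words found in the evaluation sentences")
    return n_tokens / n_words


def train_bo_en_tokenizer(corpus_path: str, model_prefix: str = "bo_en_v1"):
    """Train a 32k-vocab SentencePiece model and return a loaded processor.

    corpus_path is a hypothetical plain-text file, one sentence per line,
    mixing both languages.
    """
    # Imported lazily so fertility() can be used without sentencepiece installed.
    import sentencepiece as spm

    spm.SentencePieceTrainer.train(
        input=corpus_path,
        model_prefix=model_prefix,
        vocab_size=32000,        # 32k vocabulary, as stated in the description
        model_type="bpe",        # assumption: the issue does not name a model type
        character_coverage=1.0,  # assumption: keep every character in the corpus
    )
    return spm.SentencePieceProcessor(model_file=f"{model_prefix}.model")
```

With a trained model, the score on a held-out set could then be computed as `fertility(lambda s: sp.encode(s, out_type=str), eval_sentences)`, where `sp` is the processor returned above and `eval_sentences` is a hypothetical list of evaluation sentences. Lower values mean words are split into fewer pieces.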
To-dos: