Closed alielfilali01 closed 4 months ago
This issue has been automatically marked as stale because it has not had recent activity. It will be closed if no further activity occurs. Thank you for your consideration.
Closing the issue, since no updates observed. Feel free to re-open if you need any further assistance.
Check before submitting issues
Type of Issue
Other issues
Base Model
None
Operating System
None
Describe your issue in detail
I'am sorry but this more like a question to the team behind this impressive paper, rather than an issue
First thing first, thank you so much for these efforts 🙏🏻, we are working on quite the same thing for Arabic, and would love to see how you guys managed to extend the vocabulary of the original tokenizer without training the tokenizer from scratch again ?
Dependencies (must be provided for code-related issues)
No response
Execution logs or screenshots
No response