A malayalam large language model finetuned on top of open source models
Tokenizer experiments : https://colab.research.google.com/drive/1pcuhGiYX6WAtKKQfDLcVOQjkH_0Z143j?usp=sharing