allmalab / problems

Challenges to solve in Azerbaijani NLP

Foundation model #3

Open: ceferisbarov opened this issue 6 months ago

ceferisbarov commented 6 months ago

There is no high-quality open-source foundation model that was designed with Azerbaijani in mind. Most multilingual models have little or no Azerbaijani text in their training data. ai-forever/mGPT-1.3B-azerbaijan on Hugging Face seems to be the only model that has been trained specifically for Azerbaijani, but it, too, is built on top of the multilingual mGPT model.

We believe that at least two different series of foundation models are necessary:

mammadhajili commented 4 months ago

https://huggingface.co/hajili/roberta-base-azerbaijani