Qwen models - Githubissues

IlyaGusev / rulm

Language modeling and instruction tuning for Russian

Apache License 2.0

438 stars 51 forks source link

Qwen models #40

Open Displacer opened 4 months ago

Displacer commented 4 months ago

According https://huggingface.co/spaces/lmsys/chatbot-arena-leaderboard Qwen1.5 model is one of the best OpenSource (Free) models with large context and Russian language support. It would be nice to see Qwen workflow for fine-tuning and Saiga-Qwen fine-tuned models in rulm.

defdet commented 2 weeks ago

Hey, I've fine-tuned Qwen chat models on Ilya's team's datasets and here's what I got: https://huggingface.co/collections/Defetya/qwen-saiga-66399ab51064a8510843556b