nomic-ai / gpt4all

GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
https://nomic.ai/gpt4all
MIT License
69.01k stars 7.57k forks

Models: Add Yi-1.5-9B-Chat-16K #2750

Open ThiloteE opened 1 month ago

ThiloteE commented 1 month ago

Resolves https://github.com/nomic-ai/gpt4all/issues/176

Adds model support for Yi-1.5-9B-Chat-16K

Description of Model

It is a bilingual model that, at the time of writing, achieves strong benchmark results for its parameter size. It supports a context length of up to 16K tokens.

Personal Impression:

I got the impression that the model is very task-focused, which is why I chose "Below is an instruction that describes a task. Write a response that appropriately completes the request." as the system prompt. I have seen refusals when it was given certain tasks; it has the typical "knows better than the user" vibe and seems to be finetuned with a particular alignment. For instance, roleplay caused refusals, but asking it to write a cover letter was no problem. Its long context and the quality of its responses make it a good model, if you can bear its alignment or your use case happens to fall within the model's originally intended use cases. It will mainly appeal to English- and Chinese-speaking users.

Checklist before requesting a review

manyoso commented 1 month ago

Is this model aimed at Mainland Chinese or Taiwanese users? I'd like the maintainers of our translations for these to have a look.

manyoso commented 1 month ago

Also, we really need a sections key in our models.json so that we don't just have one huge list of models. Then we could overhaul the GUI to provide sections for models that are more specialized, right?
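To make the proposal concrete, here is a minimal sketch of how a GUI could group entries by such a key. It is only an illustration: the "section" field and the section labels are hypothetical (nothing like them exists in models.json yet), the excerpt is reduced to a "name" field, and entries without a section fall back to an assumed default group.

```python
import json
from collections import defaultdict

# Hypothetical models.json excerpt. The "section" key is the field proposed
# in the comment above; it is NOT part of the current models.json format.
MODELS_JSON = """
[
  {"name": "Yi-1.5-9B-Chat-16K", "section": "Bilingual (English/Chinese)"},
  {"name": "Qwen2", "section": "Bilingual (English/Chinese)"},
  {"name": "LLaMa", "section": "General"},
  {"name": "Entry without a section"}
]
"""


def group_by_section(models, default="General"):
    """Group model names by their (hypothetical) "section" key.

    Entries that lack the key are placed in the `default` section, so
    older entries keep working. List order is preserved within a section.
    """
    sections = defaultdict(list)
    for entry in models:
        sections[entry.get("section", default)].append(entry["name"])
    return dict(sections)


if __name__ == "__main__":
    grouped = group_by_section(json.loads(MODELS_JSON))
    for section, names in grouped.items():
        print(f"{section}: {', '.join(names)}")
```

Falling back to a default section keeps the change backwards compatible: existing entries would not need to be touched before the GUI is overhauled.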

ThiloteE commented 1 month ago

Unfortunately, I am not fluent in Chinese. The original model card does not specify whether it targets Mainland Chinese or Taiwanese users.

manyoso commented 1 month ago

@supersonictw can you comment on this model's Chinese abilities? Is it Traditional Chinese or Simplified? Wondering if we should advertise its purported bilingual abilities.

supersonictw commented 1 month ago

Yi is a Simplified Chinese based model. Its developer is called "零一万物" (01.ai). The model is mainly aimed at Mainland China, though the company was founded by a Taiwanese scientist.

supersonictw commented 1 month ago

The model is very friendly for people in Mainland China. But if you want to add more models for Mainland China, ~it's better to add Qwen/Qwen2 models also~ oh, I found they're already added, wow: https://github.com/nomic-ai/gpt4all/pull/2759

People in Taiwan prefer to use LLaMa (or ChatGPT-4, lol 🤪); it's more general and widely accepted. The best Traditional Chinese model might be "TaiwanLLM", but it's not really required. The LLaMa model family is useful enough for us.

manyoso commented 1 month ago

> The model is very friendly for people in Mainland China. But if you want to add more models for Mainland China, ~it's better to add Qwen/Qwen2 models also~ oh, I found they're already added, wow: #2759
>
> People in Taiwan prefer to use LLaMa (or ChatGPT-4, lol 🤪); it's more general and widely accepted. The best Traditional Chinese model might be "TaiwanLLM", but it's not really required. The LLaMa model family is useful enough for us.

This one is larger than the Qwen models so I think it should probably be an addition, right?

ThiloteE commented 1 month ago

If this model is not good enough, I can also try to find a finetune of it, but it is hard to find good finetunes nowadays, since the Hugging Face Open LLM Leaderboard 2 has been quite inactive for weeks/months now.

My motivation for supporting this model specifically:

ThiloteE commented 1 month ago

I will add a PR for Qwen2 as well, and maybe for one of its finetunes too. I think there are more finetunes for Qwen2.

ThiloteE commented 1 month ago

(Three attached images, not reproduced here.)