This has been referenced in #27
One idea for this is that you could allow up to two local models to be loaded and assigned to one or more agents. We could load one model into the GPU and the other into the CPU with some RAM allocated to it: say llama2 on the GPU for most of the agents, and a smaller Python-optimized model on the CPU for the engineer agent (see the sketch after this comment).
In theory, a long-running process could keep both models resident and serve requests from the agents as they arrive. This would let us use big models more efficiently, without accumulating a time penalty for reloading them into VRAM on every swap.
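To make the idea concrete, here is a minimal sketch assuming llama-cpp-python and GGUF model files (both assumptions; ChatDev does not ship this). The model file names and the agent-to-model mapping are hypothetical, just to show one model pinned to the GPU and one to the CPU:

```python
from llama_cpp import Llama

# Big general model kept resident in VRAM for most agents.
gpu_model = Llama(
    model_path="models/llama-2-13b-chat.Q4_K_M.gguf",
    n_gpu_layers=-1,   # offload all layers to the GPU
    n_ctx=4096,
)

# Smaller code-oriented model kept in system RAM for the engineer agent.
cpu_model = Llama(
    model_path="models/codellama-7b-instruct.Q4_K_M.gguf",
    n_gpu_layers=0,    # run entirely on the CPU
    n_ctx=4096,
)

# Hypothetical routing of agent roles to the two resident models.
AGENT_MODELS = {
    "ceo": gpu_model,
    "cto": gpu_model,
    "reviewer": gpu_model,
    "engineer": cpu_model,
}

def agent_completion(role: str, prompt: str) -> str:
    """Generate a reply for the given agent role using its assigned model."""
    model = AGENT_MODELS.get(role, gpu_model)
    result = model(prompt, max_tokens=512, temperature=0.4)
    return result["choices"][0]["text"]
```

Because both models stay loaded in the long-running process, swapping between agents costs nothing beyond the inference itself.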
@andraz @j-loquat your solution and suggestions look good to implement.
One thing to consider with local LLM agents is that we should keep the prompts shorter than for OpenAI and reduce the temperature, perhaps to below 0.5. A lower temperature and shorter prompts make a huge difference in local response times, as observed in the GPT4All project.
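As an illustration, here is a hedged example using llama-cpp-python (the library choice, model file, and exact values are assumptions; the sub-0.5 temperature and terse prompt are just the rule of thumb above):

```python
from llama_cpp import Llama

# Hypothetical local model; any GGUF chat model would do.
llm = Llama(
    model_path="models/llama-2-13b-chat.Q4_K_M.gguf",
    n_gpu_layers=-1,
    n_ctx=2048,
)

# Keep the agent prompt terse compared to the OpenAI-sized prompts.
short_prompt = (
    "You are the Engineer. Write a Python function that parses a CSV file "
    "and returns a list of dicts. Return only the code."
)

# Temperature kept below 0.5 for faster, more deterministic local responses.
result = llm(short_prompt, max_tokens=512, temperature=0.3)
print(result["choices"][0]["text"])
```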
Hello, regarding the use of other GPT models or local models, you can refer to the discussion on our GitHub page: https://github.com/OpenBMB/ChatDev/issues/27. Some of these models have corresponding configurations in this Pull Request: https://github.com/OpenBMB/ChatDev/pull/53. You may consider forking the project and giving them a try. While our team currently lacks the time to test every model, it is worth noting that they have received positive feedback and reviews. If you have any other questions, please don't hesitate to ask. We truly appreciate your support and suggestions. We are continuously working on more significant features, so please stay tuned. 😊
Please add support for connecting Falcon and LLaMA models to this.