cocktailpeanut / dalai

The simplest way to run LLaMA on your local machine
https://cocktailpeanut.github.io/dalai

memory requirements #227

Open mishav78 opened 1 year ago

mishav78 commented 1 year ago

Maybe I'm an idiot, but I have to ask: are the memory requirements below for CPU or GPU RAM?

> Runs on most modern computers. Unless your computer is very very old, it should work.
>
> According to https://github.com/ggerganov/llama.cpp/issues/13, here are the memory requirements:
>
> 7B => ~4 GB
> 13B => ~8 GB
> 30B => ~16 GB
> 65B => ~32 GB
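
For reference, those numbers roughly match 4-bit quantized weights at about half a byte per parameter. A back-of-the-envelope check (a sketch of my own, assuming 4-bit quantization; not from dalai or llama.cpp):

```python
# Back-of-the-envelope RAM estimate for 4-bit quantized LLaMA weights.
# Assumes ~0.5 bytes per parameter; real usage adds context buffers on top.
BYTES_PER_PARAM = 4 / 8  # 4 bits per weight

for params_billion in (7, 13, 30, 65):
    weights_gb = params_billion * BYTES_PER_PARAM
    print(f"{params_billion}B => ~{weights_gb:.1f} GB of weights")
```

That gives 3.5 / 6.5 / 15 / 32.5 GB, close to the table above; the quoted figures presumably round up to leave room for inference buffers.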

trevtravtrev commented 1 year ago

I'm running the 30B Alpaca model, and my memory usage is roughly 78% of my 32 GB of RAM while in use.
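
If you want to check usage yourself, here's a minimal sketch using the third-party psutil package (my own example, not something dalai ships):

```python
import psutil  # third-party: pip install psutil

# Read overall system memory usage while the model is running.
mem = psutil.virtual_memory()
print(f"RAM used: {mem.percent}% of {mem.total / 2**30:.0f} GB total")
```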

mishav78 commented 1 year ago

CPU or GPU RAM?

trevtravtrev commented 1 year ago

My PC memory/RAM. It also uses your CPU. As far as I know, there are currently no configurable settings to use the GPU.

mishav78 commented 1 year ago

Will it work faster with a GPU?

trevtravtrev commented 1 year ago

Please see my edited comment above.

mishav78 commented 1 year ago

Very strange. Don't these models usually use GPUs?

DustinBrett commented 1 year ago

> Very strange. Don't these models usually use GPUs?

This project uses llama.cpp/alpaca.cpp, which "Runs on the CPU".

https://github.com/antimatter15/alpaca.cpp#getting-started-30b

trevtravtrev commented 1 year ago

> Very strange. Don't these models usually use GPUs?

To train them, not to run them.

mishav78 commented 1 year ago

Does it work as well as ChatGPT? Or close?

trevtravtrev commented 1 year ago

I'd say 30B is closing in on about 80% of ChatGPT 3.5; 7B/13B are maybe 60%+.

DustinBrett commented 1 year ago

> I'd say 30B is closing in on about 80% of ChatGPT 3.5; 7B/13B are maybe 60%+.

I'd be interested to know what prompts you've tried and what parameter values (temperature, etc.) you used.

For me, even 30B feels like 10% of what I see with ChatGPT 3.5.
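
For anyone comparing settings: temperature rescales the logits before sampling, so lower values make output sharper and more deterministic. A minimal illustration (my own Python/NumPy sketch, not dalai's actual sampling code):

```python
import numpy as np

def sample_with_temperature(logits, temperature=0.8):
    # Scale logits by temperature, then softmax into probabilities.
    # Lower temperature -> sharper distribution -> more deterministic picks.
    scaled = logits / temperature
    probs = np.exp(scaled - scaled.max())  # subtract max for numerical stability
    probs /= probs.sum()
    return np.random.choice(len(logits), p=probs)

logits = np.array([2.0, 1.0, 0.5])
print(sample_with_temperature(logits, temperature=0.8))
```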