Investigate whether we can use GPU with LlamaV2

khoj-ai / khoj

Your AI second brain. Self-hostable. Get answers from the web or your docs. Build custom agents, schedule automations, do deep research. Turn any online or local LLM into your personal, autonomous AI (e.g gpt, claude, gemini, llama, qwen, mistral).

https://khoj.dev

GNU Affero General Public License v3.0

14.77k stars 735 forks source link

Investigate whether we can use GPU with LlamaV2 #388

Closed sabaimran closed 1 year ago

sabaimran commented 1 year ago

We decided to use the gpt4all library for ease of use and modularity with support for our first offline LLM, but the library does not yet support GPU usage. There's a PR Open to add support for some GPUs, but it hasn't yet merged.

We can look into going lower in the stack and experimenting using the ctransformers library directly to run llama v2 with GPUs.

sabaimran commented 1 year ago

This should be addressed with #363! Closing.