c0sogi / llama-api

An OpenAI-like LLaMA inference API
MIT License
111 stars 9 forks source link

how to run this api in cpu only mode #23

Open delta-whiplash opened 10 months ago

delta-whiplash commented 10 months ago

Hello can someone guide me to run this nice API in CPU mode only

delta-whiplash commented 10 months ago

@c0sogi could you help me to figure how to run it please I love the project but some of my models are too big for my gtx 1080