Open LoggeL opened 11 months ago
Very interesting project 😄 Thanks for doing Do you have a rough estimation on the hardware required and the speed that you get with certain hardware?
Edit: Found the RAM requirements but don't have the hardware to judge the speed. https://huggingface.co/TheBloke/WizardCoder-15B-1.0-GGML#provided-files
I use the 4090 for it. Speed-wise, I think you're better off using extllama or exllama2, which have the best speed with GPU acceleration. https://github.com/turboderp/exllama https://github.com/turboderp/exllamav2
Very interesting project 😄 Thanks for doing Do you have a rough estimation on the hardware required and the speed that you get with certain hardware?
Edit: Found the RAM requirements but don't have the hardware to judge the speed. https://huggingface.co/TheBloke/WizardCoder-15B-1.0-GGML#provided-files