Closed LarryHawkingYoung closed 5 hours ago
What model variant did you use and what GPU did you use?
My guess is that you are running out of GPU memory: the unquantized 7B model is unlikely to fit on a 16 GB GPU. Try either the quantized 7B model or a 2B model, which should fit.
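As a rough back-of-the-envelope check of the memory guess above, the weights alone take about (number of parameters) x (bytes per parameter). The sketch below assumes nominal parameter counts of exactly 7e9 and 2e9 and typical storage widths (4 bytes for float32, 2 for float16/bfloat16, 1 for int8, 0.5 for 4-bit). The real checkpoints may differ, and activations, the KV cache, and framework overhead come on top of these numbers.

```python
def weight_memory_gb(num_params: float, bytes_per_param: float) -> float:
    """Approximate memory for model weights only, in decimal GB."""
    return num_params * bytes_per_param / 1e9


# Assumed storage widths in bytes per parameter (illustrative, not from the thread).
PRECISIONS = {"float32": 4.0, "float16/bfloat16": 2.0, "int8": 1.0, "4-bit": 0.5}

if __name__ == "__main__":
    # Nominal sizes; actual parameter counts of a given checkpoint may be higher.
    for label, params in (("7B", 7e9), ("2B", 2e9)):
        for name, width in PRECISIONS.items():
            print(f"{label} {name}: ~{weight_memory_gb(params, width):.1f} GB of weights")
```

With these assumptions, a 7B model in 16-bit precision already needs roughly 14 GB for weights alone, which leaves very little headroom on a 16 GB card, while an 8-bit or 4-bit 7B model or a 16-bit 2B model leaves far more room.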
Hi @LarryHawkingYoung,
Could you please confirm whether the comment above resolves this issue for you? If it does, please feel free to close the issue.
Thank you.
Hi @LarryHawkingYoung,
Closing this issue due to lack of recent activity. Please feel free to reopen it if this is still a valid request.
Thank you!
When I run
It returns the error:
docker: Error response from daemon: could not select device driver "" with capabilities: [[gpu]].
while if I run on CPU with command:
It works out OK.
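For anyone hitting the same message: Docker commonly reports it when it cannot find an NVIDIA container runtime on the host, for example when the NVIDIA Container Toolkit is not installed. That cause is an assumption here, since the thread never confirms it. A minimal sketch of a host-side check follows; it only looks for the `docker`, `nvidia-smi`, and `nvidia-container-runtime` executables on `PATH` and does not prove that the GPU setup actually works. The helper name `missing_gpu_tools` is hypothetical.

```python
import shutil
from typing import Callable, List, Optional

# Executables typically present on a host that can run GPU containers.
# Assumption: they are installed under these names and are on PATH.
REQUIRED_TOOLS = ("docker", "nvidia-smi", "nvidia-container-runtime")


def missing_gpu_tools(
    which: Callable[[str], Optional[str]] = shutil.which,
) -> List[str]:
    """Return the names of required executables that cannot be found on PATH."""
    return [tool for tool in REQUIRED_TOOLS if which(tool) is None]


if __name__ == "__main__":
    missing = missing_gpu_tools()
    if missing:
        print("Not found on PATH:", ", ".join(missing))
    else:
        print("docker, nvidia-smi and nvidia-container-runtime were all found.")
```

If `nvidia-container-runtime` is reported as missing, installing the NVIDIA Container Toolkit and restarting the Docker daemon is the usual next step to try.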