Open HiddenAlaska opened 8 months ago
Description of the bug
Cannot load the model in half precision. I also haven't figured out how to move the model to the CPU or GPU.
To Reproduce
Run the gpt-j-6B model as in the demo, using the local Hugging Face method.
Expected behavior
The model returns a response.
Error Logs/Screenshots
requests.exceptions.HTTPError: {'message': '"LayerNormKernelImpl" not implemented for \'Half\''}
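For context, this error typically appears when a float16 ("Half") model runs on the CPU, where some PyTorch builds have no half-precision LayerNorm kernel. Below is a minimal sketch of one possible workaround, not the demo's actual loading code: use float16 only when a GPU is available and fall back to float32 on the CPU. The helper names `choose_device_and_dtype` and `load_model` are hypothetical, and the Hub id `EleutherAI/gpt-j-6B` is an assumption about which checkpoint the demo uses.

```python
from typing import Tuple


def choose_device_and_dtype(cuda_available: bool) -> Tuple[str, str]:
    """Pick a (device, dtype name) pair: float16 on GPU, float32 on CPU."""
    if cuda_available:
        return "cuda", "float16"
    # Half precision on CPU is what triggers the LayerNormKernelImpl error,
    # so fall back to full precision there.
    return "cpu", "float32"


def load_model(model_name: str = "EleutherAI/gpt-j-6B"):
    """Hypothetical loader; the model id is an assumption, not from the demo."""
    # Imported lazily so the helper above works without these packages installed.
    import torch
    from transformers import AutoModelForCausalLM, AutoTokenizer

    device, dtype_name = choose_device_and_dtype(torch.cuda.is_available())
    tokenizer = AutoTokenizer.from_pretrained(model_name)
    model = AutoModelForCausalLM.from_pretrained(
        model_name,
        torch_dtype=getattr(torch, dtype_name),  # torch.float16 or torch.float32
    )
    # Move the weights to the selected device.
    model = model.to(device)
    return tokenizer, model, device
```

Calling `load_model()` on a machine without a GPU would load the weights in float32, which needs roughly twice the memory of float16 but avoids the unsupported half-precision kernel.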
Environment (please complete the following information)
Thanks in advance.