Closed aleskubicek closed 9 months ago
Adding Llama2 HuggingFace dependencies to allow multi-GPU inference and quantization:
Closes #6
Adding Llama2 HuggingFace dependencies to allow multi-GPU inference and quantization:
Closes #6