aleskubicek closed this 1 year ago
Adds Llama2 HuggingFace dependencies to allow multi-GPU inference and quantization.
Closes #6