turboderp/exllama
A more memory-efficient rewrite of the HF transformers implementation of Llama for use with quantized weights.
MIT License
Does it support the safetensors format?
#309 (Open), opened by lucasjinreal 7 months ago
lucasjinreal commented 7 months ago
Does it support the safetensors format?