tloen / llama-int8

Quantized inference code for LLaMA models
GNU General Public License v3.0
1.05k stars 105 forks source link

Any chance to share quantized int8 7B and 13B models? #6

Open progressionnetwork opened 1 year ago

progressionnetwork commented 1 year ago

Any chance to share quantizes weights of 7B and 13B models?