Closed fengyang95 closed 1 day ago
+1
Which coding datasets are good for this specific model?
Which coding datasets are good for this specific model?
I'm not an expert; perhaps the pile dataset is enough
Which coding datasets are good for this specific model? May I ask how much resources are needed to quantize such a large model with over 200b parameters?Using a private training dataset should yield better results, right?
Which coding datasets are good for this specific model?
May I ask if you have any plans to do this quantification in the near future?
I will release a quantized version of the model as soon as I have time to do it.
This cost me about $110. Hope it suffices. I only ran a test of perplexity so far which landed at 5.325. https://huggingface.co/casperhansen/deepseek-coder-v2-instruct-awq
I noticed that deepseek-v2 is already supported. Could you please release deepseek-coder-v2-instruct-awq to Hugging Face?