Open htcml opened 1 year ago
Are you able to add quantization code so that the model can be run on a smaller GPU?
Are you able to add quantization code so that the model can be run on a smaller GPU?