Open RobotSail opened 5 months ago
We currently only support 4-bit quantization via BitsAndBytes. We should support other options such as 8-bit, (potentially) 6-bit, etc.
This issue has been automatically marked as stale because it has not had activity within 90 days. It will be automatically closed if no further activity occurs within 30 days.
We currently only support 4-bit quantization via BitsAndBytes. We should support other options such as 8-bit, (potentially) 6-bit, etc.