aaronbolton closed 5 hours ago
Keeping an eye on the community thread for this; as long as it keeps working well for y'all, I'll merge tomorrow. There's no negative impact if the new env var is missing: it just keeps defaulting to a num_ctx of 32768. Cheers!
Adding DEFAULT_NUM_CTX to make context sizes easier to adjust
DEFAULT_NUM_CTX defaults to 32768 but can be adjusted in .env.local. This also adds some example values to .env.example.
Example Context Values for qwen2.5-coder:32b
32768 # Consumes 36GB of VRAM
24576 # Consumes 32GB of VRAM
12288 # Consumes 26GB of VRAM
6144 # Consumes 24GB of VRAM
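For anyone wondering how the fallback behaves, here is a rough sketch of the idea, not the actual code from this PR. The helper name `resolveNumCtx` and the constant `FALLBACK_NUM_CTX` are made up for illustration; only `DEFAULT_NUM_CTX` and the 32768 default come from the description above. Falling back on invalid values is an assumption of this sketch.

```typescript
// Hypothetical helper (not from the PR): picks the context size from an
// environment-like object, falling back to 32768 when DEFAULT_NUM_CTX is
// missing, empty, or not a positive integer.
const FALLBACK_NUM_CTX = 32768;

function resolveNumCtx(env: Record<string, string | undefined>): number {
  const raw = env.DEFAULT_NUM_CTX;

  // Missing or blank variable: keep the existing default.
  if (raw === undefined || raw.trim() === "") {
    return FALLBACK_NUM_CTX;
  }

  // parseInt reads the leading digits, so a value such as "12288" works.
  const parsed = Number.parseInt(raw, 10);

  // Assumption: invalid or non-positive values also fall back to the default.
  return Number.isInteger(parsed) && parsed > 0 ? parsed : FALLBACK_NUM_CTX;
}
```

In a Node-style setup this could be called as `resolveNumCtx(process.env)`, with the result passed along as the `num_ctx` value for the model.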