Searching for models gives no option to select the quant size and seems to favour f32 which is a bit overkill IMO 🤣, searching with a specific quant size returns a completely different model
Steps to Reproduce
Search for Phi 3.1
Returns results for F32
No option to select the quant type/size
Search for Phi 3.1 Q6_K
Note that it returns results for llama3?
Expected Behaviour
Be able to search for a model and have it suggest a reasonable quant size (e.g. Q6_K, Q5_K_M and Q4_K_M) by default.
Be able to search for a model and it's quant size and have it return that model and quant size (rather than a different model entirely)
Bug Report
Searching for models gives no option to select the quant size and seems to favour f32 which is a bit overkill IMO 🤣, searching with a specific quant size returns a completely different model
Steps to Reproduce
Expected Behaviour
Be able to search for a model and have it suggest a reasonable quant size (e.g. Q6_K, Q5_K_M and Q4_K_M) by default.
Be able to search for a model and it's quant size and have it return that model and quant size (rather than a different model entirely)
Your Environment