For llama, when the GPTQ method is chosen for quantization, has its Hessian-based act-order been disabled?
I found some modifications in the LDLQ method, in the function 'round_vecbal_Hsort', including Hdiag.sort. But when GPTQ is chosen, the method used is the class GPTQ, not Balance, and Balance is the one related to LDLQ.
Despite the above, with your pre/post-processing, GPTQ can still achieve a better result.
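
To be concrete about what I mean by act-order, here is a minimal sketch in plain Python. It is not code from this repo: the function names (actorder_permutation, invert_permutation, permute_columns) and the toy values are hypothetical, and I am assuming act-order means processing weight columns in order of descending Hessian diagonal and restoring the original order afterwards. A real implementation would also apply the same permutation to the rows and columns of the Hessian.

```python
def actorder_permutation(h_diag):
    """Column indices ordered by descending Hessian diagonal (hypothetical helper)."""
    return sorted(range(len(h_diag)), key=lambda i: h_diag[i], reverse=True)


def invert_permutation(perm):
    """Inverse permutation, used to restore the original column order."""
    inv = [0] * len(perm)
    for new_pos, old_pos in enumerate(perm):
        inv[old_pos] = new_pos
    return inv


def permute_columns(matrix, perm):
    """Reorder the columns of a list-of-rows matrix according to perm."""
    return [[row[j] for j in perm] for row in matrix]


# Toy example: 2 output rows, 3 input columns (values are made up).
h_diag = [0.5, 2.0, 1.0]
W = [[1.0, 2.0, 3.0], [4.0, 5.0, 6.0]]

perm = actorder_permutation(h_diag)
W_sorted = permute_columns(W, perm)

# ... column-by-column quantization of W_sorted would happen here ...

W_restored = permute_columns(W_sorted, invert_permutation(perm))
```

My question is whether the GPTQ class still performs a reordering of this kind, or whether only the LDLQ path ('round_vecbal_Hsort') does.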