MilesQLi closed this issue 8 months ago
Hi, these updates are all precomputed by the Cholesky decomposition; see "Step 3: Cholesky Reformulation" in our paper.
Step 3 in the paper is not specific: it gives no formula showing how the Cholesky decomposition is used to carry out the updates in (5) and (4), which makes this step very hard to understand. Is there a clearer description of this part?
I did the derivation myself.
After each quantization step, H_inv should be updated, but in the fasterquant code, H_inv is never updated inside the loop. Is this a bug?
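A small numerical check may help with this question. It is only a sketch in NumPy and is not taken from the paper or the repository. It rests on the standard fact that removing index q from a symmetric positive definite matrix A with the rank-one update A - A[:, q] A[q, :] / A[q, q] is one step of the outer-product form of the Cholesky factorization. So if H_inv = U^T U with U upper triangular, the row q that the explicit update would produce at step q equals U[q, q] * U[q, q:]. If fasterquant reads its rows from such a factor (an assumption about the code, not verified here), then H_inv does not need to be updated in the loop. The names H, Hinv, U, A and max_err below are illustrative only.

```python
import numpy as np

# Build a small symmetric positive definite Hessian (random, illustrative data).
rng = np.random.default_rng(0)
n = 6
X = rng.standard_normal((n, 4 * n))
H = X @ X.T + 0.01 * np.eye(n)
Hinv = np.linalg.inv(H)

# np.linalg.cholesky returns the lower factor L with Hinv = L @ L.T,
# so U = L.T is the upper factor with Hinv = U.T @ U.
U = np.linalg.cholesky(Hinv).T

# Apply the explicit update one index at a time and compare the row used
# at step q with the corresponding row of the precomputed factor.
A = Hinv.copy()
max_err = 0.0
for q in range(n):
    explicit_row = A[q, q:]            # row q of the currently updated inverse
    cholesky_row = U[q, q] * U[q, q:]  # same row, read from the factor
    max_err = max(max_err, np.abs(explicit_row - cholesky_row).max())
    # Eliminate index q: row and column q of A become zero.
    A = A - np.outer(A[:, q], A[q, :]) / A[q, q]

print(max_err)
```

The printed difference should be at the level of floating point rounding. Since the updated diagonal entry at step q is U[q, q]^2, the ratio row / diagonal that the weight update needs reduces to U[q, q:] / U[q, q], which is available directly from the factor.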