Closed patelprateek closed 11 months ago
Thanks, these are great questions!
vectorwise_dequant
does the dequantization where A is row-wise normalized and B is tensor-wise normalized. I think this function is obsolete and should be replaced by vectorwise_mm_dequant
for the case row
vectorwise_mm_dequant
does the dequantization after the matmul for row-wise A and column-wise BCB
is the row-major Int8 matrix. SCB
holds the state of the CB
tensor, which are normalization/quantization statistics. Let me know if you have any more questions!
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Hi , i was going though the code base here and have few questions
https://github.com/TimDettmers/bitsandbytes/blob/6bc2b992be0bb7511ea881f8ebbbd2ba7f1b5109/bitsandbytes/functional.py#L1833 :
vectorwise_quant
: implements different quantization types1) are
row
andvector
same type of quantization ? what is the difference ? from code it seems same2)
vectorwise_dequant
: this is only implemented for "vector" , and returns None for all other quantization ? Is this correct or just not implemented ?3)
vectorwise_mm_dequant
: i see some params like S1, S2 but nt sure what they mean and how it differs from vectorwise_quant , any guidance here will be helpful4) in test_modules i observe
quant
which is different thanvectorwise_quant
, any reason ? i see the test code implementsmin-max
but thevectorwise_quant
doesnt implement . Similarlydequant
in test_module takes parameters S1 and S2 which i dont really know how it relates toquant
. Usually i would expect something produced by quantization method can be dequantized , similar to encode decode or serialize deserialize apis .5)
double_quant
: what exactly are row_stats and col_stats here and similarlyreturn out_row, out_col, row_stats, col_stats, coo_tensor
, could you please elaborate on the outputs here ?6) in
Int8Params
class i observeCB, CBt, SCB, SCBt, coo_tensorB = bnb.functional.double_quant(B)
, could you please help in elaborating a bit on whatCB
andSCB
param meanThanks