hatchetProject / QuEST

QuEST: Efficient Finetuning for Low-bit Diffusion Models

Some questions about the code #9

Closed: mason5957 closed this issue 1 month ago

mason5957 commented 1 month ago

I apologize for any inconvenience, and I have a few questions regarding the code snippet:

[screenshot of the code snippet in question]
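Since the original screenshot is not reproduced here, the following is a rough sketch of the structure the questions refer to, reconstructed only from the identifiers quoted below. The helper name `alignment_error` and the assumed layout of the cached dictionaries are placeholders, not the repository's actual code:

```python
# Illustrative reconstruction only -- inferred from the quoted identifiers,
# not copied from the QuEST repository.
def alignment_error(quant_layer, activation_fp, activation, loss_func):
    """activation_fp / activation: dicts of cached FP tensors (assumed layout)."""
    err = 0.0
    # Process one sample at a time rather than the whole batch (question 1).
    for i in range(activation_fp["input"][0].shape[0]):
        inp = activation_fp["input"][0][i].unsqueeze(0).cuda()
        output_quant = quant_layer(inp)
        # The same output-alignment term is accumulated once per cached key
        # (question 3); the per-key losses themselves are omitted in this sketch.
        for k in activation.keys():
            err += loss_func(output_quant, activation_fp["output"][i].unsqueeze(0).cuda())
    return err
```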

  1. Why did you choose to calculate each sample individually using the `for i in range(activation_fp["input"][0].shape[0])` loop instead of computing the entire batch at once?

  2. Could you please clarify the purpose of calculating `head`? In my testing so far, `head` always seems to be 1 (currently I've only tested with ImageNet, not with other models).

  3. I noticed that `err += loss_func(output_quant, activation_fp["output"][i].unsqueeze(0).cuda())` is repeated once for every key in `for k in activation.keys():`. Is there a specific reason for this repetition, or does it have some other significance?

I would greatly appreciate it if you could take some time to respond to these queries. Thank you very much for your assistance.

hatchetProject commented 1 month ago

Sorry about the confusion in this part. Regarding each question:

  1. This is because of limited GPU memory. If your memory is large enough, you can choose a larger batch size.
  2. `head` is not 1 when aligning other activations (e.g. q@k, v). But since we currently do not align those, it is indeed 1 and you can ignore it.
  3. This is to align the loss scale with the line above (to avoid it becoming too large). You can certainly compute that term independently and multiply it by `len(activation.keys())` instead; a minimal sketch of that variant follows below. We didn't tune the hyper-parameters closely, so feel free to change them.
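For concreteness, here is a minimal sketch of the batched variant suggested in points 1 and 3, assuming GPU memory is sufficient: compute the output-alignment loss once over the whole batch and rescale it by the number of activation keys. The helper name and the assumed tensor layout of `activation_fp["output"]` are placeholders, not the repository's code.

```python
# Minimal sketch of the suggested alternative -- placeholder names, not repo code.
def alignment_error_batched(quant_layer, activation_fp, activation, loss_func):
    # Whole batch at once: only viable when GPU memory allows (point 1).
    inp = activation_fp["input"][0].cuda()
    output_quant = quant_layer(inp)
    # Assumes the cached FP outputs are a single tensor stacked along the batch
    # dimension; if they are stored as a list, stack them first.
    target = activation_fp["output"].cuda()
    # Compute the output term once and rescale it to match the per-key sum (point 3).
    return loss_func(output_quant, target) * len(activation.keys())
```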