OpenGVLab / OmniQuant

[ICLR2024 spotlight] OmniQuant is a simple and powerful quantization technique for LLMs.
MIT License
626 stars 49 forks source link

question about let #85

Open mxjmtxrm opened 2 weeks ago

mxjmtxrm commented 2 weeks ago

Hi, Why exclude down_proj when executing a let? like model.mlp.down_proj.temp_weight = model.mlp.down_proj.weight. I think the down_proj can be smoothed with up_proj, like smooth_fc_fc_temporary(model.mlp.up_proj,model.mlp.down_proj, model.down_smooth_scale, model.down_smooth_shift) Am I right?