Closed by guotong1988 1 year ago
Thank you very much!
This is the key file: https://github.com/FMInference/FlexGen/blob/9d092d848f106cd9eaf305c12ef3590f7bcb0277/flexgen/flex_opt.py#L582
You can implement something similar for your own model.
Thank you very much!