Closed AISuperMa closed 1 year ago
https://github.com/FMInference/FlexGen/blob/25438f6bc3507e1fbd6f88e4812beb2a102d7315/flexgen/flex_opt.py#L1162
I think all OPT models share the same tokenizer.
https://github.com/FMInference/FlexGen/blob/25438f6bc3507e1fbd6f88e4812beb2a102d7315/flexgen/flex_opt.py#L1162