Open xiezhq-hermann opened 1 year ago
Reopen https://github.com/FMInference/FlexGen/pull/71 which was closed by mistake. Minimal modification to extend FlexGen to CPU and M1/M2 GPU platforms.
Reopen https://github.com/FMInference/FlexGen/pull/71 which was closed by mistake. Minimal modification to extend FlexGen to CPU and M1/M2 GPU platforms.