FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
Apache License 2.0
9.21k stars, 547 forks
FlexGen for Data wrangle Tasks. #91 (Closed)
BinhangYuan closed 1 year ago