FMInference / FlexLLMGen

Running large language models on a single GPU for throughput-oriented scenarios.
Apache License 2.0
9.21k stars 547 forks source link

Is this support korean?? #22

Closed waikoreaweatherpjt closed 1 year ago

waikoreaweatherpjt commented 1 year ago

I am afraid of non supporting.

shlyakpavel commented 1 year ago

FlexGen is a high-throughput generation engine for running large language models. I suppose that depends on the model