issues
search
FMInference
/
FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
Apache License 2.0
9.21k
stars
549
forks
source link
Fixed 'significantly' typo
#139
Closed
sunchaesk
closed
2 months ago
sunchaesk
commented
4 months ago
fixed typo
fixed typo