issues
search
FMInference
/
FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
Apache License 2.0
9.22k
stars
549
forks
source link
This project is software or hardware?
#12
Closed
guotong1988
closed
1 year ago
guotong1988
commented
1 year ago
Thank you very much!
merrymercy
commented
1 year ago
It is a piece of software based on PyTorch
Thank you very much!