FMInference/FlexLLMGen
Running large language models on a single GPU for throughput-oriented scenarios.
Apache License 2.0
Update docs/paper.md
#102
Closed
shotarok
closed
1 year ago
shotarok
commented
1 year ago
What
Update paper.md to add the link to https://arxiv.org/abs/2303.06865
Why
It took me a while to find the above link.
If something is wrong, please feel free to close this PR. Thanks.