AnswerDotAI/cold-compress
Cold Compress is a hackable, lightweight, and open-source toolkit for creating and benchmarking cache compression methods built on top of GPT-Fast, a simple, PyTorch-native generation codebase.
https://www.answer.ai/posts/2024-08-01-cold-compress.html
BSD 3-Clause "New" or "Revised" License
85 stars
8 forks
Merge latest version of main branch into gist tokens #20
Closed (uSaiPrashanth closed 4 months ago)