issues
search
yale-sys
/
prompt-cache
Modular and structured prompt caching for low-latency LLM inference
MIT License
14
stars
1
forks
source link
Dev sslee
#8
Closed
shsym
closed
8 months ago
shsym
commented
8 months ago
Reorganization for test cases
Size of document (to be cached) is limited to 2560 characters
Other minor bug/typofixes