Open rasbt opened 3 months ago
In addition to KV caching, it makes sense to also add prompt caching. That is, instead of recomputing the system prompt for every request, we can cache its prefilled state and reuse it to avoid the recalculation.
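To make the idea concrete, here is a rough sketch of what I have in mind. It is only an illustration under stated assumptions, not a proposed implementation: `PromptCache`, `toy_prefill`, and `stats` are hypothetical names, and the "state" is a plain list standing in for the per-layer key/value tensors a real model would produce during prefill.

```python
from typing import Callable, Dict, List, Tuple

# Stand-in for the per-layer key/value tensors (assumption for illustration).
KVState = List[int]


class PromptCache:
    """Caches the prefilled state of a prompt prefix, keyed by its token IDs."""

    def __init__(self, prefill_fn: Callable[[KVState, List[int]], KVState]):
        self.prefill_fn = prefill_fn
        self._store: Dict[Tuple[int, ...], KVState] = {}
        self.hits = 0
        self.misses = 0

    def get_or_prefill(self, prefix_ids: List[int]) -> KVState:
        key = tuple(prefix_ids)
        if key in self._store:
            self.hits += 1
        else:
            # First time we see this prefix: run prefill once and keep the result.
            self.misses += 1
            self._store[key] = self.prefill_fn([], prefix_ids)
        # Return a copy so callers cannot modify the cached state in place.
        return list(self._store[key])


# Hypothetical prefill: "processes" tokens and counts how many it touched.
stats = {"processed": 0}


def toy_prefill(state: KVState, token_ids: List[int]) -> KVState:
    stats["processed"] += len(token_ids)
    return state + [t * 2 for t in token_ids]


cache = PromptCache(toy_prefill)
system_ids = [1, 2, 3, 4]  # tokenized system prompt, shared by all requests

for user_ids in ([10, 11], [20]):
    state = cache.get_or_prefill(system_ids)  # reused after the first request
    state = toy_prefill(state, user_ids)      # only the new tokens are processed

print(stats["processed"], cache.hits, cache.misses)
```

In this toy run the system prompt is prefilled only once, so the second request processes just its own user tokens. With real tensors, the cached state would need to be cloned or extended non-destructively so that one request cannot overwrite the shared prefix.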
@rasbt is this open for contribution? If so, can you guide me through?