Open jorgeantonio21 opened 5 months ago
It seems that clearing cache on current Falcon model implementation is currently not working properly. Every time a second query is run, the cache is not cleared.
Actually there was no way to flush the kv cache in falcon, I've added a function for this in #2066 , you should call it on further queries.
Thanks @LaurentMazare !
It seems that clearing cache on current Falcon model implementation is currently not working properly. Every time a second query is run, the cache is not cleared.