Open hnyls2002 opened 1 month ago
When the chunked prefill size is too small and the prefill length is too long, python would reach maximum recursion depth in the radix cache.
Same as proposed in #154
Hello @hnyls2002 please assign this to me
I'm just wondering, why did you choose to Implement radix trie in python rather than use off the shelf library?
Motivation
When the chunked prefill size is too small and the prefill length is too long, python would reach maximum recursion depth in the radix cache.
Same as proposed in #154