Closed 3outeille closed 10 months ago
Refactors and fixes #25
Fix some bug where we need to enlarge kv cache after enlarging rotary embeding frequency table so that flash_attn_with_kvcache don't overwrite position_idx
flash_attn_with_kvcache
position_idx
xDD
Refactors and fixes #25
Fix some bug where we need to enlarge kv cache after enlarging rotary embeding frequency table so that
flash_attn_with_kvcache
don't overwriteposition_idx