SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
Apache License 2.0
2.75k
stars
176
forks
source link
AttributeError: module 'flashinfer' has no attribute 'batch_prefill_with_paged_kv_cache' #533
Description:
When running the 'test_batch_prefill_with_paged_kv_cache' test in test_flashinfer.py, an AttributeError occurs, indicating that the flashinfer module has no attribute 'batch_prefill_with_paged_kv_cache'.
Suggested Fix:
Update the test test_batch_prefill_with_paged_kv_cache in test_flashinfer.py to use the correct method.
Description: When running the 'test_batch_prefill_with_paged_kv_cache' test in test_flashinfer.py, an AttributeError occurs, indicating that the flashinfer module has no attribute 'batch_prefill_with_paged_kv_cache'.
Suggested Fix: Update the test test_batch_prefill_with_paged_kv_cache in test_flashinfer.py to use the correct method.