Open BirdChristopher opened 5 hours ago
Hi, I'm pretty interested in the flexattention-like api mentioned in October lmsys meetup. But I can't find any introduction related. Is there anything I miss in documentation?
You can try https://github.com/flashinfer-ai/flashinfer/blob/main/tests/test_jit_example.py at this moment.
The APIs are subject to changes because this feature is still experimental.
Hi, I'm pretty interested in the flexattention-like api mentioned in October lmsys meetup. But I can't find any introduction related. Is there anything I miss in documentation?