Open LiuZhetan opened 1 week ago
I did modify the original sglang code for experiments & for providing profiling numbers. You would have to install from source if you want to run it.
The flashinfer version on the main branch I believe might also be from an older version v0.3 as well. I have some profiling numbers on the latest version, but haven't updated the main branch.
I try to use preble to deploy a model by sglang, but get an error:
I found that the built-in sglang version is v0.1.16, but there is no GPUConfig class in the official v0.1.16 code, and I did not find it in the previous version either.
So did the author modify the original sglang code?
My environment: