flexflow / FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving
https://flexflow.readthedocs.io
Apache License 2.0
1.59k stars, 218 forks

Request for Graph Pruning Algorithm Code Location in FlexLLM #1425

Open zbtrs opened 1 week ago

zbtrs commented 1 week ago

I read your FlexLLM paper and am very interested in the Graph Pruning algorithm described in Section 5.2. However, I couldn't find the corresponding code in the peft branch of this repository. Could you please point me to where it is implemented?