flexflow / FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving
https://flexflow.readthedocs.io
Apache License 2.0
1.59k stars 218 forks source link

Local Backing: Gradient Tensor Allocation #1415

Open reyna-abhyankar opened 2 weeks ago