flexflow / FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving
https://flexflow.readthedocs.io
Apache License 2.0
1.59k stars, 218 forks

Implement `is_valid` (or remove it) in parallel shape inference #1405

Open lockshaw opened 1 month ago