flexflow / FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving
https://flexflow.readthedocs.io
Apache License 2.0
1.59k stars, 218 forks

Fix `cudnnSetTensorDescriptorFromArrayShape` #1421

Open. reyna-abhyankar opened 1 week ago

reyna-abhyankar commented 1 week ago

Remove parallel dim assumption