flexflow / FlexFlow

FlexFlow Serve: Low-Latency, High-Performance LLM Serving
https://flexflow.readthedocs.io
Apache License 2.0
1.67k stars 223 forks

Update CUDA toolchain version #1399

Closed oOTigger closed 3 months ago

oOTigger commented 4 months ago

Update Flake files to support Sapling CUDA environment

