Disable fp8 - Githubissues

replicate / cog-flux

Cog inference for flux models

https://replicate.com/black-forest-labs/flux-dev

Apache License 2.0

272 stars 28 forks source link

Disable fp8 #29

Closed daanelson closed 3 weeks ago

daanelson commented 3 weeks ago

Disabling fp8 compilation & go_fast for GPUs that don't support _scaled_mm - which is all GPUs w/compute capability < 8.9.

This way folks can still pull and run the model w/o issue. Defaulting to just running the model slow if a user attempts to run the fast model on the wrong GPU for ease of use.