replicate / cog-flux

Cog inference for flux models
https://replicate.com/black-forest-labs/flux-dev
Apache License 2.0
272 stars 28 forks source link

Make fp8 work on older GPUs #34

Open yorickvP opened 2 weeks ago

yorickvP commented 2 weeks ago

Tested on RTX A5000.

daanelson commented 2 weeks ago

@yorickvP this is great, making 8-bit inference work regardless of hardware is useful. That said, when I run cog predict -i prompt=<whatever> on an A40, it takes ~10 minutes to compile and then there's no output. can you take a look? wary of pushing a broken path here