Closed gregorym closed 8 months ago
Each Replicate API request will be queued. This is different to queuing within ComfyUI itself. So if you make 100 API predictions, you will be charged only for the time it takes to generate each of them, not the queue time.
Essentially there is no queue time in ComfyUI, just the Replicate API queue.
Does that make sense?
Perfect, TY.
Small follow up to this, I have added some changes to guarantee that the ComfyUI server has no running workflows or queues at the start of any prediction:
https://github.com/fofr/cog-comfyui/commit/8ae95f53ca463d54216e3406fbe3769e122a0515
Let's say I create 100 predictions using the Replicate API, how does it impact the run time cost?
Does the queue time in comfyui is counted toward the Replicate run time?