ycfreeman closed 1 week ago
cloud hosting
self hosting
@Rzz8 suggested Hugging Face Inference
After digging through it a bit, it seems to actually be relatively cheap and easy to use.
I paid for the PRO tier, and it has worked without much issue so far.
https://huggingface.co/docs/api-inference/en/getting-started
Will stick with this for now.
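For reference, a minimal sketch of what a call could look like, using only the Python standard library. The endpoint URL, the `{"inputs": ...}` payload shape, the `HF_TOKEN` variable name and the `some-org/some-model` id are assumptions for illustration, not something confirmed in this thread; check the docs linked above for the current endpoint and per-task payloads.

```python
import json
import urllib.request

# Assumed base URL for the serverless Inference API; verify against the docs.
API_BASE = "https://api-inference.huggingface.co/models"


def build_request(model_id: str, prompt: str, token: str) -> urllib.request.Request:
    """Build (but do not send) a POST request for a text prompt."""
    payload = json.dumps({"inputs": prompt}).encode("utf-8")
    return urllib.request.Request(
        f"{API_BASE}/{model_id}",
        data=payload,
        headers={
            "Authorization": f"Bearer {token}",  # personal access token
            "Content-Type": "application/json",
        },
        method="POST",
    )


def query(model_id: str, prompt: str, token: str):
    """Send the request and return the decoded JSON response."""
    request = build_request(model_id, prompt, token)
    with urllib.request.urlopen(request, timeout=60) as response:
        return json.load(response)


# Hypothetical usage (needs a real token and a real model id):
#   import os
#   print(query("some-org/some-model", "Hello", os.environ["HF_TOKEN"]))
```

Keeping `build_request` separate from `query` makes it possible to inspect the URL and headers without hitting the network or spending quota.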
Azure free tier is limited to 1000 tokens per minute, which works out to only about 1-2 prompts per minute, so it's not quite useful.
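Rough arithmetic behind that estimate, as a small sketch. The 500 and 1000 tokens-per-prompt figures are assumptions chosen to match the 1-2 prompts per minute above (counting prompt plus completion); real usage depends on the model and the prompts.

```python
def prompts_per_minute(tokens_per_minute: int, tokens_per_prompt: int) -> int:
    """Whole prompts that fit inside a tokens-per-minute quota."""
    if tokens_per_prompt <= 0:
        raise ValueError("tokens_per_prompt must be positive")
    return tokens_per_minute // tokens_per_prompt


QUOTA = 1000  # tokens per minute, the free tier limit quoted above

# Assumed request sizes (prompt + completion), for illustration only.
for size in (500, 1000):
    print(f"{size} tokens per prompt: {prompts_per_minute(QUOTA, size)} per minute")
```

Anything larger than 1000 tokens per request would not fit in a single minute's quota at all.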
We'll either need to host and expose a machine ourselves, or buy a hosted service. This would need some research on how to do it and how much it costs.