teacheraide / moodle-local_teacheraide

3 stars 0 forks source link

buy or host OpenAI compatible API that can serve more than 1000 tokens for the showcase instance #23

Closed ycfreeman closed 1 week ago

ycfreeman commented 2 months ago

Azure free tier is 1000 token per minute, which means it's almost just 1-2 prompt per minute, not quote useful

we'll either need to host and expose a machine, or buy a hosted service. would need some research on how and how much

ycfreeman commented 3 weeks ago

cloud hosting

self hosting

ycfreeman commented 1 week ago

@Rzz8 suggested Hugging Face Inference

and after a bit of dig through, it seem to actually relatively cheap and easy to use

paid for the PRO tier and it pretty much works without much issue so far

https://huggingface.co/docs/api-inference/en/getting-started

will lock down to this for now