instructlab / instructlab-bot

GitHub bot to assist with the taxonomy contribution workflow
Apache License 2.0
16 stars 18 forks source link

Add concurrent usage when using endpoints in the worker #185

Open russellb opened 7 months ago

russellb commented 7 months ago

When using a model endpoint for precheck and the sdg-svc for generate, we should be able to have multiple concurrent requests to these endpoints to help scale this out. Some thoughts on how to do this ...

So, we need some testing, some design decisions, some deployment automation

russellb commented 7 months ago

For the sdg-svc part

as of this AM testing - i feel comfortable that you could put about 10 concurrent requests