Controlling queries per second (QPS) for LLMs on the embedding or chat API

Hi,

I was trying to use the chat application with together.ai for creating embeddings and running LLM queries. While creating embeddings from the documents, I get:

INFO:httpx:HTTP Request: POST https://api.together.ai/v1/embeddings "HTTP/1.1 429 Too Many Requests"

because my QPS limit is 1 on the free tier.

Can anyone help me configure how many queries per second are sent to these APIs?
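
To make the request concrete, here is a rough client-side sketch of the kind of throttling I have in mind. It is not taken from chat-langchain or the together.ai client; `Throttle` and `throttled_map` are hypothetical names I made up for illustration, and `fn` stands in for whatever function actually sends one embedding or chat request. It only spaces calls at least 1/qps seconds apart and does not retry on a 429.

```python
import time
from typing import Callable, Iterable, List, TypeVar

T = TypeVar("T")
R = TypeVar("R")


class Throttle:
    """Hypothetical helper: spaces calls at least 1/qps seconds apart."""

    def __init__(self, qps: float,
                 clock: Callable[[], float] = time.monotonic,
                 sleep: Callable[[float], None] = time.sleep) -> None:
        if qps <= 0:
            raise ValueError("qps must be positive")
        self.min_interval = 1.0 / qps
        self._clock = clock
        self._sleep = sleep
        # Earliest time the next call may start; -inf lets the first call through.
        self._next_allowed = float("-inf")

    def wait(self) -> None:
        """Block until the next call is allowed, then reserve the next slot."""
        now = self._clock()
        if now < self._next_allowed:
            self._sleep(self._next_allowed - now)
            # Re-read the clock in case sleep overshot the target time.
            now = max(self._clock(), self._next_allowed)
        self._next_allowed = now + self.min_interval


def throttled_map(fn: Callable[[T], R], items: Iterable[T], qps: float) -> List[R]:
    """Call fn on each item sequentially, never faster than qps calls per second."""
    throttle = Throttle(qps)
    results: List[R] = []
    for item in items:
        throttle.wait()
        results.append(fn(item))
    return results
```

With a limit of 1 QPS, something like `throttled_map(embed_one, chunks, qps=1.0)` would send one request per second, where `embed_one` is a placeholder for the real embedding call. Is there a built-in setting that does the equivalent?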

Thanks

langchain-ai / chat-langchain #303