clstaudt opened 1 year ago
Would a different strategy be possible that catches a RateLimitError when it occurs and then adapts?
You could use `exponential_backoff` for that in OpenAI:

exponential_backoff: Retry requests with a random exponential backoff.
A short sleep is used when a rate limit error is hit,
then the request is retried. The sleep length is increased
each time errors are hit, up to 10 unsuccessful requests.
If True, overrides `delay_in_seconds`.
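A minimal sketch of enabling it, assuming a recent BERTopic/openai setup where an `openai.OpenAI` client instance is passed to the representation model (the API key and model name below are placeholders):

```python
import openai
from bertopic import BERTopic
from bertopic.representation import OpenAI

client = openai.OpenAI(api_key="sk-...")  # placeholder key
representation_model = OpenAI(
    client,
    model="gpt-3.5-turbo",     # placeholder model name
    chat=True,                 # use the chat completions endpoint
    exponential_backoff=True,  # sleep and retry on rate limit errors, up to 10 attempts
)
topic_model = BERTopic(representation_model=representation_model)
```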
If the RateLimitError is predictable (e.g., depending on dataset size), is it avoidable?
It depends on the number of clusters you created, since there is one API call per cluster to create a label. If you have many clusters and set a delay of a couple of seconds, the added time is roughly the delay in seconds times the number of clusters.
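To get a feel for the scale, a rough back-of-the-envelope (the cluster count and delay below are made-up numbers):

```python
n_clusters = 500       # hypothetical number of topics/clusters
delay_in_seconds = 2   # hypothetical fixed delay per API call

overhead = n_clusters * delay_in_seconds
print(f"~{overhead} s ({overhead / 60:.0f} min) spent purely sleeping")
```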
For my representation model, I am sometimes hitting a RateLimitError without knowing exactly what causes it (it seems to happen when training on larger datasets of > 100 000 documents). Setting a waiting time of even one second between API calls increases the training time several times over (I am not sure why).
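For comparison, the catch-and-adapt strategy asked about above only sleeps when a rate limit is actually hit, so unaffected calls run at full speed. A minimal sketch, assuming openai>=1.0 (where `RateLimitError` is a top-level exception); `make_request` and the retry limits are placeholders:

```python
import random
import time

import openai

def call_with_backoff(make_request, max_retries=10):
    """Retry `make_request` with jittered exponential backoff on rate limits."""
    delay = 1.0
    for attempt in range(max_retries):
        try:
            return make_request()
        except openai.RateLimitError:
            if attempt == max_retries - 1:
                raise  # give up after max_retries unsuccessful requests
            time.sleep(delay * (1 + random.random()))  # jittered sleep
            delay *= 2  # back off further after each failure
```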