Closed Tharsanan1 closed 2 hours ago
We are introducing AI APIs to apk. In the AI usecases its a common usecase to limit the requests based on the token counts(prompt, completion, total).
Add a feature to support ratelimiting AI requests based on the token count.
Adapter
1.2.0
No response
Feature added using this PR: https://github.com/wso2/apk/issues/2479
Problem
We are introducing AI APIs to apk. In the AI usecases its a common usecase to limit the requests based on the token counts(prompt, completion, total).
Solution
Add a feature to support ratelimiting AI requests based on the token count.
Affected Component
Adapter
Version
1.2.0
Implementation
No response
Related Issues
No response
Suggested Labels
No response