-
Currently, we are handling the 429 at the endpoint level (skipping the deployments). **However those TPM/RPM are defined at the deployment level**
We can have multiple deployments at a single insta…
-
**Is your feature request related to a problem? Please describe.**
A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]
**Describe the solution you'd like**…
-
using gpt4 the response speed top_k=5 is ~25s
On top of switching to gpt3.5 turbo, implementing streaming, and reducing the number of output tokens, check the following sources for more ideas:
…
-
gpt4 turbo支持上传文件进行分析,这个特性有计划加进插件中么?
-
Gemini pro gives you 60 queries per minute free at the moment, would be nice to have this supported.
Gemini pro's coding abilities are comparable to GPT4-turbo.
https://ai.google.dev/pricing
-
Hi,
This method looks like a legacy.
Unfortunately with my API key the "model": "gpt-3.5-turbo" does not work.
![Screenshot_20](https://github.com/lambiengcode/compose-chatgpt-kotlin-android-c…
-
In different scenarios, various models (text-davinci-003, text-davinci-002, gpt-3.5-turbo, gpt4) are used, are there any specific reasons? Since text-davinci-003 is much expensive than gpt-3.5-turbo,…
-
在哪儿看错误代码?我选了 GPT4-turbo,但是总结到一半就出错,没有具体错误代码,不知道什么原因。
_Originally posted by @EdisonWendy in https://github.com/Kenshin/simpread/discussions/5158#discussioncomment-8863172_
-
Hello, I find that in instruction-following and coding domain, get_gpt4_score function use gpt4_turbo_generate for scoring, but in math domain it use gpt4_generate. What is the reason for this setup?
-
### Describe the need of your request
Code Completion sounds fantastic, and I would love to test it. When do you plan to activate this awesome feature for Azure OpenAI or Custom OpenAI Providers?
##…