Closed sbmkvp closed 1 year ago
Is there a way to increase this?
cria
replicates the OpenAI API, so you can set the max_tokens
value into your POST request. Here is an example :
curl http://localhost:3000/v1/completions \
-H "Content-Type: application/json" \
-H "Authorization: Bearer $OPENAI_API_KEY" \
-d '{
"model": "gpt-3.5-turbo-instruct",
"prompt": "Say this is a test",
"max_tokens": 512,
"temperature": 0
}'
The response cuts off at around 256tokens.
Input
Output