epam / ai-dial-assistant

Model agnostic assistant/addon implementation for AI DIAL. It allows to use self-hosted OpenAI plugins as DIAL addons
https://epam-rail.com
Apache License 2.0
9 stars 2 forks source link

Document design of dial assistant context management #22

Open avavilau opened 10 months ago

avavilau commented 10 months ago
Oleksii-Klimov commented 9 months ago

Target state:

Usage scenario:

  1. Tokenize the completion request using the tokenization endpoint.
  2. If the request fits then go to step #⁠4.
  3. If the request does not fit, reduce the number of messages or message sizes then repeat step #⁠1.
  4. Send the request to the chat completion endpoint. The truncation endpoint aids in managing history size by allowing for the discarding of old messages.