Closed by ErezSha 4 months ago
To reduce latency, try moving the rule engine LLM to a smaller, faster model (like gpt-4o).
gpt-3.5 should be enough for the entry point node, and gpt-4o should be enough for the rule engine node.
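
A minimal sketch of how the per-node model split suggested above could be expressed as configuration. The node keys (`entry_point`, `rule_engine`), the `DEFAULT_MODEL` fallback, and the `model_for_node` helper are hypothetical names chosen for illustration, not taken from the project. The model strings are copied from this issue as written and may need to be replaced with the exact identifiers the provider expects.

```python
# Hypothetical per-node model configuration.
# Node names are assumptions; model strings are copied from the issue text.
NODE_MODELS = {
    "entry_point": "gpt-3.5",  # lighter model for the entry point node
    "rule_engine": "gpt-4o",   # faster model proposed for the rule engine node
}

# Assumption: nodes without an explicit entry fall back to this model.
DEFAULT_MODEL = "gpt-4o"


def model_for_node(node_name: str) -> str:
    """Return the model configured for a node, or the default if none is set."""
    return NODE_MODELS.get(node_name, DEFAULT_MODEL)
```

Keeping the mapping in one place would make it easy to swap the model for a single node and measure the latency change without touching the node code itself.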