Anthropic message normalization

AbanteAI / mentat

Mentat - The AI Coding Assistant

Apache License 2.0

2.42k stars 226 forks source link

System messages are converted to annotated user messages and repeated messages are concatenated.

This is sort of an ugly PR. I'd rather convert system messages to user messages natively and then have no difference between how gpt/claude are handled. I think a better philosophy than using system messages to inject information from the environment would be to tell the llm its talking to a bot that is relaying information from the end user and the environment and then have all system messages besides the parser prompt be user messages annotated with "Code message" and annotate the actual user messages as "End user message:".

But I kind of want to get this out quickly because it seems clear claude 3 is better than gpt-4 and I want to make it possible for other people to use it.

Pull Request Checklist

[x] Documentation has been updated, or this change doesn't require that

AbanteAI / mentat

Anthropic message normalization #537

Pull Request Checklist