Open karthink opened 3 months ago
Closely related suggestion, based on the reading of docs for the Mistral-7B-Instruct-v0.2 model:
In order to leverage instruction fine-tuning, your prompt should be surrounded by [INST] and [/INST] tokens.
A :process-directive
option to gptel-make-ollama
(and others) that would accept a function taking backend
, model
, and directive
arguments and return a processed directive. This is probably more user-friendly than a generic fn.
This might also be useful for Anthropic models, which like XML-style tagging.
According to Anthropic documentation:
That's how I work: system prompt defines the context, while the user prompt sends the specific directive and the text to be operated on.
This isn't clear-cut and is definitely up for debate, but I think we should at least have the option.
Originally posted by @jwr in https://github.com/karthink/gptel/issues/276#issuecomment-2035112175