Open mrseanryan opened 4 months ago
Approach:
use a larger context window
repeat and concatenate, with higher temperature
multiply programmatically (post-process)
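A rough sketch of the "repeat and concatenate" option, assuming the model returns one item per line. `generate` is a hypothetical callable standing in for whatever client is used (it is not a real library API), and the round count and temperature are illustrative values only:

```python
from typing import Callable, List


def repeat_and_concatenate(
    generate: Callable[[str, float], str],  # hypothetical: (prompt, temperature) -> text
    prompt: str,
    rounds: int = 3,
    temperature: float = 1.0,
) -> List[str]:
    """Call the model several times and merge the outputs into one list.

    A higher temperature makes each round more likely to differ, so
    concatenating the rounds gives more items than a single call.
    Duplicates are dropped case-insensitively, keeping first-seen order.
    """
    seen = set()
    merged: List[str] = []
    for _ in range(rounds):
        text = generate(prompt, temperature)
        for line in text.splitlines():
            item = line.strip()
            key = item.lower()
            if item and key not in seen:
                seen.add(key)
                merged.append(item)
    return merged
```

The de-duplication step is a design choice, not a requirement: without it, repeated rounds tend to return overlapping items, which inflates the output without adding anything new.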
LLM - Mistral 7B
Context size: a sliding 4K window - see https://huggingface.co/mistralai/Mistral-7B-v0.1/discussions/4
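To illustrate what a sliding window means for attention, here is a minimal sketch, assuming the "4K window" refers to causal sliding-window attention as discussed in the linked thread. The function name and the tiny sizes are for illustration only; a real model would use a window in the thousands:

```python
from typing import List


def sliding_window_mask(seq_len: int, window: int) -> List[List[bool]]:
    """Build a causal sliding-window attention mask.

    mask[i][j] is True when token i may attend to token j, i.e. when j is
    not in the future (j <= i) and lies within the last `window` positions
    (j > i - window).
    """
    return [
        [(i - window < j <= i) for j in range(seq_len)]
        for i in range(seq_len)
    ]


if __name__ == "__main__":
    # Small demo: 5 tokens, window of 3. "x" marks an allowed position.
    for row in sliding_window_mask(5, 3):
        print("".join("x" if allowed else "." for allowed in row))
```

With a window of 3, the last of 5 tokens can directly see only tokens 2, 3 and 4, so a long prompt does not get full direct attention over its earliest tokens.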
There are many flavours of Mistral-7B:
regular mistral-7B, quantized
cognitivecomputations/dolphin-2.6-mistral-7b-dpo
4 - instruct prompting, and newer:
a base version, suitable for FT
hermes-2.5 - mistral-7B
phi from Microsoft