strickvl / mlops-dot-systems

Quarto technical blog

posts/2024-06-15-isafpr-first-finetune #8


utterances-bot commented 2 weeks ago

Alex Strick van Linschoten - Finetuning my first LLM(s) for structured data extraction with axolotl

I finetuned my first LLM(s) for the task of extracting structured data from ISAF press releases. Initial tests suggest that it worked pretty well out of the box.

https://mlops.systems/posts/2024-06-15-isafpr-first-finetune.html

saeedesmaili commented 2 weeks ago

Thanks for the write-up. Interestingly, I struggle with Mistral: my finetuned model rarely outputs the EOS token and just continues generating, but I don't have this issue with finetuned llama3-8b models. I couldn't figure out what I'm doing wrong with Mistral that leads to this.
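One possible cause (an assumption, not a diagnosis of this specific setup) is that the EOS token never appears in the training labels, so the model is never taught to stop. A minimal sketch of appending it during preprocessing, using hypothetical token IDs:

```python
EOS_TOKEN_ID = 2  # Mistral's </s>; always verify against tokenizer.eos_token_id

def append_eos(input_ids, labels, eos_id=EOS_TOKEN_ID):
    """Ensure each training example ends with EOS in both the inputs and
    the labels, so the model actually learns to emit the stop token."""
    if not input_ids or input_ids[-1] != eos_id:
        input_ids = input_ids + [eos_id]
        labels = labels + [eos_id]
    return input_ids, labels

# Hypothetical example: prompt token masked with -100, two response tokens.
ids, labels = append_eos([5, 6, 7], [-100, 6, 7])
# ids    -> [5, 6, 7, 2]
# labels -> [-100, 6, 7, 2]
```

If EOS is present in the labels but generation still runs on, it's also worth checking that `eos_token_id` is set correctly in the generation config at inference time.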

The template-free data format is interesting, but I can't understand what it means to mask some parts of the training data so the model doesn't learn from them. My intuition is that we want our finetuned model to learn everything in the training dataset (including the system prompt, the instructions, and the response). I need to dig into the axolotl docs to read more on this.