Closed sorobedio closed 1 month ago
Hi @sorobedio,
You should find all further answers to your questions in the modelcard: https://huggingface.co/VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct We used neither qlora nor lora for training but spectrum fft on 25% of the layers.
Regards, David
thank you.
Hello,
I would like to ask a few questions regarding the evaluation of the Llama-Instruct model. Specifically:
use_chat_template
and use multi-turn few-shot prompts during evaluation?Additionally, I am working on reproducing the results of the
VAGOsolutions/Llama-3.1-SauerkrautLM-8b-Instruct
, which I believe is based on your work. Could you clarify if the work I mentioned here is fine-tuned using LoRA, or if it's utilizing a combination of Spectrum and QLoRA?Thank you for your time and assistance.