Open spring1915 opened 9 months ago
I used supervised-fine-tune.py for fine-tuning. Does this mean that in inference.py, should I use from llama_attn_replace_sft import replace_llama_attn in place of your currently specified from llama_attn_replace import replace_llama_attn?
I used supervised-fine-tune.py for fine-tuning. Does this mean that in inference.py, should I use from llama_attn_replace_sft import replace_llama_attn in place of your currently specified from llama_attn_replace import replace_llama_attn?