Nota-NetsPresso / shortened-llm

Compressed LLMs for Efficient Text Generation [ICLR'24 Workshop]

Loading using the AutoModelForCausalLM.from_pretrained method #17

Open sriyachakravarthy opened 2 weeks ago

sriyachakravarthy commented 2 weeks ago

Hi! Since we are saving the final fine-tuned model, can we load it directly with the AutoModelForCausalLM.from_pretrained method? If so, how? Or is loading with peft (one extra step on top of the above) still necessary?
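
For context, the two loading paths the question contrasts can be sketched as below. This is a minimal sketch and not the repository's confirmed procedure. The helper names checkpoint_kind and load_model are hypothetical. It assumes that a full (merged) checkpoint directory contains config.json, and that a PEFT adapter directory contains adapter_config.json with a base_model_name_or_path entry. Which of the two this repository's fine-tuning script actually writes is not stated here.

```python
import json
import os


def checkpoint_kind(path):
    """Heuristically classify a saved checkpoint directory.

    Assumption: PEFT adapters are saved with an adapter_config.json file,
    while full Hugging Face models are saved with a config.json file.
    """
    if os.path.isfile(os.path.join(path, "adapter_config.json")):
        return "adapter"
    if os.path.isfile(os.path.join(path, "config.json")):
        return "full"
    return "unknown"


def load_model(path):
    """Load a causal LM from `path`, using PEFT only if the directory holds an adapter."""
    # Imported lazily so checkpoint_kind can be used without these packages installed.
    from transformers import AutoModelForCausalLM

    kind = checkpoint_kind(path)
    if kind == "full":
        # Full weights on disk: a single from_pretrained call is enough.
        return AutoModelForCausalLM.from_pretrained(path)
    if kind == "adapter":
        # Adapter only: load the base model first, then attach the adapter (the extra step).
        from peft import PeftModel

        with open(os.path.join(path, "adapter_config.json")) as f:
            base_name = json.load(f)["base_model_name_or_path"]
        base = AutoModelForCausalLM.from_pretrained(base_name)
        return PeftModel.from_pretrained(base, path)
    raise ValueError(f"No config.json or adapter_config.json found in {path!r}")
```

Under these assumptions, the PEFT step is only needed when the saved directory holds adapter weights rather than a full model.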