I'm newly working on evaluation of RM. I ran the pipeline of RM traning and got one. I want to use it directly to get score for a single response, but I did not find appropriate API for it.
So I turned to use it as AutoModelForSequenceClassification, but I got
Some weights of LlamaForSequenceClassification were not initialized from the model checkpoint
at /workspace/OpenRLHF/ckpt/vallina_arena_7b_llama_1epoch and are newly initialized: ['score.weight']
Hello!
I'm newly working on evaluation of RM. I ran the pipeline of RM traning and got one. I want to use it directly to get score for a single response, but I did not find appropriate API for it.
So I turned to use it as
AutoModelForSequenceClassification
, but I gotWhat can I do?
Sincerely thanks for your help!