Closed Anindyadeep closed 8 months ago
Hello, the harness supports all decoder transformers
models. The error is because you didn't provide a --prompt
argument in you command, I'll update the code to show a more informative message.
HumanEvaSynthesize requires providing a prompt (see docs) type which says how the instruction should be formatted, you can use --prompt instruct
for example which concatenates the english instruction with function signature (see)
Also, are you trying to do greedy since you set temperature to 0? In that case I advice you do --do_sample False
instead, and you won't need a 100 samples per problem since we don't sample in greedy and generations will be the same, you can try this for evaluation on the first 10 problems in greedy:
accelerate launch main.py \
--model microsoft/phi-1_5 \
--tasks humanevalsynthesize-rust \
--limit 10 \
--do_sample False \
--prompt instruct \
--n_samples 1 \
--batch_size 1 \
--allow_code_execution \
--save_generations \
--trust_remote_code \
--max_length_generation 2048 \
Hello, the harness supports all decoder
transformers
models. The error is because you didn't provide a--prompt
argument in you command, I'll update the code to show a more informative message.HumanEvaSynthesize requires providing a prompt (see docs) type which says how the instruction should be formatted, you can use
--prompt instruct
for example which concatenates the english instruction with function signature (see)Also, are you trying to do greedy since you set temperature to 0? In that case I advice you do
--do_sample False
instead, and you won't need a 100 samples per problem since we don't sample in greedy and generations will be the same, you can try this for evaluation on the first 10 problems in greedy:accelerate launch main.py \ --model microsoft/phi-1_5 \ --tasks humanevalsynthesize-rust \ --limit 10 \ --do_sample False \ --prompt instruct \ --n_samples 1 \ --batch_size 1 \ --allow_code_execution \ --save_generations \ --trust_remote_code \ --max_length_generation 2048 \
Thanks and it worked. At the time I put the issue, I was new here. But yes, implementing those strategies worked. Thanks
System info
The command I used:
I got this error:
It seems like a
NotImplementedError
, so is this evaluation not available for Phi models?