Evaluate on APPS - Githubissues

deepseek-ai / DeepSeek-Coder

DeepSeek Coder: Let the Code Write Itself

https://coder.deepseek.com/

MIT License

6.01k stars 433 forks source link

Evaluate on APPS #87

Open Cheungki opened 6 months ago

Cheungki commented 6 months ago

Thx for your nice work!

As you mentioned here that deepseek-coder models show SOTA performance on APPS, while you might not report the exact scores or the code for evaluation on APPS benchmark. Will you share the evaluation scripts for APPS?