Closed: cassanof closed this 2 months ago
Should we stop uploading these as plain code and upload jsonl files instead? idk
hmm i think plain code is nice because you can easily inspect it; especially since there is a canonical solution to look at
As HumanEval gets saturated, we should start using LeetCode as a source dataset. The current LeetCode datasets were sourced from a random HF repo, and most of the problems are wrong. The one in this PR comes from the DeepSeekCoder paper. It's not squeaky clean either, but it's definitely much better.