bigcode-project / bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.
Apache License 2.0
825 stars 219 forks source link

[WIP] Shadereval tasks #173

Open Vipitis opened 11 months ago

Vipitis commented 11 months ago

revival of #97

This PR adds the Shadereval tasks, Shadereval looks at "creative coding" by comparing models on shadercode(glsl with Shadertoy.com syntax). There is (currently) two tasks:

The paper for the first task and dataset has been written but not not be published - it can be omitted from this PR since it was more of a prototype. The second task is nearly done as part of my Thesis, we hope to publish this eventually too. The implementation should be done by October. The dataset is essentially locked but will be renamed. This PR is our reference implementation.


Remaining tasks: