bigcode-project / bigcode-evaluation-harness

A framework for the evaluation of autoregressive code generation language models.
Apache License 2.0
702 stars 180 forks source link

[WIP] Shadereval tasks #173

Open Vipitis opened 6 months ago

Vipitis commented 6 months ago

revival of #97

This PR adds the Shadereval tasks, Shadereval looks at "creative coding" by comparing models on shadercode(glsl with Shadertoy.com syntax). There is (currently) two tasks:

The paper for the first task and dataset has been written but not yet published, the second task is still in development. The goal is to finish the implementation by the end of this year. Most changes will be made to the dataset or metric which are not part of this PR. The code to run generations included here should be about complete.


Remaining tasks: