bigcode-project / selfcodealign

[NeurIPS'24] SelfCodeAlign: Self-Alignment for Code Generation
https://arxiv.org/abs/2410.24198
Apache License 2.0
276 stars 20 forks source link

can you release the snippet->concept data? #7

Closed huu4ontocord closed 6 months ago

huu4ontocord commented 6 months ago

Hi - can you release on HF the concepts for each of the snippets?

Thank you!

UniverseFly commented 6 months ago

Yes, I think @cassanof can help with this.

cassanof commented 6 months ago

Hi - can you release on HF the concepts for each of the snippets?

Thank you!

Hey! We just released every single step of the dataset pipeline. Scroll at the bottom of the model card and you will find each dataset for each step: https://huggingface.co/bigcode/starcoder2-15b-instruct-v0.1