Open davanstrien opened 2 weeks ago
Great idea, thanks @davanstrien , will take a look!
Great idea, thanks @davanstrien , will take a look!
Awesome! Happy to review if useful :)
hey @davanstrien , I was thinking this would be a nice addition for you to add if you're up for it? if time is scarce, happy to pick it up. you seem to have figured out what needs doing anyhow, so makes sense for you to ship this.
It's great to see a library focusing on using local LLMs for synthetic data generation!
When doing the
push_to_hub
, it could be nice to add a template dataset card or some tags to the dataset so it's easier to find datasets created usingpromptwright
. To add tags, something like:would already help with discoverability. Example repo with these tags: https://huggingface.co/datasets/davanstrien/promptwright-test. You could also add a more expansive dataset card using a template in the future. There is a nice example from
distilabel
of this kind of template: https://github.com/argilla-io/distilabel/blob/main/src/distilabel/utils/card/distilabel_template.md