argilla-io / argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
https://docs.argilla.io
Apache License 2.0
3.91k stars 368 forks

Add documentation to provide examples of how to run prepare_for_training for a specific framework #2107

Closed. mgiardinelli closed this issue 1 year ago

mgiardinelli commented 1 year ago

Is your feature request related to a problem? Please describe.
It wasn't easy to work out which arguments to pass to the "prepare_for_training" method so that it generates training data for a specific framework (e.g. spaCy). I am submitting this at the request of David Berenstein (from a Slack post).

Describe the solution you'd like
Add documentation and examples for the prominent frameworks: spaCy, HuggingFace, etc.

Additional context
An end-to-end example for this, like the many others already provided, would be really valuable: upload data, label the data, prepare for training, train a model, upload the results, then rinse and repeat.
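
To make the "prepare for training" step concrete, here is a minimal sketch of the kind of conversion involved for spaCy. It is not Argilla's implementation and does not call the Argilla API. The record layout (a dict with "text" and "annotation" keys, where each annotation is a (label, start, end) character span) and the helper name to_spacy_examples are assumptions made for illustration. The output uses the (text, {"entities": [(start, end, label)]}) tuple layout commonly used for spaCy NER training data.

```python
from typing import Dict, List, Optional, Tuple

# Hypothetical stand-in for an annotated token-classification record.
# Assumption: entity spans are stored as (label, start_char, end_char).
Span = Tuple[str, int, int]
SpacyExample = Tuple[str, Dict[str, List[Tuple[int, int, str]]]]


def to_spacy_examples(records: List[dict]) -> List[SpacyExample]:
    """Convert annotated records into spaCy-style NER training tuples."""
    examples: List[SpacyExample] = []
    for record in records:
        spans: Optional[List[Span]] = record.get("annotation")
        if not spans:
            # Records without annotations cannot be used for training.
            continue
        # Reorder each span to (start, end, label) and sort by position.
        entities = sorted((start, end, label) for label, start, end in spans)
        examples.append((record["text"], {"entities": entities}))
    return examples


records = [
    {"text": "Paris is in France", "annotation": [("LOC", 12, 18), ("LOC", 0, 5)]},
    {"text": "Not labeled yet", "annotation": None},
]

examples = to_spacy_examples(records)
for text, annotations in examples:
    print(text, annotations)
```

The unlabeled record is skipped, so only annotated data ends up in the training set. In a real workflow the resulting tuples would still need to be turned into spaCy objects before training; that step is left out here.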

davidberenstein1957 commented 1 year ago

Thanks, @mgiardinelli. I will create a PR for this and would love to get your feedback on it :)

davidberenstein1957 commented 1 year ago

@mgiardinelli I added a PR with additional documentation you can check here.

Additionally, I noticed we did not implement a method for doing this with TextClassification, so I decided to create an issue for this. Do you feel this needs to be added and, even better, do you think you could handle this in a PR yourself? 🤓