argilla-io / argilla

Argilla is a collaboration tool for AI engineers and domain experts to build high-quality datasets
https://docs.argilla.io
Apache License 2.0
4.06k stars 382 forks source link

[FEATURE] Label breakdown in Feedback dataset stats #4637

Open nataliaElv opened 8 months ago

nataliaElv commented 8 months ago

Is your feature request related to a problem? Please describe. A clear and concise description of what the problem is. Ex. I'm always frustrated when [...]

Several users that have moved from DatasetforTextClassification to FeedbackDataset have expressed that they miss the breakdown of labels in the stats panel.

This doesn't have an easy correlation in the context of Feedback Datasets as one could have many types of questions with many labels, but we can think of a smart solution to see this breakdown in the UI.

Describe the solution you'd like A clear and concise description of what you want to happen.

Describe alternatives you've considered A clear and concise description of any alternative solutions or features you've considered.

Additional context Add any other context or screenshots about the feature request here.

dosubot[bot] commented 8 months ago

It seems like you've got this under control, if you want help or have specific questions, let me know what I can do for you!

#### About Dosu This response is meant to be useful and save you time. It is **not meant to be a precise solution**, but rather a starting point for your own research. Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant. If you want to continue the conversation, start your reply with @dosu-bot.
splevine commented 8 months ago

The ability to breakdown data by counts for responses and metadata.

For example, for label questions it would be great to have a breakdown of how many labels I have for each response.

For example, I have annotations for a dozen clients and I save it as metadata I would like to see how many sample I have for each client.

Future it would be great to get counts for multiple conditions. If client = x and breakdown of responses.

github-actions[bot] commented 5 months ago

This issue is stale because it has been open for 90 days with no activity.

github-actions[bot] commented 2 months ago

This issue is stale because it has been open for 90 days with no activity.