flairNLP / fabricator

[EMNLP 2023 Demo] fabricator - annotating and generating datasets with large language models.
Apache License 2.0
98 stars 12 forks source link

Sampling method that covers all labels #5

Closed HallerPatrick closed 1 year ago

HallerPatrick commented 1 year ago

Currently examples are randomly sampled. We do not guarantee, that all labels are covered. For text classification this is easy, but for Sequence Labeling tasks, aka multi-label multi-class, we have to apply some heuristics