Closed mabu-dev closed 2 years ago
Maybe of interest to @Narsil and @patrickvonplaten
Pretty cool idea! @mabu-dev do you know whether there is a research paper on it? For NLI zero-shot we have this paper: https://huggingface.co/facebook/bart-large-mnli#nli-based-zero-shot-text-classification
This issue has been automatically marked as stale because it has not had recent activity. If you think this still needs to be addressed please comment on this thread.
Please note that issues that do not follow the contributing guidelines are likely to be ignored.
Similar to the very helpful NLI-based zero-shot classification pipeline using a ModelForSequenceClassification, it would be great to have zero-shot on audio data.
Pass wav file(s) with candidate labels to the pipeline and get a prediction. Is this at all on the roadmap?