Open JohnnyRacer opened 9 months ago
Hi @JohnnyRacer and thanks for the details you put in the issue!
We talked about this internally and while we understand the use case we couldn't figure out a way to prioritise this work. I imagine writing a custom component would be a good workaround in the meantime, but I'm also labelling this issue as contributions wanted in case anybody want to give it a shot.
Hello, I think adding a keyword extractor with KeyBERT would be quite useful. The keywords extracted could be used for paraphrasing or summarizing with
logit_bias
to allow for more consistent word usage in those tasks. Alternatively the keywords can also be used for locating a set documents from a collection that matches the keywords or phrases (sparse keyword based retrieval) . I have created snippet below based on the usage section KeyBERT's README.MD and the example from NamedEntityExtractor below.