speakleash / speakleash-instruct-creator

Generate instructions datasets for the fine-tuning purposes.
4 stars 6 forks source link

Fix input field issues #35

Closed mglabska closed 6 months ago

mglabska commented 7 months ago

Add the following conditions for more restrictive filtering of the input scope: if nlp.vocab.__contains__(token.lemma_) and token.text.islower() and len(token.text) > 4