RasaHQ / rasa

💬 Open source machine learning framework to automate text- and voice-based conversations: NLU, dialogue management, connect to Slack, Facebook, and more - Create chatbots and voice assistants
https://rasa.com/docs/rasa/
Apache License 2.0
18.94k stars 4.64k forks source link

Add custom features to NER #4214

Closed jamesmf closed 4 years ago

jamesmf commented 5 years ago

Description of Problem: Right now custom entities can only use pos features from spacy and a handful of simple features. This seems to be in contrast to the flexibility and power of the other pipeline components which can take advantage of any combination of built-in and custom featurizers. Ideally, there would be a way to pass ner_features to the CRFEntityExtractor. In particular, this would let you train NER that used word/token vectors straight from spacy (or other pretrained models)

Overview of the Solution:

Examples (if relevant): The skeleton of this (both adding a spacy-based featurizer and making CRFEntityExtractor use ner_features) is implemented in this PR https://github.com/RasaHQ/rasa/pull/4187 Please let me know if this looks like a useful feature and if this PR is heading in the right direction.

Still necessary:

Definition of Done: