explosion / spaCy

💫 Industrial-strength Natural Language Processing (NLP) in Python
https://spacy.io
MIT License
30.15k stars 4.4k forks source link

Use of Custom Model withouth any inbuilt language model #3088

Closed softwareacharya closed 5 years ago

softwareacharya commented 5 years ago

Hi There,

Can someone please let me know below

Is it possible to have our own model without using any inbuilt language model for NER ? If yes, how can we do ? Please provide any relevant url to refer? Also, I have to match set of possible values (user names, user id, account names, account id) Is it possible to apply multiple match phraser on given text and find custom entity? How to achieve this ? My use case is basically rule based (multiple rules) + statistical model combination.

svlandeg commented 5 years ago

I haven't done something like this myself yet, but here are some links to the docs to get you started:

ines commented 5 years ago

Also, I have to match set of possible values (user names, user id, account names, account id) Is it possible to apply multiple match phraser on given text and find custom entity? How to achieve this ?

To addd to the comment above, you can also use the Matcher and PhraseMatcher for rule-based entity extraction. The Matcher lets you write abstract token-based rules, while the PhraseMatcher lets you match large terminology lists.

The upcoming version will also introduce the EntityRuler that makes rule-based named entity recoginition (and a combination of rule-based and statistical NER) simpler. You can read more about this here: #2513. Note that this component is currently only available for testing in the alpha version (installable as spacy-nightly).

lock[bot] commented 5 years ago

This thread has been automatically locked since there has not been any recent activity after it was closed. Please open a new issue for related bugs.