AlexPoint / OpenNlp

Open source NLP tools (sentence splitter, tokenizer, chunker, coref, NER, parse trees, etc.) in C#
MIT License
283 stars 100 forks source link

How to train own model for NER? #14

Closed UNICASTERSOL closed 3 years ago

UNICASTERSOL commented 7 years ago

Anyone knows how to train own NER model?

Sorry, the question should be:- Anyone knows how to train own NER model? I have tried to train my own but I have reached to the stage whereby it run out memory. Anyone know what should I configure for optimum for CUTOFF and Iteration please?

LucasWBritz commented 4 years ago

Did you got any solution @UNICASTERSOL ? I'm trying to get this working as well

MystaraCorvus commented 3 years ago

I would love to know more. Did you receive any information?

AlexPoint commented 3 years ago

I haven't trained any NER model so far. A good start would be to have a look at the original java project documentation. From what I see in the code, you have to use the MaximumEntropyNameFinder class, and more specifically the TrainModel method. The input is a training file which should have a specific tagging with examples of the entities to detect (see NameFinderEventReader class to see how this training file is used). Let me know if you manage to train your own model.