laito / cleartk

Automatically exported from code.google.com/p/cleartk

Mallet data alphabet keeps growing when using a CRF #408

Open GoogleCodeExporter opened 9 years ago

GoogleCodeExporter commented 9 years ago
What steps will reproduce the problem?

1. Train a Mallet CRF
2. Use it to tag new text.

What is the expected output?

The data alphabet of the transducer should not change once the training is over.

What do you see instead?

Each feature seen when tagging new documents is added to the alphabet, 
leaking memory in long-running tagging processes.
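
To illustrate the mechanism, here is a minimal, self-contained sketch. FeatureAlphabet is a hypothetical toy class, not Mallet's own Alphabet and not the attached patch. It only assumes that an alphabet maps feature names to indices and adds unseen names on lookup unless growth has been stopped. Mallet's Alphabet exposes a similar stopGrowth() switch, though whether the patch relies on it is not stated here.

```java
import java.util.HashMap;
import java.util.Map;

/** Toy stand-in for a feature alphabet (hypothetical, for illustration only). */
class FeatureAlphabet {
    private final Map<String, Integer> indices = new HashMap<>();
    private boolean growthStopped = false;

    /**
     * Returns the index of a feature. Unseen features are added while growth
     * is allowed; once growth is stopped they yield -1 and are not stored.
     */
    int lookupIndex(String feature) {
        Integer index = indices.get(feature);
        if (index != null) {
            return index;
        }
        if (growthStopped) {
            return -1; // unknown at tagging time: ignore instead of storing
        }
        int next = indices.size();
        indices.put(feature, next);
        return next;
    }

    /** Freezes the alphabet, e.g. once training is over. */
    void stopGrowth() {
        growthStopped = true;
    }

    int size() {
        return indices.size();
    }
}
```

In this sketch, calling stopGrowth() after training keeps size() constant no matter how many unseen features are looked up while tagging. Without that call, every new feature string would be retained in the map for the lifetime of the process.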

What version of the product are you using? On what operating system?

Version 1.4.1 and 2.0.0.

I attached a patch with a fix and a unit test.

Original issue reported on code.google.com by textjui...@gmail.com on 17 Aug 2014 at 3:31

Attachments:

GoogleCodeExporter commented 9 years ago

Original comment by steven.b...@gmail.com on 20 Aug 2014 at 9:43