ufal / korektor

Statistical spell- and (occasional) grammar-checker.
lindat.mff.cuni.cz/services/korektor
BSD 2-Clause "Simplified" License
17 stars 4 forks source link

Allow parametrizing Viterbi decoding. #7

Closed foxik closed 9 years ago

foxik commented 9 years ago

Allow parametrizing the Viterbi decoding algorithm.

foxik commented 9 years ago

Done in:

foxik commented 9 years ago

Using viberbi_beam_size of 15 and viterbi_stage_pruning of 2.5 decreases Korektor runtime by a factor of 2 (diacritization) and 3 (spellchecking) in korektor-czech-130202 model, while having no effect on the test data results. On other data it can have some effect, but it will be probably minor.

On the other hand, decreasing viberbi_order to 1 has a larger effect on result accuracy, so we are not using it for korektor-czech-130202. But it will be useful if we try models with higher order than 3.