Open AlexeyKupershtokh opened 12 years ago
Using phpmorphy 0.3.7, the same debug (in the getWordDescriptor() method) and the following code:
$morphy = new phpMorphy('dicts/ru_ru_aot_withjo', 'ru_RU');
$morphy->getPredictByDatabaseMorphier()->getWordDescriptor('АБУШКАН');
Produces the following annots:
+
Array ( [count] => 1 [offset] => 169664 [cplen] => 0 [plen] => 0 [flen] => 2 [common_ancode] => 667 [forms_count] => 75 [packed_forms_count] => 46 [affixes_size] => 486 [form_no] => 34 [pos_id] => 21 [freq] => 12 [base_prefix] => [base_suffix] => ТЬ )
Array ( [count] => 2 [offset] => 1383424 [cplen] => 0 [plen] => 0 [flen] => 0 [common_ancode] => 663 [forms_count] => 35 [packed_forms_count] => 21 [affixes_size] => 172 [form_no] => 13 [pos_id] => 1 [freq] => 6 [base_prefix] => [base_suffix] => ЫЙ )
Array ( [count] => 2 [offset] => 8832 [cplen] => 0 [plen] => 0 [flen] => 0 [common_ancode] => 691 [forms_count] => 36 [packed_forms_count] => 29 [affixes_size] => 338 [form_no] => 0 [pos_id] => 0 [freq] => 6 [base_prefix] => [base_suffix] => )
-
I've modified class phpMorphy_Morphier_MorphierAbstract this way:
Then I use standard aot dictionary from sourceforge and the following morphier:
And this shows me the following debug info:
I think there are too many duplicates. Is this behavior wrong?