morfologik / morfologik-stemming

Tools for finite state automata construction and dictionary-based morphological dictionaries. Includes Polish stemming dictionary.
BSD 3-Clause "New" or "Revised" License

Dictionary format #50

Closed: Mility closed this issue 9 years ago

Mility commented 9 years ago

Hi all,

I want to know the format of morfologik's dictionaries. I don't think it is just lines like the ones below:

Abe+I
Abel+J
Abelard+F
Abelson+E
Aberconwy+E

I would like more detail about the format and how to use it. Thanks in advance.

dweiss commented 9 years ago

The source code is the best documentation we have. Please do not open issues for this type of communication; use the mailing list (morfologik-devel@lists.sourceforge.net) instead.

https://github.com/morfologik/morfologik-stemming/blob/master/morfologik-stemming/src/main/java/morfologik/stemming/DictionaryLookup.java#L281

dweiss commented 9 years ago

The encoder actually has some more information: https://github.com/morfologik/morfologik-stemming/blob/master/morfologik-tools/src/main/java/morfologik/tools/SequenceEncoders.java
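
As a rough illustration of the kind of transformation such an encoder performs, here is a minimal Java sketch of a suffix-trimming scheme. It assumes that the first character of the encoded value stores, as an offset from 'A', how many trailing characters to drop from the inflected form, and that the rest is the ending to append. The class and method names (`SuffixTrimSketch`, `encode`, `decode`) are hypothetical, and the sketch works on characters rather than bytes; `SequenceEncoders.java` remains the reference for the actual behaviour.

```java
// Illustrative sketch only: hypothetical names, not the library's code.
final class SuffixTrimSketch {

    /** Encodes the base form relative to the inflected form. */
    static String encode(String inflected, String base) {
        // Length of the prefix shared by both forms.
        int shared = 0;
        int max = Math.min(inflected.length(), base.length());
        while (shared < max && inflected.charAt(shared) == base.charAt(shared)) {
            shared++;
        }
        // Number of trailing characters to drop from the inflected form.
        int trim = inflected.length() - shared;
        // Assumption: the trim count is stored as a single character offset from 'A'.
        return (char) ('A' + trim) + base.substring(shared);
    }

    /** Rebuilds the base form from the inflected form and the encoded value. */
    static String decode(String inflected, String encoded) {
        int trim = encoded.charAt(0) - 'A';
        return inflected.substring(0, inflected.length() - trim) + encoded.substring(1);
    }

    public static void main(String[] args) {
        String encoded = encode("mice", "mouse");
        System.out.println(encoded);                  // prints "Douse"
        System.out.println(decode("mice", encoded));  // prints "mouse"
    }
}
```

In this sketch "mice" and "mouse" share only the prefix "m", so three characters are trimmed ('A' + 3 = 'D') and "ouse" is appended.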

Mility commented 9 years ago

Thank you very much.